At Lilly, we unite caring with discovery to make life better for people around the world. We are a global healthcare leader headquartered in Indianapolis, Indiana. Our employees around the world work to discover and bring life-changing medicines to those who need them, improve the understanding and management of disease, and give back to our communities through philanthropy and volunteerism. We give our best effort to our work, and we put people first. We’re looking for people who are determined to make life better for people around the world.
Job Description: Linux Server Platform Operations Engineer
Job Details
Job Posting Title: Linux Server Platform Operations Engineer
Location: Hyderabad, India
Job Profile: P3
Job Type: Full-Time, Permanent
Experience Level: 8+ years
Job Summary:
We are seeking a highly skilled and experienced L3 Linux Server Platform Operations Engineer to oversee the operations, management, and support of our enterprise Linux Server environment. The ideal candidate will be responsible for ensuring the stability, reliability, and performance of the Linux server infrastructure while leading a team of L1 and L2 Operations Engineers. This position requires in-depth technical expertise, leadership skills, and a proactive approach to problem-solving and operational excellence.
How You’ll Succeed
Be Bold - You will drive Infrastructure Operations toward never having to fix the same problem twice, through adoption of AI OPS, Event Driven Automation, and robust Observability.
Be Fast - You will accelerate initiatives in areas such as Infrastructure AI OPS automation, cloud IaaS management, and cloud infrastructure as code to enable critical business projects.
Be Proactive - You will have groundbreaking opportunities to transform our operations processes using proactive, predictive, and automated AI and Observability capabilities.
Be Your Best - You will bring high learning agility and infrastructure operations and engineering skills to help us enable the Lilly Technology strategy, identify tech opportunities, and accelerate our AI OPS journey.
Key Responsibilities:
Linux Server and Cluster Management:
Advanced expertise in Red Hat Enterprise Linux (RHEL), Ubuntu, Amazon Linux, and SUSE Linux Enterprise Server (SLES).
Experience in RHEL KVM and RHEL OpenShift, with a good understanding of containerized solutions such as Docker and Kubernetes, is highly desirable.
Experience managing General Parallel File System (GPFS) clusters and Pacemaker clusters for high availability.
Strong Linux network management and troubleshooting skills, including TCP/IP, DNS, DHCP, firewall configuration, and VLANs.
VMware vSphere management, including VMware-hosted Linux servers, and physical server (HP/Dell/IBM) management.
Storage management for Linux servers, including LVM, XFS, NFS, and NAS.
Automation and Scripting:
Proficiency in writing Ansible playbooks for automation of system configurations and deployments.
Extensive experience with the Ansible Automation Platform, including Ansible Tower and AWX for centralized automation management.
Strong skills in Bash scripting for system management and automation tasks, including cron jobs and shell/Python scripting.
Disaster Recovery (DR) and Zerto Tool:
Experience designing and implementing Disaster Recovery (DR) strategies, including backup and restore procedures.
Hands-on experience with Zerto for disaster recovery and business continuity, including Zerto Virtual Manager (ZVM) and Zerto Cloud Appliance (ZCA).
CI/CD and Cloud:
Experience building and managing Continuous Integration/Continuous Deployment (CI/CD) pipelines using GitHub Actions, Jenkins, or similar tools.
Advanced knowledge of Amazon Web Services (AWS) and Microsoft Azure cloud infrastructure and services, including EC2, S3, VPC, Azure Virtual Machines, and Azure Blob Storage.
Identity and Access Management:
Expertise in Centrify and Lightweight Directory Access Protocol (LDAP) integration for authentication and authorization.
Strong understanding of Red Hat Satellite for patching, system management, and content lifecycle management.
SOX Security Audit:
Experience in SOX (Sarbanes-Oxley) security audit steps for Linux servers, including access control, change management, data backup, and security monitoring.
Proficiency in implementing and maintaining SOX compliance for Linux environments, ensuring adherence to regulatory requirements.
24x7 Availability and Agile:
Availability for 24x7 support for mission-critical systems, including on-call rotations and incident management.
Experience working in Agile environments, including sprint planning, daily stand-ups, and retrospectives.
Documentation and Training:
Proficiency in creating technical documentation and Standard Operating Procedures (SOPs).
Ability to mentor and train junior team members, including knowledge transfer sessions and technical workshops.
Leadership Skills
Lead, mentor, and guide a team of L1 and L2 engineers, fostering a culture of continuous learning and operational excellence.
Coordinate daily operational tasks, incident management, and problem resolution with the team.
Act as the primary escalation point for high-priority incidents and outages.
Security & Compliance
Ensure adherence to organizational security policies and regulatory compliance requirements.
Maintain and troubleshoot OS-related configurations such as security updates, antivirus solutions, and vulnerability remediation on Linux Server systems.
Assist in periodic SOX and internal audits of server environments and configurations to identify and mitigate risks.
Stakeholder Collaboration
Work closely with business units, application teams, and other IT departments to address requirements, dependencies, and operational needs.
Communicate effectively with stakeholders, providing updates on operational performance, projects, and incidents.
Incident & Change Management
Manage incident resolution and root cause analysis for critical server issues.
Oversee change management processes, ensuring minimal impact to production environments.
Required Skills & Qualifications:
Technical Expertise
8+ years of experience managing enterprise-scale Linux Server environments.
Strong expertise in Red Hat Enterprise Linux (RHEL), Ubuntu, Amazon Linux, and SUSE Linux Enterprise Server (SLES).
In-depth knowledge of Red Hat Satellite patch management, automation (PowerShell scripting), configuration management, Ansible, and backup solutions.
Proficiency in disaster recovery strategies and high-availability configurations (e.g., clustering).
Familiarity with cloud technologies such as Azure or AWS (especially hybrid cloud environments).
Soft Skills
Strong analytical and troubleshooting skills, with the ability to handle complex technical challenges.
Proven leadership and team management experience, with excellent interpersonal and communication skills.
Ability to prioritize, multitask, and work effectively under pressure in a fast-paced environment.
Strong problem-solving and leadership abilities.
Effective communication and collaboration skills.
Education & Certifications
Bachelor’s degree in Computer Science, Information Technology, or a related field (or equivalent experience).
Certifications: Red Hat Certified Engineer (RHCE), AWS Solutions Architect, or Azure Administrator.
Desirable Skills:
Experience in automating administrative tasks using tools such as Ansible, Terraform, or other DevOps tools.
Knowledge of ITSM tools (e.g., ServiceNow) and experience in ITIL-based processes.
Experience working in a regulated environment (e.g., healthcare, finance, pharmaceuticals).
Additional Information:
Role located in Hyderabad (relocation required)
Availability to work flexible hours may be required. This team supports continuous operations across two shifts; therefore, this role will require non-standard work hours and some work on weekends and holidays. Appropriate adjustments in benefits will be provided for employees working non-standard hours where applicable.
Lilly is dedicated to helping individuals with disabilities to actively engage in the workforce, ensuring equal opportunities when vying for positions. If you require accommodation to submit a resume for a position at Lilly, please complete the accommodation request form (https://careers.lilly.com/us/en/workplace-accommodation) for further assistance. Please note this is for individuals to request an accommodation as part of the application process and any other correspondence will not receive a response.
Lilly does not discriminate on the basis of age, race, color, religion, gender, sexual orientation, gender identity, gender expression, national origin, protected veteran status, disability or any other legally protected status.
#WeAreLilly