الوصف الوظيفي
Location: Pune / Hybrid (Remote + Office)We are seeking an experienced Principal DevOps Engineer to lead and elevate our cloud infrastructure, DevOps practices, and site reliability engineering initiatives. The ideal candidate will have a proven track record of managing cloud platforms, container orchestration, modern CI/CD workflows, and automation solutions. This role requires a balance of technical depth, leadership skills, and collaboration across teams. The position is designed for those who thrive in a hybrid work environment and are excited to tackle complex infrastructure challenges.Key Responsibilities: • Cloud Infrastructure: Architect, deploy, and maintain scalable cloud-based infrastructures across Azure, AWS, or GCP. • Kubernetes & Containers: Design, deploy, and manage Kubernetes clusters and containerized environments with an emphasis on high availability, fault tolerance, and scalability. • CI/CD Pipeline Management: Lead the design and implementation of modern CI/CD pipelines using platforms like Tekton, ArgoCD, Jenkins, or similar. • Infrastructure as Code (IaC): Develop and manage infrastructure automation using tools such as Terraform, Ansible, and other IaC solutions to ensure consistency, scalability, and agility. • GenAI Integration: Extend Generative AI (GenAI) solutions to optimize cloud infrastructure and DevOps workflows, driving continuous improvements. • Performance & Cost Optimization: Continuously optimize cloud infrastructure for performance, cost efficiency, and security, ensuring sustainable scalability. • Troubleshooting: Troubleshoot and resolve complex infrastructure and application issues, minimizing downtime and ensuring high system reliability. • Scripting & Automation: Develop and maintain scripts and automation tools in Bash/Shell, Python, or Go to streamline operations. • Networking & Security: Apply networking fundamentals (DNS, firewalls, load balancing) to improve system architecture and ensure security compliance. • Collaboration & Mentorship: Collaborate with cross-functional teams to enhance developer productivity, system reliability, and operational efficiency. Mentor and guide team members, promoting best practices in DevOps and cloud operations. • Compliance & Monitoring: Ensure infrastructure and systems adhere to security, compliance, and monitoring standards, with a focus on proactive management.Qualifications: Experience:• 8+ years in DevOps, Cloud Engineering, or Site Reliability Engineering (SRE). • Proven experience in deploying and managing large-scale cloud infrastructure on Azure, AWS, or GCP. Technical Skills: • Kubernetes and containerization technologies (Docker, Helm, etc.) expertise. • Strong experience with modern CI/CD tools like Tekton, ArgoCD, Jenkins, or similar platforms. • Proficiency in scripting languages such as Bash/Shell and programming languages like Python or Go. • Hands-on experience with Terraform and Ansible for infrastructure automation and configuration management. • Solid understanding of Linux administration and networking principles (DNS, firewalls, load balancing, etc.). Other Skills: • Expertise in cloud performance optimization, cost control, and security best practices. • Ability to leverage Generative AI (GenAI) solutions to drive cloud and DevOps optimizations. • Excellent problem-solving and troubleshooting abilities, particularly in large-scale and distributed systems. Soft Skills: • Exceptional communication and collaboration skills, able to work across teams and geographies. • Strong leadership abilities with a focus on mentoring and fostering a culture of continuous improvement. • Adaptable to a hybrid work environment, balancing remote and office-based work. • Passionate about learning and staying up to date with emerging technologies in the DevOps and cloud ecosystem. Preferred Qualifications: • Certification in AWS, Azure, or GCP (e.g., AWS Certified DevOps Engineer, Azure Solutions Architect). • Experience with observability tools like Prometheus, Grafana, ELK stack, or Datadog. • Familiarity with microservices and service mesh technologies such as Istio or Linkerd. • Proven experience in leading large-scale migration or cloud transformation projects.Work Culture: • Ready to work in a hybrid environment that offers both remote flexibility and opportunities for in-office collaboration. • We support a culture of innovation, teamwork, and continuous learning, encouraging personal and professional growth within the organization. What We Offer: • Competitive salary and comprehensive benefits package. • Opportunities for career growth in a fast-paced, dynamic environment. • A collaborative team culture that values diversity and innovation