We are seeking an experienced Site Reliability Engineer II with a strong focus on CI/CD implementation and management in AWS environments. This mid-senior level position requires demonstrated expertise in building and maintaining reliable, secure, and scalable infrastructure. The ideal candidate will have a proven track record of implementing self-hosted solutions, containerization strategies, and managing offline server environments. Key Responsibilities:
Lead the design and implementation of CI/CD pipelines using AWS services and industry-standard tools
Architect, implement, and optimize self-hosted infrastructure components
Drive security initiatives and best practices across infrastructure and deployment pipelines
Mentor junior engineers on infrastructure and deployment best practices
Design and implement disaster recovery and high availability solutions
Lead technical discussions and architectural decisions for infrastructure improvements
Manage and optimize container orchestration platforms and Docker-based workflows
Implement and maintain offline development and deployment capabilities
Drive automation initiatives across the infrastructure
Participate in on-call rotation and lead incident response efforts
Required Skills and Qualifications:
Bachelor's degree in Computer Science, Engineering, or related field
2+ years of experience as an SRE with a focus on CI/CD and infrastructure
Strong Linux administration skills with proven production experience
Demonstrated experience with AWS services (ECS, ECR, EC2, RDS)
Proven track record of implementing and managing self-hosted CI/CD solutions
Strong security background with experience in implementing security controls
Hands-on experience with infrastructure as code (Terraform, CloudFormation)