https://bayt.page.link/cfh6nh1G1G97YAPf8
Create a job alert for similar positions

Job Description

The Senior Site Reliability Engineer plays a crucial role within a small team, ensuring our critical services are secure, reliable, cost-effective, performant, and operationally excellent. This position demands a versatile professional who can contribute across development, system operations, resiliency testing, security hardening, and performance engineering. The ideal candidate is comfortable tackling new engineering challenges, conceptualising solutions, and implementing designs collaboratively. This role is pivotal in guiding our organisation towards modern application and infrastructure management practices while fostering the team's growth and skills development.


Key responsibilities include:


  • Addressing the most complex problems impacting the team's products
  • Developing innovative tools and processes to solve high-level challenges
  • Advocating for and modelling best practices, particularly for junior team members
  • Building trust and relationships with product development teams
  • Collaborating with development teams to diagnose and resolve systems issues
  • Mentoring junior team members in their SRE journey
  • Demonstrating deep knowledge across the team's product portfolio
  • Ensuring consistent process implementation across multiple applications

Minimum Qualifications:


  • Minimum of 6 years of relevant infrastructure development and software support experience
  • Experience architecting cloud-based solutions on AWS
  • Proficiency in managing cloud infrastructure on AWS
  • Strong familiarity with Linux operating systems
  • Proficiency in scripting languages like Ruby, Python, or Bash
  • Experience with Terraform

Nice to Have:


  • Experience with pipeline processes and implementations (e.g., Jenkins and Groovy)
  • Solid understanding of SDLC and Agile methodologies
  • Familiarity with cloud computing concepts, particularly AWS
  • Broad understanding of diverse infrastructure platforms and concepts
  • Versatility in troubleshooting various hosting technologies (web servers, Java platforms, OS, networks, virtualisation, databases)
  • Knowledge of general networking concepts (CDN, WAF, DNS, PKI)
  • Understanding of security policies and implementation
  • Familiarity with backup and disaster recovery concepts
  • Experience in a production environment supporting mission-critical applications
  • Knowledge of standard production practices, including change management

This role requires a proactive and adaptable individual who can thrive in a dynamic environment and drive innovation, service reliability, and performance excellence.



Job Details

Job Location
Bengaluru India
Company Industry
Other Business Support Services
Company Type
Unspecified
Employment Type
Unspecified
Monthly Salary Range
Unspecified
Number of Vacancies
Unspecified
You have reached your limit of 15 Job Alerts. To create a new Job Alert, delete one of your existing Job Alerts first.
Similar jobs alert created successfully. You can manage alerts in settings.
Similar jobs alert disabled successfully. You can manage alerts in settings.