https://bayt.page.link/utfGK5nLme8Xjnmd8

SRE / Observability Engineer

Today 2025/07/09
Other Business Support Services

Job Description

Company Brief


Company Name: SoHo Dragon (https://sohodragon.nyc/)


SoHo Dragon, Ahmedabad is a growing company that is always on the lookout for new, energized talent to join our team. We deliver only the highest standard of service to our customers, and therefore we hire only professionals who are great all-rounders. To learn more about SoHo Dragon, see the company website listed above.


The SoHo blog: posts from our SoHo MVPs


Tom Daly – Branding and design | Peter Ward – Microsoft Teams | Anna Jhaveri – Power Apps


Title: SRE / Observability Engineer


Location: Ahmedabad, Pune, Bangalore, Noida, Gurgaon, Hyderabad, Nagpur, Mumbai, Chennai, Goa


Timings: General shift 


Job Description:


We are seeking a highly skilled SRE / Observability Engineer with a strong background in observability solutions, using monitoring tools (ELK, APM tools) and visualization tools (such as Grafana), to join our SRE CoE team. The ideal candidate will collaborate with different teams to set up and maintain monitoring and visualization tools, ensuring effective use of technology, tools, and processes to improve operational efficiency.


Key Skills:


  • Grafana visualization solutions
  • Monitoring / observability tools - Dynatrace, ELK, etc.
  • Platform / cloud observability - OpenShift Prometheus / Azure Cloud, etc.
  • Automation skills - API integration, scripting, etc.

Key Responsibilities:


  • Collaborate with various infrastructure, application, platform, and cloud teams on observability solutions.
  • Implement monitoring solutions using APM tools and Grafana for visualization, including setup, configuration, and development of monitoring and alerting solutions.
  • Manage the Grafana platform with team-specific dashboards covering various KPIs and data sources, enable alerts, and establish SLOs.
  • Troubleshoot and resolve issues related to observability solutions, including gaps and challenges, and address fixes as part of production incidents.
  • Apply technical expertise in analyzing infrastructure systems, services, and technologies for monitoring, alerting, and incident response needs.
  • Work with application, platform, and infrastructure services in resilient, scalable, and highly available environments.
  • Collaborate with app and services teams/SMEs to integrate monitoring solutions through automation: APIs, webhooks, and CI/CD deployments.
  • Document system configurations, standard operating procedures, and best practices.
  • Keep up with the latest trends in enterprise technologies, platforms, automation, and AI-based solutions.

Qualifications:


  • Proven experience in SRE with a focus on infrastructure, applications, containerized platforms, and cloud-based solutions.
  • Strong experience in monitoring solutions, particularly Prometheus and Grafana.
  • Familiarity with CI/CD pipelines and DevOps practices.
  • Excellent problem-solving and analytical skills.
  • Strong communication and collaboration abilities.
  • Relevant certifications in SRE, Automation and/or OpenShift/Kubernetes are a plus.
