Job Description
Do you ever wonder what happens inside the cloud?DigitalOcean (NYSE: DOCN) simplifies cloud computing so builders can spend more time creating software that changes the world. With our mission-critical infrastructure and fully managed offerings, DigitalOcean enables startups and small and medium-sized businesses (SMBs) to rapidly deploy and scale modern applications. As a remote-first organization, our employees, like our customers, are based around the world.
We are seeking a skilled professional to design, develop, and maintain a robust internal Kubernetes platform.The ideal candidate will collaborate with development teams to ensure seamless integration, develop automation tools, and maintain comprehensive documentation. Monitor system performance, troubleshooting issues, and optimizing resources, as well as implementing observability solutions for insights into platform behavior. Providing technical guidance to teams and staying updated on industry trends to foster continuous improvement. Availability for on-call support during unexpected incidents is required.
What You’ll Be Doing:- Design, develop, and maintain a robust, scalable, and secure internal Kubernetes platform that supports the deployment, scaling, and management of containerized applications.
- Collaborate closely with development teams to understand their requirements and ensure seamless integration of the Kubernetes platform into their workflows.
- Develop and maintain automation scripts, tools, and documentation to streamline deployment and management processes.
- Ensure platform stability and performance by proactively monitoring system metrics, troubleshooting issues, and optimizing resources.
- Implement observability solutions, such as monitoring, logging, and tracing tools, to gain valuable insights into platform performance and application behavior.
- Provide technical guidance and support to development teams, helping them adopt the Kubernetes platform and improve their application deployment and management capabilities.
- Stay current with industry trends, best practices, and emerging technologies related to Kubernetes, containerization, and observability to drive continuous improvement and innovation within the platform.
- You will be on call at times, and expected to handle unexpected incidents.
What We’ll Expect From You:- At least 5 years of experience in platform engineering, infrastructure automation, SRE, or a similar role.
- Understanding of Kubernetes architecture, concepts, and components, with hands-on experience in building and managing Kubernetes platforms in production environments.
- Experienced with Continuous Integration/Continuous Delivery principles (examples: Github Actions, Concourse).
- Solid experience with containerization technologies such as Docker or ContainerD.
- Strong foundation of Linux administration fundamentals.
- Familiarity with observability best practices and tools, such as Prometheus, Grafana, or ELK Stack.
- Strong problem-solving skills and the ability to thrive in a fast-paced, collaborative environment.
- Excellent communication and interpersonal skills, with the ability to convey complex technical concepts to diverse audiences.
- Technical Writing (Documentation, process documents, proposals).
- Experience with Go programming language not mandatory but a nice to have skill.
- Kubernetes certifications are desirable (e.g. Certified Kubernetes Administrator (CKA) or Certified Kubernetes Application Developer (CKAD)).
- Knowledge of the DigitalOcean cloud platform (e.g., droplets, spaces, networking) and infrastructure-as-code tools (e.g., ansible, chef) is advantageous.
Why You’ll Like Working for DigitalOcean:- We are proud to work here. You’ll be a part of a cutting-edge technology company with an upward trajectory, who are proud to simplify cloud computing so builders can spend more time creating software that changes the world. As a member of the team, you will be a Shark who thinks big, bold, and scrappy, like an owner with a bias for action and a powerful sense of responsibility for customers, products, employees, and decisions.
- We prioritize career development. At DO, you’ll do the best work of your career. You will work with some of the smartest and most interesting people in the industry. We are a high-performance organization that will always challenge you to think big. Our organizational development team will provide you with resources to ensure you keep growing. We provide employees with reimbursement for relevant conferences, training, and education. All employees have access to LinkedIn Learning's 10,000+ courses to support their continued growth and development.
- We care about your well-being. Regardless of your location, we will provide you with a competitive array of benefits to support your overall well-being, from one-time work from home stipend to wellness allowance to flexible time off policy, to name a few. While the philosophy around our benefits is the same worldwide, specific benefits may vary based on local regulations and preferences.
- We reward our employees. The salary range for this position is based on market data, relevant years of experience, and skills. You may qualify for a bonus in addition to base salary; bonus amounts are determined based on company and individual performance. We also provide equity compensation to eligible employees, including equity grants upon hire and the option to participate in our Employee Stock Purchase Program.
- We value diversity and inclusion. We are an equal-opportunity employer, and recognize that diversity of thought and background builds stronger teams and products to serve our customers. We approach diversity and inclusion seriously and thoughtfully. We do not discriminate on the basis of race, religion, color, ancestry, national origin, caste, sex, sexual orientation, gender, gender identity or expression, age, disability, medical condition, pregnancy, genetic makeup, marital status, or military service.
*This role is located in Hyderabad, India
#LI-Hybrid