https://bayt.page.link/xkJZVVqwV38GkX8X9
أنشئ تنبيهًا وظيفيًا للوظائف المشابهة

الوصف الوظيفي

Introduction
Software Developers at IBM are the backbone of our strategic initiatives to design, code, test, and provide industry-leading solutions that make the world run today. At IBM, you will use the latest software development tools, techniques and approaches and work with leading minds in the industry to build solutions you can be proud of.

Are you passionate about technology? Do you love building new things? Do you want to develop the future of IBM’s Cloud offerings? If you answered YES, then we have the right opportunity for you!


The shift toward the consumption of IT as a service, i.e., the cloud, is one of the most important changes to happen to our industry in decades. At IBM, we are driven to shift our technology to an as-a-service model and to help our clients transform themselves to take full advantage of the cloud. With industry leadership in analytics, security, commerce, and cognitive computing and with unmatched hardware and software design and enterprise reach, no other company is as well positioned to address the full opportunity of cloud computing.


We are looking for a dynamic Site Reliability Engineer to join our Cloud IaaS Team in Bangalore, India, who is responsive to market needs, to deliver value to our clients in a fast-changing cloud landscape. The SRE team dedicated to ensuring that the IBM Cloud is at the forefront of cloud technology, from data center design, Storage & Network architecture and compute clusters to flexible infrastructure services. We are building IBM’s next generation cloud platform to deliver performance and predictability for our customers’ most demanding workloads, at global scale and with leadership efficiency, resiliency and security. It is an exciting time, and as a team we are driven by this incredible opportunity to thrill our clients.

Your Role and Responsibilities
In this Site Reliability Engineer role, you will work closely with several Data Centers, the entire Cloud organization and IBM vendors to support, maintain and continously improve the IBM cloud infrastructure. You will focus on the following key responsibilities:


  • Design & implement automation/infrastructure solutions for IBM Cloud products and services.
  • Partner with other SRE teams and dev leaders to deliver mission-critical services to IBM Cloud
  • Build new tools to improve automated resolution of production issues
  • Monitor, respond promptly to production alerts, execute changes in Production through automation
  • Support the compliance and security integrity of the environment
  • Continually improve systems and processes regarding automation and monitoring
  • Work with Support and Development teams to identify the root cause to resolve issues
  • Discuss and plan continuous improvement in the stability of production environment
  • Guide & provide technical escalation support for other Infrastructure Operations teams


Required Technical and Professional Expertise


  • Excellent written and verbal communication skills.
  • Overall 10+ years of experience in Public Cloud infrastructure
  • Minimum 6+ year’s experience in handling large production systems in a cloud environment
  • Strong skills on Linux, Scripting, Debugging complex issues working with other teams
  • Ability to handle complex customer situations to resolution
  • 7+ years of experience in Virtualization Technologies and Automation / Configuration Managements
  • Automation and configuration management tools/solutions: Ansible, Python, bash, Terraform, GoLang etc. (any two)
  • Virtualization technologies: Citrix Xen Hypervisor (Preferred), KVM(also preferred), libvirt, VMware vSphere, etc. (at least one)
  • Monitoring technologies: Zabbix (preferred), Sysdig, Grafana, Nagios, Splunk, etc. (at least one)
  • Strong skills on Container technologies: Kubernetes, Docker, etc.
  • Work with Engineering to:
  • Provide initial assessment and possible workaround of production issue
  • Troubleshoot and resolve production issues
  • Working knowledge with ServiceNow, JIRA, and GitHub


Preferred Technical and Professional Expertise


  • Knowledge of compute, storage & networking systems in a public cloud environment

لقد تجاوزت الحد الأقصى لعدد التنبيهات الوظيفية المسموح بإضافتها والذي يبلغ 15. يرجى حذف إحدى التنبيهات الوظيفية الحالية لإضافة تنبيه جديد
تم إنشاء تنبيه للوظائف المماثلة بنجاح. يمكنك إدارة التنبيهات عبر الذهاب إلى الإعدادات.
تم إلغاء تفعيل تنبيه الوظائف المماثلة بنجاح. يمكنك إدارة التنبيهات عبر الذهاب إلى الإعدادات.