Introduction A career in IBM Software means you’ll be part of a team that transforms our customers challenges into solutions. Seeking new possibilities and always staying curious, we are a team dedicated to creating the world’s leading AI-powered, cloud-native software solutions for our customers. Our renowned legacy creates endless global opportunities for our IBMers, so the door is always open for those who want to grow their career. IBM’s product and technology landscape includes Research, Software, and Infrastructure. Entering this domain positions you at the heart of IBM, where growth and innovation thrive.
Your Role and Responsibilities As a DevOps + Site Reliability Engineer you will work in an agile, collaborative environment to build, deploy, configure, and support services in the IBM Cloud. Your responsibilities will encompass the design and implementation of innovative features/automation, fine-tuning and sustaining existing code for optimal performance, uncovering efficiencies, supporting adopters globally, and driving to deliver a highly available cloud offering within IBM Cloud Security Services. In this role, you will be implementing and consuming APIs in the IBM cloud infrastructure environment while configuring integrating services. You will be a motivated self-starter who loves to solve challenging problems and feels comfortable managing multiple and changing priorities, and meeting deadlines in an entrepreneurial environment. Your primary responsibilities include:
Contributing to new features and improving existing capabilities or processes while relentlessly troubleshooting problems to deliver.
Practice secure development principles supporting continuous integration and delivery leveraging tools such as Tekton, Ansible, and Terraform
Collaborate across teams in activities including code reviews, testing, audit support, and mitigating issues.
Continuously improve code, automation, testing, monitoring and alerting processes to ensure proactive identification and resolution of potential issues.
Lead or contribute to the problem resolution process for our clients, from analysis and troubleshooting, to deploying workarounds or fixes
Participate in on-call rotation and lead or contribute to the problem resolution process for our clients, from analysis and troubleshooting, to deploying workarounds or fixes
Required Technical and Professional Expertise
3+ years of proven development and/or DevOps experience deploying and maintaining in a multi-tiered infrastructure with dependencies and governance
3+ years of experience in large-scale infrastructure design, engineering, and support
3+ years of infrastructure engineering with proven record for delivering high-quality, large-scale solutions. Experience designing architectures for scale and performance
Must be extremely comfortable using and navigating within a Linux environment
Ability to do low level debugging and problem analysis by examining logs and running Unix commands
Lead initiatives to implement and manage advanced configuration management solutions
Must be proficient in writing, debugging, and maintaining automation, scripts and code (ie, Bash, Ansible, and Python, Java or Golang)