Site Reliability and Automation Engineer

Austin, Texas, US

Applications have closed
IBM logo
IBM

Posted 1 month ago


Introduction
Software Developers at IBM are the backbone of our strategic initiatives to design, code, test, and provide industry-leading solutions that make the world run today - planes and trains take off on time, bank transactions complete in the blink of an eye and the world remains safe because of the work our software developers do.  Whether you are working on projects internally or for a client, software development is critical to the success of IBM and our clients worldwide.  At IBM, you will use the latest software development tools, techniques and approaches and work with leading minds in the industry to build solutions you can be proud of.


Your Role and Responsibilities

Are you passionate about technology? Do you love building new things? Do you want to develop the future of IBM's Cloud offerings? If you answered YES, then we have the right opportunity for you!

The shift toward the consumption of IT as a service, i.e., the cloud, is one of the most important changes to happen to our industry in decades. At IBM, we are driven to shift our technology to an as-a-service model and to help our clients transform themselves to take full advantage of the cloud. With industry leadership in analytics, security, commerce, and cognitive computing and with unmatched hardware and software design and industrial research capabilities, no other company is as well positioned to address the full opportunity of cloud computing.

We are looking for a dynamic, Site Reliability and Automation Engineer to join our Cloud Operations Team, who is responsive to market needs, to deliver value to our clients in a fast-changing cloud landscape. The Cloud team is dedicated to ensuring the IBM Cloud is at the forefront of cloud technology, from data center design to network architecture to storage and compute clusters to flexible infrastructure services. We are building and operating IBM's next generation cloud platform to deliver performance and predictability for our customers' most demanding workloads, at global scale and with leadership efficiency, resiliency and security. It is an exciting time, and as a team we are driven by this incredible opportunity to thrill our clients.

In this Site Reliability and Automation Engineer role, you will work closely with the Data Center, the entire Cloud development organization and IBM vendors to support, maintain and operationally improve the cloud infrastructure. Your focus will be the following key responsibilities:

  • Automate health monitoring of the production and test systems

  • Automate return to service procedures for Cloud Platform Components

  • Support the compliance and security integrity of the environment through your work

  • Partner with other teams, functional managers and program managers to deliver mission-critical services to the market

  • Support development of new and existing capabilities for our compute, storage and network services

  • Integrate automation with operational requirements

  • Work with Engineering to:

    • Define operational requirements

    • Automate operational requirements

    • Participate in the full deployment pipeline

  • Work with Support and Development to:

    • Identify and resolve issues

    • Discuss and plan integration requirements




Required Technical and Professional Expertise
  • Minimum of 5 years’ experience in hands-on production administration of large system environments, including virtual platforms.

  • 5+ years of experience in data center infrastructure or relevant work experience, large-scale infrastructure design, engineering, and support, IT Change, Incident, Problem, Asset management, infrastructure engineering with proven record for delivering high-quality, large-scale solutions. Experience designing architectures for scale and performance

  • Must be efficient in writing, debugging and maintaining scripts (Bash and Python)

  • Ability to do low level debugging and problem analysis by examining logs and running Unix commands

  • 2-3 years of extensive experience with open-source products

  • 3-5 years of experience with configuration management systems (Ansible / Chef)

  • Hands on knowledge of using Splunk or ELK

  • Working knowledge with Network and Storage technologies

  • Working knowledge with ServiceNow, JIRA, Confluence, and GitHub




Preferred Technical and Professional Expertise
  • 2+ years of experience with Kubernetes

  • 4+ years of experience with GitHub, Perl and Python

  • 5+ years of experience with configuration management systems (SaltStack/Ansible/Chef)

  • 8+ years of experience in virtualization environments such as AWS /Softlayer/Zen/VMWARE




About Business Unit
Digitization is accelerating the ongoing evolution of business, and clouds - public, private, and hybrid - enable companies to extend their existing infrastructure and integrate across systems. IBM Cloud provides the security, control, and visibility that our clients have come to expect. We are working to provide the right tools and environment to combine all of our client’s data, no matter where it resides, to respond to changing market dynamics.


Your Life @ IBM
What matters to you when you’re looking for your next career challenge?

Maybe you want to get involved in work that really changes the world? What about somewhere with incredible and diverse career and development opportunities – where you can truly discover your passion? Are you looking for a culture of openness, collaboration and trust – where everyone has a voice? What about all of these? If so, then IBM could be your next career challenge. Join us, not to do something better, but to attempt things you never thought possible.

Impact. Inclusion. Infinite Experiences. Do your best work ever.


About IBM
IBM’s greatest invention is the IBMer. We believe that progress is made through progressive thinking, progressive leadership, progressive policy and progressive action. IBMers believe that the application of intelligence, reason and science can improve business, society and the human condition. Restlessly reinventing since 1911, we are the largest technology and consulting employer in the world, with more than 380,000 IBMers serving clients in 170 countries.


Location Statement
For additional information about location requirements, please discuss with the recruiter following submission of your application.


Being You @ IBM
IBM is committed to creating a diverse environment and is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, pregnancy, disability, age, veteran status, or other characteristics. IBM is also committed to compliance with all fair employment practices regarding citizenship and immigration status.












Job tags: Ansible AWS Bash Chef ELK Infrastructure design Jira Kubernetes Perl Python Unix Virtualization VMware
Job region(s): North America