Site Reliability Engineer IV

Mexico - Guadalajara, Jalisco

Applications have closed
Rackspace

Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. An SRE ensures that Rackspace's managed service offerings and customer deployments have reliability and uptime appropriate to users' needs and a fast rate of improvement, while monitoring and validating capacity and performance. The role focuses on reliability, scalability, and the development of automation to manage tasks at scale.

Key Responsibilities:

  • Supports high-complexity deployments and internal teams on an as-needed basis.
  • Responsible for the roll-out and operation of large-scale, complex systems automation.
  • Collaborates with other teams on tools for systems automation.
  • Works with leading-edge technology in the managed cloud space.
  • Supports the stability and uptime of client sites.
  • Provides peer feedback.
  • Regularly reviews system software and hardware requirements.

Skills:

  • 5 to 10 years of extensive IT experience
  • Deep understanding of Python; comfortable coding with object-oriented programming (OOP) concepts and scripting
  • Experience working with Google Compute Cloud Dataflow and BigQuery to manage and move data within Google Cloud
  • Experience implementing scripts that load Google BigQuery data and run queries to export data
  • Experience with the Google Cloud Composer environment and with Airflow for scheduling data pipeline jobs using DAGs
  • Comprehensive experience with Ansible or a similar configuration management tool
  • Experience with Linux administration and a general understanding of Linux internals (system calls, file systems, processes, etc.)
  • Experience configuring a Kubernetes cluster on GKE as a managed, production-ready environment for deploying containerized applications, and deploying the Kubernetes dashboard to access the cluster via its web-based user interface
  • Experience creating clusters using Kubernetes and kubectl, and creating Pods, Replication Controllers, Services, Deployments, Labels, health checks, and Ingress by writing YAML files
  • Awareness of data pipelines, ETL, business intelligence, reporting, and dashboards
  • Ability to debug Python- and SQL-based code and fix bugs under pressure within a short time frame
  • Take ownership of the task at hand and exhibit excellent team spirit
  • Communicate confidently with stakeholders
  • Familiarity with Agile delivery, Jira, and Kanban boards



About Rackspace Technology

We are the multicloud solutions experts. We combine our expertise with the world’s leading technologies — across applications, data and security — to deliver end-to-end solutions. We have a proven record of advising customers based on their business challenges, designing solutions that scale, building and managing those solutions, and optimizing returns into the future. Named a best place to work, year after year according to Fortune, Forbes and Glassdoor, we attract and develop world-class talent. Join us on our mission to embrace technology, empower customers and deliver the future.

More on Rackspace Technology

Though we’re all different, Rackers thrive through our connection to a central goal: to be a valued member of a winning team on an inspiring mission. We bring our whole selves to work every day. And we embrace the notion that unique perspectives fuel innovation and enable us to best serve our customers and communities around the globe. We welcome you to apply today and want you to know that we are committed to offering equal employment opportunity without regard to age, color, disability, gender reassignment or identity or expression, genetic information, marital or civil partner status, pregnancy or maternity status, military or veteran status, nationality, ethnic or national origin, race, religion or belief, sexual orientation, or any legally protected characteristic. If you have a disability or special need that requires accommodation, please let us know.
Job region(s): North America