Senior Database Infrastructure Engineer

San Francisco, California

Full Time Senior-level / Expert
Udemy, Inc. logo
Udemy, Inc.
Apply now Apply later

Posted 3 weeks ago

Udemy is the leading global marketplace for teaching and learning, connecting students everywhere to the world’s best online classes. We are looking for a Senior Datastore Reliability Engineer to join our Datastore Infrastructure Engineering Team. With a commitment to innovation, we embrace automation and agile culture, love technical challenges and are eager to adopt new technologies and tools. We are responsible for all aspects of MySQL, Redis, Memcached, DynamoDB, Kafka, Consul and RabbitMQ datastores and proxies like ProxySQL, Envoy and MCRouter across environments including production. Our primary tools are Terraform, Ansible, Atlantis, Datadog, Percona Management Tools, Github and Python. We value teamwork, good humor, strong sense of ownership, technological curiosity, and a desire to learn.You will work with a wide range of relational and NoSQL data sources improving reliability and performance of our growing datastore ecosystem. DSI team members are primarily located in San Francisco, US and Dublin, Ireland. 

Here’s what you’ll be doing:

  • Analyze, improve and automate datastore maintenance flows, backup and recovery procedures, capacity management, and access monitoring 
  • Proactively respond to production infrastructure alerts and warnings, mitigate production issues as they arise and transform incident lessons into automation, documentation and monitoring
  • Work with SRE and engineering teams to review and deploy changes to production environment, advise on datastore availability and scalability policies and best practices
  • Develop and enhance datastore production environment monitoring, observability  and management capabilities using existing and new tools and platforms
  • Manage replication and failover topology for MySQL, Memcached, Redis, Kafka, DynamoDB and RabbitMQ
  • Perform code reviews and answer datastore related infrastructure questions
  • Participate in On-Call rotation

We’re excited about you because you have:

  • Passion for performance, observability, availability and scalability
  • Expert-level knowledge, administration skills and hands on experience with two or more of the following datastores: MySQL, Redis, DynamoDB, RabbitMQ, Memcached, Kafka
  • Solid software engineering skills with proficiency in at least one of programming languages like Python 
  • Comfortable with infrastructure automation and configuration management tools such as Terraform and Ansible
  • Experience with automated testing and continuous integration when dealing with infrastructure as code (e.g. Molecule, Atlantis)
  • Experience with containers and container orchestrators such as Kubernetes 
  • Good understanding of Linux/Unix fundamentals and debugging skills
  • Experience building and operating distributed data storage systems with hundreds of nodes
  • 5+ years experience managing large-scale database systems in Cloud (AWS prefered) and/or hybrid environments
About UdemyWe believe anyone can build the life they imagine through online learning. Today, more than 40 million students around the world are advancing their careers and passions by exploring and mastering new skills on Udemy, and expert instructors are able to share their knowledge with the world. Through our global marketplace and our solutions for businesses and governments, we connect people everywhere with the skills they need for success in work and life. We’re a close-knit bunch that enjoys problem-solving and collaboration, and we share a serious belief in the power of learning and teaching to change lives. Udemy’s culture encourages innovation, creativity, passion, and teamwork. We also celebrate our milestones and support each other every day.
Founded in 2010, Udemy is privately owned and headquartered in San Francisco’s SOMA neighborhood with offices in Denver (Colorado), Dublin (Ireland), Ankara (Turkey), Gurugram (India), and São Paulo (Brazil).
Udemy in the NewsUdemy Adds More than $1 Billion To Its Valuation in New Funding RoundUdemy’s Workplace Learning Tool Just Surpassed $100M in ARRPaid Paternity Leave Should be the Norm in the U.S.Breakdown of Most In-Demand Skills for 2020—Finance, Marketing, Sales and EngineeringHow Investing in Yourself Today Will Set You Up for Career Success TomorrowFeedback Isn’t the Problem, but the Way That We Deliver It Is Broken
Job tags: Ansible AWS HTML Kafka Kubernetes Linux MySQL Python RabbitMQ Redis Terraform Unix
Job region(s): North America
Job stats:  0  0  0
  • Share this job via
  • or