Data Reliability Engineer II

United States

Applications have closed
Everbridge

Posted 1 month ago

*This role can be based anywhere in the United States.
Do you have an insatiable appetite for streamlining away inefficiency, automating away toil, and eliminating problems before they occur? If so, this position is a perfect opportunity for you to join the Everbridge Database Reliability Engineering team in a hands-on role driving the design, implementation, and operation of our global platforms.
Who we are:

As the Everbridge Database Reliability Engineering team, we are responsible for ensuring overall service quality and availability of Everbridge's solutions. The technology platforms that we support automate the international delivery of critical information to help keep people safe and businesses running. We are a 24x7x365 distributed team that can do our job anytime, anywhere on the planet with an Internet connection.

What you'll do:

  • Own operational availability, security, performance, scalability, efficiency, monitoring, instrumentation, integrity, and overall service reliability of Everbridge's data tier.
  • Collaborate across Agile teams with Architects, Developers, Quality, Security, and other Operations engineers on designing and implementing highly reliable data solutions.
  • Develop and enhance our infrastructure-as-code tooling and the processes that extend operability, resiliency, and self-service of our data tier.
  • Embrace Site Reliability Engineering principles of proactivity, automation, cross-functional collaboration, data-driven decision making, and fast+safe failing to continually improve our technology and culture.
  • Participate in a rotating on-call schedule to troubleshoot and resolve production escalations from our 24x7x365 NOC.
  • Have fun while we work hard to make a difference.

What you'll bring:

  • 4+ years of experience in production Data Reliability Engineering or database administration in a SaaS/DevOps environment
  • 2+ years of experience writing code in at least one programming language (Python preferred)
  • 2+ years of experience managing cloud data infrastructure (AWS preferred) with configuration management and orchestration tools (SaltStack and Terraform preferred)
  • 2+ years of experience with MongoDB or Elasticsearch/ELK administration
  • 2+ years of experience with CI/CD and automation (Jenkins preferred)

Bonus if you have experience with:

  • Relational database administration, such as PostgreSQL or SQL Server
  • Data streaming, such as Kinesis or Kafka
  • Data store high availability and disaster recovery
  • Infrastructure health monitoring, alerting, and troubleshooting

Job tags: AWS CD CI Elasticsearch ELK High availability Kafka MongoDB PostgreSQL Python Reliability engineering SQL Streaming Terraform
Job region(s): North America