Site Reliability Engineer

United States

Everbridge logo
Everbridge
Apply now Apply later

Posted 1 month ago

*This role can be based anywhere in the United States.
Are you motivated by an incredible sense of purpose in doing work that helps keep people safe and business running daily, with results that regularly make headlines? Are you passionate about innovating on the industry’s cutting edge to develop solid architecture principles, operability guidelines, progressive scaling methodologies, and other sophisticated techniques to reliably operate critical technology infrastructure at scale?.
As the Everbridge Site Reliability Engineering team, we are responsible for ensuring overall service quality and availability of Everbridge's solutions. The technology platforms that we support automate the international delivery of critical information to help keep people safe and businesses running.

What you'll do:

  • Own operational availability, security, scalability, efficiency, monitoring, instrumentation, and overall service reliability of Everbridge's solutions.
  • Collaborate across Agile teams with Architects, Developers, Quality, Data, Security, and other Operations engineers on designing and implementing highly reliable solutions.
  • Embrace Site Reliability Engineering principles of proactivity, automation, cross-functional collaboration, data-driven decision making, and fast+safe failing to continually improve our technology and culture.
  • Enhance our infrastructure, tooling, and processes to extend operability as a self-service function for other groups in the engineering value stream.
  • Participate in a rotating on-call schedule to troubleshoot and resolve production escalations from our 24x7x365 NOC.
  • Have fun while we work hard to make a difference..

What you'll bring:

  • Previous experience contributing in a production Site Reliability, DevOps, or SaaS/Technical Operations
  • Minimum of 3 years of AWS experience in a production environment
  • Automation framework orchestration, configuration management, and software-defined infrastructure management techniques (SaltStack preferred, others e.g. Puppet, Chef, Ansible, etc. also acceptable)
  • 1+ years of Kubernetes experience (EKS, AKS, GKE, Self managed)
  • Ability to write code in at least one programming language (e.g. Python, Perl, Java, Ruby, Go)
  • Experience with Terraform, Jenkins, Packer and Docker
  • Large scale production UNIX/Linux operating system, application, and security maintenance in an online service provider environment (Ubuntu and Debian GNU/Linux preferred)
  • US Citizenship (or Green Card)

Bonus if you have experience with:

  • Infrastructure/application monitoring and Observability solutions (Datadog, Prometheus, ELK, Graphite/Grafana, InfluxDB, Splunk, Graylog, etc.).
  • Application containerization and service-oriented-architecture technologies (Nomad & rest of HashiCorp suite, Docker, Kubernetes, Mesos, Fedora CoreOS/rkt)
  • Email transport software and deliverability management concepts (Postfix/Sendmail and derivative commercial MTAs, SPF, DomainKeys/DKIM, DMARC, IP reputation).
  • RDBMS,NoSQL,and hybrid data tier platforms(MongoDB,Elasticsearch, Postgres, MySQL, Riak, Cassandra, HBase, etc.).
  • SIEM,HIDS/NIDS,and related infrastructure tooling required to maintain positive control over security.
  • Practical knowledge of BGP traffic engineering, DDoS mitigation, and active threat defense techniques.

Bridger Culture: 
At Everbridge, we have a mission that matters – to keep people safe and businesses running during critical events. Our “Bridgers” join Everbridge to make a positive impact on the world through their work. The core of our company culture is built around making a difference. Our people are dedicated to solving problems during difficult times and challenging situations as our software was built to save lives.
We are a rapidly growing organization transforming the field of critical event management and need passionate, committed and determined individuals to help us carry out our mission. Our environment is dynamic, and our culture is constantly evolving and expanding in order to provide the best employee experience. Click here to learn more about what we do. Passionate about our mission? Want to #BeTheBridge? Apply to be a part of our team today! Everbridge is an Equal Opportunity/Affirmative Action Employer. All qualified Applicants will receive consideration for employment without regard to race, creed, color, religion, or sex including sexual orientation and gender identity, national origin, disability, protected Veteran Status, or any other characteristic protected by applicable federal, state, or local law.
Job tags: Ansible AWS Chef CoreOS Debian Docker Elasticsearch ELK Go Grafana Java Kubernetes Linux Mesos MongoDB MySQL Packer Perl Postgres Prometheus Puppet Python Reliability engineering REST Ruby Terraform Ubuntu Unix
Share this job: