Site Reliability Engineer II
McLean, VA
Full Time

Everbridge
*Looking for candidates within a 2 hour radius of Mclean, VA. About the role:Everbridge is looking for a skilled member of its SaaS Operations team with functional knowledge in all areas of technology operations and site reliability with particularly emphasis automation in a windows environment. The ideal candidate will fulfill the critical role of ensuring our systems are healthy, monitored, and designed to scale. The successful candidate should have hands-on experience in a web-scale role with emphasis on software-as-a-service. Candidates should also have experience designing, planning, implementing, tuning and operating technology including application servers, virtual machine & container management, large-scale monitoring/trending techniques, micro-service architectures, clustering technology, configuration management and creative scaling techniques. About the team:As a member of our Operations team, you will join a team of dedicated, intelligent, fast-paced engineers. You’ll work in a cutting-edge hybrid cloud environment that will power our company’s impressive growth. You will bring a data driven approach to automation, security, scalability, & monitoring/alerting at Everbridge.
What you'll do:
- Own operational availability, security, scalability, efficiency, monitoring, instrumentation, and overall service reliability of Everbridge's solutions.
- Collaborate across Agile teams with Architects, Developers, Quality, Data, Security, and other Operations engineers on designing and implementing highly reliable solutions.
- Embrace engineering principles of proactivity, automation, cross-functional collaboration, curiosity, and data-driven decision making.
- Enhance our infrastructure, tooling, and processes to extend operability as a self-service function for other groups within engineering.
- Participate in a rotating on-call schedule to troubleshoot and resolve production escalations from our 24x7x365 NOC.
- Have fun while we work hard to make a difference.
What you'll bring:
- Bachelor's degree or equivalent
- Ability to obtain and maintain active DoD Secret security clearance
- Large scale production Windows operating system, application, and security maintenance in an online web facing environment
- Orchestration framework, configuration management, and software-defined infrastructure management techniques (i.e. SaltStack, Terraform, Kubernetes, Puppet, Chef, Ansible, etc.)
- Automation experience with PowerShell
- Application virtualization, containerization, and service-oriented-architecture technologies
- Experience supporting & automating systems in a cloud infrastructure – AWS
- US Citizenship and ability to pass a Federal drug screening
- Experience with data systems and distributed systems
- Network architecture and operations with an emphasis on: application load balancing at local and global scale (F5 BIG-IP LTM/GTM, ELB/Route 53), IPv4 routing and dynamic routing protocols (OSPF, BGP), IPsec VPN, and network security best practices
- Hands-on experience with infrastructure as code tools and concepts (e.g. Terraform/Kubernetes/OpenShift/OpenStack/SaltCloud)
- Skill in at least one programming/scripting language. (e.g. PowerShell, Linux Shell .Net/C#, Python, Perl, Ruby, Java, Javascript)
- US Citizenship and ability to pass a Federal drug screening
Job tags:
Ansible
AWS
C
Chef
Java
JavaScript
Kubernetes
Linux
OpenStack
Perl
Puppet
Python
Ruby
Terraform
Virtualization
Windows
Job region(s):
North America