Site Reliability Engineer - Cloud SRE (Cloud Services)
United States (Remote)
At HashiCorp, we operate according to a strong set of company principles, many of which are described in The Tao of HashiCorp. We value top-notch collaboration and communication skills, both among internal teams and in how we interact with our users. We take care to balance and be responsive to the needs of our open source community as well as our enterprise level customers.
Engineering at HashiCorp is largely a remote team, and this role is no exception. We are looking for a Full-time Remote Employee within the US, Canada, UK, Germany or the Netherlands. While prior experience working remotely isn't required, we are looking for team members who perform well given a high level of independence and autonomy.
We build Consul, Nomad, Vault, Terraform, Packer, and Vagrant. Alongside of that, we deploy enterprise products for each in a variety of different ways: licensed and unlicensed binaries, appliances to public cloud platforms, and hosted SaaS platforms. Our products help organizations of all sizes run any infrastructure for any application.
At HashiCorp, we value top-notch collaboration and communication skills, both among internal teams and in how we interact with our users. We take care to balance and be responsive to the needs of our open source community as well as our enterprise level customers.
About the role:
The Cloud Services team is an organization focused on delivering Hashicorp’s software as a Cloud service. This effort will enable a distribution model wherein customers can use a fully managed service with an API contract.
In your cover letter, please describe why you're interested in working at HashiCorp, and what draws you to this role in particular! Specifics of your past experiences that are relevant to this role are great to include, too.
In this role, you can expect to:
- Design, implement, and maintain a secure and scalable infrastructure platform for delivering Cloud Services’ applications
- Own and ensure that internal and external SLA’s meet and exceed expectations, System centric KPIs are continuously monitored and improved
- Create tools for automating deployment, monitoring and operations of the overall platform
- Participate in on-call rotation to provide application support, incident management, and troubleshooting
- Provide ongoing maintenance and support of internal tools, improve system health and reliability
- Program mostly in Golang, learning from and contributing to a team committed to continually improving their skills
You may be a good fit for our team if you have:
- Familiarity with infrastructure management and operations lifecycle concepts and ecosystem
- Experience operating and maintaining production systems in a Linux and public cloud environment
- You have prior experience working in high performance or distributed systems; while we strive to hire at a variety of experience levels, this particular opening is not well-suited for recent graduates
- Working knowledge of industry best practices with regard to information security
- You have built or operated a large scale Cloud service
- Comfortable with Go or another low-level programming language
HashiCorp embraces diversity and equal opportunity. We are committed to building a team that represents a variety of backgrounds, perspectives, and skills. We believe the more inclusive we are, the better our company will be.