Site Reliability Engineer (SRE)

United States (Remote)

Pantheon

Pantheon is the WebOps platform where teams build, host, and manage their websites. Pantheon delivers everything your business needs for digital speed and agility.

About Pantheon

Pantheon’s WebOps Platform powers the open web, running sites in the cloud for customers including Stitch Fix, Okta, Home Depot, Pernod Ricard, and The Barack Obama Foundation. Every day, thousands of developers and marketers create, iterate, and scale websites on the open web to reach billions of people globally. Pantheon’s SaaS model puts large and small web and digital teams in control of increasing the performance of their teams, websites, and marketing programs. Pantheon’s cloud-native software includes governance, security, and collaboration tools that make it easy to securely manage a single website or thousands of websites across multiple teams in one platform. The built-in ability to simultaneously create, test, deploy, and run live sites with unrivaled hosting speed, scalability, and uptime gives marketing teams the agility to win in the dynamic world of digital marketing.

With 35% of the web running on open-source software and significant investments in a $200 billion total addressable market, we are growing aggressively into a huge market opportunity and looking to expand our organization.

The Role

Pantheon is looking for a Site Reliability Engineer to join our team, either remote or onsite at our SF or Minneapolis offices (on US hours). We’re expanding an impressive and growing platform that powers hundreds of thousands of websites, millions of containerized resources, billions of monthly page views, and development tools that professional website developers use.

Along the way, we’ve written tools to manage containers at scale. We built a massive multi-tenant distributed file system, a CI/CD pipeline, and a cloud-native container-based infrastructure orchestrated with Kubernetes. We have contributed to open source communities such as WordPress, Drupal, Fedora, Chef, systemd, cURL, Kubernetes, Terraform, Sensu, and more. We are looking for SREs who are passionate about helping other engineers implement SLOs across their services, and who like to build tools and be a force multiplier for other engineering teams.

Pantheon’s core company values are Trust, Teamwork, Passion, and Customers First. Within Pantheon engineering, we value collaboration, character, autonomy, and a no-blame culture. We're enthusiastic participants in several open-source communities and have real relationships with many of our most active customers. If all of this sounds interesting to you, read on!

Cool Things You'll Do

  • Work on advanced global-scale implementations of systems using the latest Google Cloud Platform offerings
  • Define and implement services or processes to improve reliability across the Pantheon platform using tools like Kubernetes, Prometheus, Go, and Terraform
  • Assist other teams while they define reliability objectives for services and infrastructure
  • Manage, automate, and improve common infrastructure (monitoring/metrics/Kubernetes)
  • Help develop and improve observability across Pantheon engineering
  • Continuously improve our standard of engineering excellence by implementing best practices for coding, testing, deploying, and communication
  • Support Pantheon as a member of the on-call engineer rotation, contributing to the stability, reliability and performance of the infrastructure that drives Pantheon's success.

What You Bring to the Table

  • You enjoy and have experience with large-scale, high-traffic platforms and the design of scalable, robust services in the real world
  • You are passionate about monitoring, metrics and the SLO process
  • You would rather automate than put up with toil
  • You have experience programming with Go, Python, Ruby, Bash, or other languages
  • You are a clear communicator, able to represent your contributions and ideas with clarity while remaining open and giving space to the contributions and ideas of others
  • You take pride in what you can do as part of a team

Proudly based out of San Francisco, Pantheon is a platform where marketers and developers build, host, and manage their high-value Drupal and WordPress websites. Pantheon engineering's secret sauce is not our innovative scaling and performance tooling but our passionate, creative, collaborative team.

What We Offer

We have all the usual perks and benefits, but what we can really offer you is a fantastic work environment powered by an amazing team.

  • Industry-competitive compensation and stock option plan
  • Unlimited time off and sick days
  • Full medical coverage (medical, dental, vision)
  • Top-of-the-line equipment
  • Fun at WordPress and Drupal community events
  • Extra benefits like a stipend for books and workouts, plus a whole suite of paid apps for mental and physical health and wellbeing
  • Events and activities, both team-based and company-wide, that inspire, educate, and cultivate

To review the Employee and Applicant's Privacy Policy, click here.

Pantheon is an equal opportunity/affirmative action employer and we welcome applications from all backgrounds regardless of race, color, religion, sex, national origin, ancestry, age, marital status, sexual orientation, gender identity, veteran status, disability, or any other classification protected by law. 

Job region(s): Remote/Anywhere North America