Site Reliability Engineer

San Francisco / Las Vegas / Remote

Remind logo
Apply now Apply later

Posted 3 weeks ago

About the company

Remind, the leading communication platform in education, helps educators reach students and parents where they are: their phones. With nearly 30 million active users, we’re one of the fastest-growing companies in education technology, but we have our sights set on something bigger: giving every student the opportunity to succeed.

Remind runs one of the largest free services in education, one of the fastest growing school/district SaaS businesses (the Remind Plan), and a revolutionary new direct to consumer business (Remind Coaching). The common thread through each of our product lines is the belief that success in education is driven by relationships, and the Remind communication platform is where the next generation of education relationships live.

About this role

The Remind Engineering Team collaborates to deliver features for our users and customers while setting and maintaining SLAs to ensure reliable system performance. We prefer strongly typed languages over dynamic for critical business systems, and leverage both relational and non-relational data structures as needed, supporting tens of thousands of requests per second. We bias towards using the right tool for the job, including Typescript, Python, Go, Ruby, Twirp, GraphQL, and many AWS services (Aurora, Lambda, DynamoDB, SQS, Kinesis).

As a Site Reliability Engineer at Remind, you'll collaborate with our product engineering teams, as well as cross-functional teams, to maximize site availability, performance, and uptime, and build systems and features to enable engineers to ship more quickly and more confidently.

Not in San Francisco? No problem! Our team is distributed within +/-3 hours of Pacific Time.

About you:

  • You have consistently shipped high quality code to production as part of a team
  • You collaborate effectively with engineers and product managers to build systems to increase the leverage of our product engineering teams
  • You write clean code and have significant experience with one or more programming languages
  • You understand the value of an appropriately defined SLA for both internal and external systems and services, and have experience building highly available systems and services which scale and perform in accordance with such an SLA
  • Others enjoy working with you because of your positive attitude and technical competence

What you'll do:

  • Increase the overall availability and performance of our distributed services
  • Support uptime through participation in our eng-wide on-call rotation
  • Help establish, conform to, and audit our SLAs so that the performance of our website exceeds the expectations of students, parents, and educators in even our largest and most demanding school districts
  • Use technologies such as Packer+Ansible, stacker, CloudFormation, Docker, ECS, and Lambda to maintain and improve our foundational infrastructure
  • Improve the deployment process to make it fast and predictable as possible
  • With product engineering teams, debug production issues across services and levels of the stack
  • Partner with product engineering teams to plan the growth of Remind’s infrastructure


  • Competitive salary and equity
  • 401K
  • 100% health coverage for you and your dependents
  • Open vacation policy
  • Paid parental leave

Remind is an equal opportunity employer, and we're committed to diversity and inclusion in the workplace. We aim to represent the students, teachers, and parents we serve, and we welcome, support, and empower all the diverse individuals in our community.

Job tags: Ansible AWS CloudFormation Docker Go GraphQL Lambda Packer Python Ruby
Job region(s): North America Remote/Anywhere
Job stats:  2  0  0
  • Share this job via
  • or