Lead Site Reliability Engineer

Canada - Remote

Applications have closed
SurveyMonkey
Use SurveyMonkey to drive your business forward by using our free online survey tool to capture the voices and opinions of the people who matter most to you.

Who we are and what we do

Momentive (formerly SurveyMonkey) is a leader in agile experience management, delivering powerful, purpose-built solutions that bring together the best parts of humanity and technology to redefine AI. Momentive products, including GetFeedback, SurveyMonkey, and its brand and market insights solutions, empower decision-makers at 345,000 organizations worldwide to shape exceptional experiences. More than 20 million active users rely on Momentive to fuel market insights, brand insights, employee experience, customer experience, and product experience. Ultimately, the company's vision is to raise the bar for human experiences by amplifying individual voices. Learn more at momentive.ai.

More about our Site Reliability Engineering (SRE) Team

As a member of the new SRE team at SurveyMonkey, you will automate away toil, support teams working in AWS, and build tools to make development seamless. The SRE team is embedded among several development teams that are responsible for some of the most heavily trafficked applications within SurveyMonkey. The SRE team will be fully remote across North American time zones. This role presents a prime opportunity to ensure reliability, maintainability, performance, scalability, and security are at the forefront of our teams' products.

What we're looking for

The Staff Site Reliability Engineer will partner with the application development and main infrastructure teams to architect and operate reliable, scalable, and performant services. You will have a huge impact on how we do things and help take our engineering excellence to the next level. You will report to the Senior Manager of Site Reliability Engineering.

You will

  • Partner with application developers and architects to ensure our services are built for scale and performance
  • Develop the monitoring solutions on top of existing observability platforms
  • Refine the development, build and deployment processes on top of our main infrastructure
  • Work with the engineering teams to architect and build our platform services to simplify real-time troubleshooting and operational response to incidents and outages
  • Be the expert on how to best use AWS technologies to build our next-generation platform
  • Bridge the divide between our core application engineers and our main infrastructure teams
  • Provide capacity management expertise to ensure our deployments are managed for robustness and cost
  • Bring best practices and own environment management, ensuring all of our dev/test/prod environments are reproducible with high availability

You have

  • A minimum of 12 years of experience operating in a large-scale environment
  • Experience architecting, scaling, and improving the services and customer experiences of the platforms you support
  • Experience in application architecture designs and implementations, including the operational and strategic trade-offs of different designs
  • Knowledge of different aspects of service design, including messaging protocols and behavior, caching strategies, and software design practices
  • Experience developing accomplished SREs, from junior to senior levels
  • Experience communicating technical concepts to operational and development teams

What we offer our employees

Momentive is a place where the curious come to grow and shape what's next. By embedding inclusion into our processes, policies, and culture for our 1,400+ employees across North America, Europe, and APAC, we're building a workplace where people of every background can excel. We've won multiple awards and received recognition for our forward-looking policies, including extended parental and bereavement leave, vendor benefits standards, and Take 4 sabbaticals.

Momentive is featured as a Glassdoor 2021 Best Place to Work and National Capital Region's Top Employer in Canada (2021). In 2020, Momentive was recognized as a top place to work by Glassdoor Best Places to Work, Fortune Best Places to Work in the Bay Area, Parity.org's Best Companies for Women to Advance, and National Capital Region's Top Employers in Canada. Momentive has consistently been recognized by Great Place to Work® and Fortune as a top workplace since 2018, and we have also won numerous awards as a leader in global survey software, including being named among the G2 Best Software Companies, CNBC's Disruptor 50, and the Forbes Cloud 100.

Our commitment to an inclusive workplace

Momentive is an equal opportunity employer and is committed to providing a workplace free from harassment and discrimination. We celebrate the unique differences of our employees because that is what drives curiosity, innovation, and the success of our business. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, gender identity or expression, age, marital status, veteran status, disability status, pregnancy, parental status, genetic information, political affiliation, or any other status protected by the laws or regulations in the locations where we operate. Accommodations are available for applicants with disabilities.

Learn more about our diversity, equity, and inclusion efforts here.

Job region(s): Remote/Anywhere North America