Staff Site Reliability Engineer

Canada - Remote

SurveyMonkey

Posted 3 weeks ago

Who we are and what we do

SurveyMonkey (Nasdaq: SVMK) is a leader in agile software solutions for customer experience, market research, and survey feedback. Our platform empowers more than 20 million active users to analyze and act on feedback from employees, customers, website and app users, and market research respondents. SurveyMonkey's products, enterprise solutions, and integrations enable more than 345,000 organizations to deliver better customer experiences, increase employee retention, and unlock growth and innovation. Ultimately, SurveyMonkey's vision is to raise the bar for human experiences by amplifying individual voices.

More about our Site Reliability Engineering (SRE) Team

As a member of the new SRE team at SurveyMonkey, you will automate away toil, support teams working in AWS, and build tools to make development seamless. The SRE team is embedded among several development teams that are responsible for some of the most heavily trafficked applications within SurveyMonkey. The SRE team will be fully remote across North American time zones. This role presents a prime opportunity to ensure reliability, maintainability, performance, scalability, and security are at the forefront of our teams' products.

What we're looking for

The Staff Site Reliability Engineer will partner with the application development and main infrastructure teams to architect and operate reliable, scalable, and performant services. You will have a huge impact on how we do things and help take our engineering excellence to the next level. You will report to the Senior Manager of Site Reliability Engineering.

You will

  • Partner with application developers and architects to ensure our services are built for scale and performance
  • Develop the monitoring solutions on top of existing observability platforms
  • Refine the development, build and deployment processes on top of our main infrastructure
  • Work with the engineering teams to architect and build our platform services to simplify real-time troubleshooting and operational response to incidents and outages
  • Be the expert on how to best use AWS technologies to build our next-generation platform
  • Bridge the divide between our core application engineers and our main infrastructure teams
  • Provide capacity management expertise to ensure our deployments are managed for robustness and cost
  • Bring best practices and own environment management, ensuring all of our dev/test/prod environments are reproducible with high availability

You have

  • A minimum of 12 years of experience operating in a large-scale environment
  • Experience improving the services and customer experiences of the platforms you support
  • Experience in application architecture designs and implementations, including the operational trade-offs of different designs
  • Knowledge of different aspects of service design, including messaging protocols and behavior, caching strategies, and software design practices
  • Experience making strategic trade-offs among competing priorities when needed
  • Experience developing accomplished site reliability engineers, ranging from junior to senior levels of experience

What we offer our employees

SurveyMonkey is a place where the curious come to grow. By embedding inclusion into our processes, policies, and culture for our 1,000+ employees across North America, Europe, and APAC, we're building a workplace where people of every background can thrive. We've won multiple awards and received recognition for our forward-looking policies, including extended parental and bereavement leave, vendor benefits standards, and Take 4 sabbaticals.

SurveyMonkey was recognized by Great Place to Work® and FORTUNE as a top workplace in 2018 and 2019, and the company has also won numerous awards as a leader in global survey software, including being named among CNBC's Disruptor 50 and the Forbes Cloud 100.

Our commitment to an inclusive workplace

SurveyMonkey is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. Accommodations are available for applicants with disabilities.

Learn more about our diversity, equity, and inclusion efforts here.

Job tags: AWS, High availability, Reliability engineering
Job region(s): North America, Remote/Anywhere