Director of Site Reliability Engineering


Aircall logo
Apply now Apply later

Posted 1 month ago

Aircall is on a mission to revolutionize the business phone industry!
We are an advanced, cloud-based business phone system and call center software — all wrapped up in one single tool (no hardware, 100% integrated).
But behind our product are the people driving it. Ambition, Community, Teamwork and Transparency – these are the values we live by at Aircall. We know that success comes from smart work and deserves to be recognized and rewarded.
If you love a good challenge, enjoy solving meaningful problems, and want to be a part of one of the fastest growing B2B startups — then Aircall is the company you are looking for!

As  Director of Site Reliability Engineering, you will be responsible for the delivery and operational integrity of the system and business-critical features that add customer value on top of Voice. Providing technical and architectural best practices, evangelization, and mentoring in your team and across the whole of Engineering will be part of your day to day job.
Using a variety of back-end stacks, appropriate for each requirement, but always hosted on AWS, we build added value and resilient services on top of voice and fully integrate with our customers’ business critical tools (CRM, Helpdesk, E-Commerce, …).

Your mission @ Aircall:

  • You lead, organise and empower a team in order to deliver cross-functional projects delivering high quality, secure, voice solutions to Aircall clients from conception to production.
  • Work with wider Engineering teams to define, refine and communicate the platform's architectural strategy, standards and best practices to help improve availability, reliability and resilience of our global infrastructure and systems.
  • Involvement in product and platform performance optimization and live site monitoring spreading DevOps culture throughout the organisation.
  • Promote the design, build and maintain infrastructure through reusable code and tooling.
  • You create solutions to continuously improve scalability, performance and security of our platform and see elimination of waste as a key part of your efforts.
  • You take responsibility for organizing and maintaining an operational framework that ensures respect of business service level agreements.
  • Maintain awareness and knowledge of current and emerging, AWS infrastructure and services, to enable adoption and use across Engineering.
  • You coordinate communications, actions and gather requirements with other departments (eg: Marketing, Sales, Care)

A little more about you:

  • 5+ years experience working as an SRE engineer (with at least 2+ as a lead engineer) for highly available, resilient, fault tolerant systems that utilize load balancing, horizontal/vertical scaling and high availability.
  • You will have worked extensively with AWS in a production environment and understand how to design for, build and deploy on and get the best out of the environment and services provided by Amazon
  • You can talk easily and spontaneously about most of the offerings on AWS, when to use them, and how, and can explain to junior engineers.
  • Security is at the forefront of your mind in everything that you do.
  • You are rigorous with code quality and other engineering best practices (testability, maintainability), and are accustomed to using systems such as Jira, Confluence, BitBucket, GitHub, GitPrime, etc.
  • You possess effective and proactive communication skills, and can collaborate with different profiles and roles, from junior to senior, tech to business.
  • You are used to working in Agile teams and look for and implement  continuous improvement, but you also appreciate good process and quality assurance in mitigating risk and improving quality.
  • Provide technical leadership across multiple teams, manage debt and legacy code with business, technical risk mitigation
  • You invest your technical and interpersonal experience to mentor juniors and include other engineers in your actions
  • You have experience in identifying, debugging and solving complex production issues.

Nice to have :)

  • Experience of AWS AppSync, AWS Aurora, delivery of solutions using serverless including Amazon AWS Lambda technology and experience in distributed event based microservices architecture, such as Amazon SNS
  • Have good programming skills, preferably typed language (Typescript/Node.js, Java, Scala), with experience in Javascript, Ruby, or Python, with a focus on delivering for security, scalability, availability, and performance
Why joining us?
🚀 Key moment to join Aircall in term of growth and opportunities💆‍♀️ Our people matter, work-life balance is important at Aircall📚 Fast-learning environment, entrepreneurial and strong team spirit🌍 30+ Nationalities: cosmopolite & multi-cultural mindset🌞 Sunny offices in the center of Paris with incredible perks and regular team parties💶 Competitive salary package & benefits (health coverage, lunch, commute, sport)
Aircall is committed to building a diverse, equitable and inclusive workforce. We are an equal opportunity employer and welcome qualified applicants, regardless of gender identity or expression, sexual orientation, race, religion, age, national origin, citizenship, pregnancy status, veteran status, or any other differences. If you have a disability or special need that requires accommodation, please let us know. Members of communities historically underrepresented in tech are encouraged to apply.
Job tags: AWS High availability Java JavaScript Jira JS Lambda Load Balancing Node Node.js Python Reliability engineering Ruby Scala