Lead Site Reliability Engineer


Grand Rounds

Posted 3 weeks ago

Are you passionate about building highly available, resilient systems? How about leading a team to deliver impact at scale? We are looking for a hands-on Lead Engineer to drive important cloud infrastructure initiatives using their keen understanding of where our business is headed and a mastery of their craft to help us get there.
The Platform Engineering team builds cutting-edge infrastructure that supports the mission-critical components of our Cloud, Data and Machine Learning Platforms. As a member of Grand Rounds Health's Platform Engineering team, you will collaborate with Data Engineers, Data Scientists, and Product Engineers to design and ship infrastructure built on Kubernetes, Istio and Docker.

Initial Projects Include:

  • Cross-region Kubernetes federation
  • Canary and blue-green deployments with Istio and Tekton
  • A new observability stack covering logs, metrics and tracing
  • Training, processes and a maturity model for all services, because we believe operational excellence is also cultural

About You:

  • You have experience leading and scaling teams to consistently deliver large infrastructure projects
  • You have a commanding grasp of Kubernetes, networking and distributed systems; strong programming skills in Python or Go; solid CS fundamentals; and a track record of implementing highly reliable software
  • You optimize for impact over progress and love getting stuff done (GSD)
  • You care about software and are proud of what you do; to you it’s more than a job

What You Will Do:

  • Deliver ambitious projects that interact with a wide variety of teams within the company
  • Design systems that will span multiple AWS regions to enable high availability of the platform
  • Help set technical direction for the entire infrastructure team at the 1+ year horizon

Bonus Points:

  • Experience with Istio and Docker
  • Design of robust distributed systems
  • Experience with AWS cloud infrastructure and its alphabet soup of technologies including EKS, RDS, ALB/ELB, IAM

About us:

Grand Rounds is a tech-driven healthcare company dedicated to raising the standard of healthcare for everyone, everywhere. By harnessing the power of technology, we connect nearly 6 million members to top-rated doctors and data-driven insights to make better informed healthcare decisions. Driven by premier thought leaders in patient care, technology, and business since 2011, our team of 900+ proudly serves Walmart, Costco, Salesforce, and over 140 of America’s top employers as a free employee benefit. Chosen as a 2019 Best Place to Work by Glassdoor and a 2020 UCSF Digital Health Award winner for Employer Wellness, Grand Rounds thrives at the forefront of technology-driven healthcare innovation. Learn more at https://grandrounds.com/

Grand Rounds is an Equal Opportunity Employer and considers applicants for employment without regard to race, color, religion, sex, orientation, national origin, age, disability, genetics or any other basis forbidden under federal, state, or local law. Grand Rounds considers all qualified applicants in accordance with the San Francisco Fair Chance Ordinance.

Job tags: AWS Docker Go High availability Kubernetes Python
Job region(s): Remote/Anywhere