Federal Infrastructure Tech Lead

Washington, DC

Scale AI logo
Scale AI
Apply now Apply later

Posted 1 month ago

Open to candidates located in the San Francisco Bay Area
Scale is looking for a technical leader who wants to bring their passion for infrastructure to help build our world class products. In this role, you would be expected to lead the federal technical efforts. The mission of this team is to provide cutting edge, reliable and easy to use infrastructure for numerous Engineering, Data, and Analytics customers within Scale while meeting the requirements of our federal customers.


  • Use your strong technical background to discuss and guide on complex problems
  • Anticipate strategic and scaling-related challenges via thoughtful planning with your peers
  • Work hands on with the engineers/program managers to design new solutions and take on technical problems yourself
  • Create, build, educate, train and design cloud computing architectures for our Federal customers.
  • Work directly with our federal clients to create backend and infrastructure solutions to meet their challenging data and security needs.
  • Create abstractions of our core infrastructure which can scale to millions of humans and ML models working together.
  • Foster a collaborative, ambitious, and outcome-driven culture that embodies our values.
  • Propose, design, build, and deploy security improvements across scale’s federal environments.
  • Work with our advisors and third party vendors and auditors on pen tests and mitigations.
  • Build systems capable of handling millions of frames of data every day, making it available to both our workforce and our internal teams in a high availability way

This role could be a fit if you have

  • 5+ years of industry experience as a software engineer post graduation
  • Systems engineering experience with real-time and distributed system architecture.
  • Experience building systems that process large volumes of data.
  • Use your strong technical background to discuss and guide on complex problems
  • Experience or interest in using the following: AWS, Typescript, Node, Mongo, MLflow, Spark, Presto, Python (note that we are mostly language-agnostic and are open to using whatever is the best tech for the problem at hand)
  • At least a Bachelor’s degree (or equivalent) in a relevant field.
  • Live in the St Louis, DC, or SF metro areas and be willing to travel.
  • Have security clearance or the ability to hold security clearance. 

Nice to haves

  • Prior startup experience to help us grow responsibly
  • Experience with core AWS technologies such as VPC, EC2, ALB, ASG, Spot Instances
  • Experience in operating or managing Infrastructure such as Spark, Presto, Hive
  • Experience working with Docker, Kubernetes, and Infrastructure as code (eg terraform); especially for running GPU/ML workloads
  • Experience with compliance programs such as SOC2, ISO27001, FedRAMP, HIPAA, PCI or operating within a compliance driven environment.
  • Mentored and grown members of your team or been a tech lead on large projects
  • 2+ years experience leading successful backend, data, or infrastructure teams
  • Experience in operational product focused companies
About Us:At Scale, our mission is to accelerate the development of Machine Learning and AI applications across multiple markets. Our first product is a suite of APIs that allow AI teams to generate high-quality ground truth data. Our customers include OpenAI, Zoox, Lyft, Pinterest, Airbnb, nuTonomy, and many more.
Scale AI is an equal opportunity employer. We aim for every person at Scale to feel like they matter, belong, and can be their authentic selves so they can do their best work. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.
Scale AI is committed to working with and providing reasonable accommodations to applicants with physical and mental disabilities. If you need assistance and/or a reasonable accommodation in the application or recruiting process due to a disability, please contact us at accommodations@scale.com. Please see the United States Department of Labor's EEO poster and EEO poster supplement for additional information.
Job tags: AWS Docker EC2 High availability Kubernetes Node Python Spark Terraform
Share this job: