Senior Software Engineer, ML Infrastructure

Seattle, WA

Full Time Senior-level / Expert
TuSimple logo
TuSimple builds the world's largest Autonomous Trucks and most advanced Driverless vehicles. Find self-driving freight trucking, Long Haul, Semi-trailer and autopilot systems.
Apply now Apply later

This role is open to remote work or relocation to San Diego, CA. Relocation assistance is provided.

For immediate consideration please email your resume to:

Company Overview

Come join a higher calling and find a deeper purpose!   

As a multi-national Artificial Intelligence Technology Company, we are at the epicenter of the Autonomous Vehicle Universe. Our breakthroughs are leading the industry in autonomous trucking.  

While inventing the framework of Autonomous Driving, our current fleet of autonomous Trucks are helping communities receive much-needed supplies and medical equipment around the clock. Our people are some of the most talented engineers and contributors who are leaving behind a historic legacy.  

TuSimple was founded half a decade ago with the goal of bringing the top minds in the world together to achieve the dream of a driverless truck solution. With a foundation in computer vision, algorithms, mapping, and Artificial Intelligence, TuSimple is working to create the first global commercially viable autonomous truck driving platform!

About us

In just a few short years, TuSimple has undergone a meteoric rise to become the premier operator of autonomous heavy trucks today. We are building a global autonomous freight network atop a foundation of state-of-the-art advancements in AI research, with publications in leading journals being brought to life today in the autonomous agents they power on the roads we all drive on. 

Job Description

The ML Infrastructure team’s mission is to provide reliable, scalable and self-improving machine learning platform solutions to support the development of critical vehicle systems. As a Senior Software Engineer, ML Infrastructure, you will work alongside some of the brightest minds in the world to address unsolved problems on the bleeding edge of applied AI research. You will build hardened, highly available and fault-tolerant full-stack systems to support, challenge and improve critical AI modules in on-vehicle perception. This will include: 

  • Creating and maintaining cloud-based distributed algorithm training environments in a way that is maintainable, scalable and cost effective;
  • Creating and maintaining scalable full-stack solutions for the versioning and retrieval of models, training data, benchmarks, and challenge datasets;
  • Creating automated AI tools that will push our critical autonomous systems to the next level, as they stress-test safety, reliability and consistency across real-world scenarios.

This role is a unique opportunity to parlay your back-end software engineering talents into substantial machine learning and AI experience, working on uncharted real-world problems that will not simply “disrupt” the world of posting cat pictures online, but actually change the transportation industry forever. 

Required Skills

  • Strong verbal and written communication skills;
  • Proficiency in Python or C++;
  • Minimum 3-5 years experience writing and documenting full-stack APIs or SDKs for other developers;
  • Minimum 3-5 years experience with modern version control systems (Git, TFS, Mercurial etc.); 
  • 3+ years experience building and deploying Docker containers and orchestration frameworks (Kubernetes, Mesos, Amazon EKS etc.);
  • 3+ years administrating and developing with distributed computing frameworks (Apache Spark, Flink, Hadoop, HDFS etc.);
  • 3+ years experience with cloud-based and on-prem big data storage solutions;
  • 3+ years of experience deploying software on cloud environments (AWS, Azure, GCP, Oracle Cloud Infrastructure, etc).

Desired Skills

  • Experience developing platforms or SDKs related to machine learning vizualization, training efficiency, experimentation, continuous integration, or active learning;
  • Experience provisioning or maintaining large scale ML tooling (Amazon SageMaker, Tensorflow-on-Spark, Horovod, Kubeflow);
  • Experience with Terraform and Ansible for infrastructure management;
  • Experience with AWS CloudWatch or other cloud management solutions;
  • Experience developing in a hybrid cloud environment; 
  • Extensive visualization experience with Vue, D3js, ThreeJS etc.


  • Visa sponsorship is available for this position 
  • Opportunity for professional growth and career advancement 
  • Competitive salary and benefits
  • Up to a 30% discretionary bonus. 
  • Daily breakfast, lunch, and dinner
  • Shape the landscape of autonomous driving
  • 100% Company paid Medical, Vision, and Dental insurance plan
  • Company 401(K) program
  • Company paid life insurance
  • Company paid education/training. 
  • Company paid gym membership.  

TuSimple is an Equal Opportunity Employer. This company does not discriminate in employment and personnel practices on the basis of race, sex, age, handicap, religion, national origin or any other basis prohibited by applicable law. Hiring, transferring and promotion practices are performed without regard to the above listed items.

This role is open to remote work or relocation to San Diego, CA. Relocation assistance is provided.

For immediate consideration please email your resume to:




Job region(s): North America
Job stats:  0  0  0
  • Share this job via
  • or

Explore more DevOps, Cloud and SRE career opportunities