Senior DevOps Engineer

Redwood City, CA

C3.ai logo
C3.ai
Apply now Apply later

Posted 2 weeks ago

C3.ai is a leading enterprise AI software provider for accelerating digital transformation. The comprehensive and proven C3 AI Suite uses a model-driven abstraction layer to enable organizations to develop, deploy, and operate enterprise scale AI applications 40x to 100x faster than alternative approaches. www.c3.ai 

C3.ai is hiring Cloud Computing DevOps Engineers at our beautiful campus in Redwood City, California. In this role, you will work alongside a tight-knit and talented engineering team to build and deploy AI applications for our customers. You will use your expertise to solve complex challenges and support the core of the C3.ai Platform. 

Meaningful work. Top technology. An award-winning culture and talented team. Join us at C3.ai! 

Your Responsibilities: 

  • Develop and test the cloud infrastructure to scale a rapidly growing C3.ai ecosystem
  • Design, deploy, and manage a massive scale, highly available, fault tolerant, multi-tenant SAAS product 
  • Develop and maintain Continuous Integration (CI)/Continuous Delivery (CD) pipelines on kubernetes 
  • Tier 1 point of escalation from Support for any service availability challenges reported by multitenant customers 
  • Improve automated cloud configuration, deployments, monitoring, management and incident response to support enterprise grade multi-tenant systems
  • Work cross-functionally with various teams to improve C3.ai infrastructure through automation
  • Build internal tools to demonstrate performance and operational efficiency
  • Work with other teams to resolve issues related to application configuration, deployment, or debugging
  • Provide documentation and training of duties to Operations, new staff and related groups
  • Provide system administration, configuration, and troubleshooting of the Linux environment

 Requirements: 

  • Bachelor’s degree in Computer Science, Electrical Engineering, or related field
  •  5+ years' experience in high-availability large-scale Kubernetes cluster deployments, operation, monitoring and maintenance  
  • Strong experience with log monitoring and management with tools including but not limited to, Splunk or Elastic 
  • Strong experience in automating Continuous Integration (CI) and Continuous Deployment (CD) and release management using tools such a Jenkins and Docker Registry 
  • Strong experience with metric monitoring and alerting tools such as Prometheus and Grafana 
  • Strong experience with Python, Bash, Jscript and automation tools (Chef, Puppet, Ansible,  etc.) 
  • 3+ years of experience with using and developing technologies like Cassandra, Spark, Relational Databases, Postgres, RedShift and Docker 
  • Experience with incident response automation tools such as PagerDuty 
  • Experience with Application Performance Monitoring principles and related tools such as NewRelic, Dynatrace, or AppDynamics 
  • Experience with Amazon EKS/Azure AKS or Openshift/Rancher is a plus 
  • Experience with securing cloud environments and monitoring for security breaches 
  • Experience with monitoring and reporting on cloud spend 
  • Experience reporting on Service Level Agreement (SLA) performance metrics such as service up time 
  • Proficiency in Linux administration, configuration, and automation tools
  • Working knowledge of Cloud platforms (AWS, Azure, Google and Cloud Platform)
  • Knowledge of performance benchmarking and diagnostic tools
  • Rigor in high code quality, automated testing, and other engineering best practices

Preferred 

  • Master’s degree in Computer Science, Electrical Engineering, or related field
  • Experience with Scala and Spark 
  • Working experience in deploying Spark on Kubernetes  

C3.ai provides a competitive compensation package and excellent benefits including:

  • Competitive salary, generous stock options, 401K, medical, dental, and vision benefits. At the office, we offer a fully stocked kitchen with catered breakfast and lunch, table tennis and pool table, free membership at our on-site gym, Friday evening social hours with food, drink and music and a fun team of great people.

C3.ai is proud to be an Equal Opportunity and Affirmative Action Employer. We do not discriminate on the basis of any legally protected characteristics, including disabled and veteran status. 

 

Job tags: Ansible AWS Azure Bash CD Chef CI Docker Grafana Kubernetes Linux Postgres Prometheus Puppet Python Redshift Scala Spark