Site Reliability Engineer (SRE)

Lisbon/Porto, Portugal

Tillster logo
Tillster
Apply now Apply later

Posted 1 month ago

Site Reliability Engineer (SRE)

Lisbon or Porto, Portugal

High-level…

Our Site Reliability Engineer (SRE) is responsible for the availability, performance, monitoring, release engineering, and incident response, among other things the platforms and services the company runs and owns. SRE ensures that enterprise services have reliability and uptime appropriate to defined service levels. SRE’s are focused on optimizing existing systems, building cloud infrastructure, and eliminating manual work through automation.  

Down in the weeds…

  • Lead Tillster’s SRE product efforts, while working closely with cross functional teams/departments. 
  • Analyzing and troubleshooting large-scale distributed systems in the public cloud
  • Scale systems sustainably through mechanisms like automation and evolve systems by pushing for changes that improve reliability and velocity
  • Define and monitor top-level product KPIs and drive prioritization/strategy discussions leveraging both data and intuition.
  • Provide top of the line support for our high-growth, global product and engineering organization.
  • Proactively predict, triage, and debug system issues
    Job requirements

Required Skills & Experience…

  • Knack for catalyzing new system adoption through evangelism, education, and self-service functionality 
  • Implement best SRE practices in documenting and making improvements to infrastructure
  • Wants to help teams build pipelines and the tooling to keep them running.
  • Demonstrated technical aptitude 
  • BS. or M.S. in Engineering, Computer Science, technical degree, or equivalent work experience
  • Strong problem solving and strategic thinking skills 

Nice-to-Haves…

  • Experience with Kubernetes, Helm, Terraform, Ansible, and JenkinsX
  • Experience with AWS
  • Experience with systems thinking architecture and design
  • Strong knowledge of industry trends and innovations in cluster management and cloud technology.
Job tags: Ansible AWS Kubernetes Terraform
Share this job: