Senior Site Reliability Engineer

India, Hyderabad

Fanatics Inc logo
Fanatics Inc
Apply now Apply later

Posted 1 month ago

Company Overview
Fanatics is the global leader in licensed sports merchandise and changing the way fans purchase their favorite team apparel and jerseys. Through an innovative, tech-infused approach to making and selling fan gear in today's on-demand culture, Fanatics operates more than 300 online and offline stores, including the e-commerce business for all major professional sports leagues (NFL, MLB, NBA, NHL, NASCAR, MLS, PGA), major media brands (NBC Sports, CBS Sports, FOX Sports) and more than 200 collegiate and professional team properties, which include several of the biggest global soccer clubs (Manchester United, Real Madrid, Chelsea, Manchester City). Fanatics offers the largest collection of timeless and timely merchandise whether shopping online, on your phone, in stores, in stadiums or on-site at the world's biggest sporting events.
About the Team
Fanatics is first and foremost a technology company. We are powered by cutting-edge tech created by our small agile teams using the latest tools and technologies under our highly analytical, forward thinking, and open-minded leadership. As the global leader in licensed sports merchandise, we challenge ourselves by improving our new fully responsive NodeJS cloud commerce platform, Elasticsearch engine, and deep data science capabilities while building the best-in-class retail manufacturing and supply chain technologies. Our tech teams work together to revolutionize data science and engineering initiatives, provide highly scalable real-time and streaming platforms, and create secure e-commerce and in-stadium fan experience products. Our own e-commerce platform transacts in over 190 countries, 17 languages, and 14 currencies. Our motto is “#GSD”—get stuff done—and we do just that. If you want to be at the nexus of sports, commerce, and technology, come be a part of our industry-leading team here at Fanatics Tech.
As a member of our Reliability Team, you will help drive collaboration across departments and provide global insights by being the consistent eyes on production.  You will work with application, infrastructure and product teams to ensure smooth launches while meeting proper security gates. You will have the chance to skill-up on communicating for global teams and help bring standards to teams with unique SDLCs.  
This group runs extremely high on constant learning and shared education to avoid silos. To be most effective, you will want to have a solid grasp of engineering principles, infrastructure design, and a mature background in iterative product delivery. 

On the Team you will participate in:

  • Driving Agile teams, Kanban and Scrum, to support both interrupt and project work.  
  • Estimating and delivering projects on-time and within budget through scope shifting and solid communication.
  • Analysis of trends or Golden KPIs to provide useful feedback on anomalies.
  • Building ideas to shift conversations from outage/retro or symptom/prevention to prevention. 
  • Partnerships with Platform teams to remove drag for cross-org engineering (Knowledge of Gitlab, Github/Git, Ansible, Terraform, Kubernetes, Docker, Vault, Consul).
  • Documenting and training others on your team, and providing training and demos.
  • Iterating over improving Change, Release, Incident and Patching processes, with the goal of making them non-events.
  • Optimizing debug procedures for production issues across a variety of technical stacks.
  • Enforcing standards through communication, on design, implementation, security, and change tracking.
  • Building a brave team without a fear of using their voice.  “See something, say something”; “Don't know, ask”; “SRE Silence costs money”.. 

You will be charged with having:

  • A Bachelor’s Degree, or 7-10+ years in eCommerce system delivery.
  • Expertise in at least two areas of application and infrastructure engineering.
  • A strong knowledge of AWS Technologies and a willingness to self-teach with change.
  • Experience with Continuous Delivery and Deployment Automation (Our env: Ansible, Gitlab, Git/Github, Artifactory, Terraform)
  • Capabilities in design and delivery, bringing in projects on budget.
  • An understanding of capacity planning and how to set appropriate limits to optimize cost and performance.
  • Knowledge of identifying system scale, backoff or other throughput challenges to help prevent incidents or resolve them quickly.
  • Experience with performing to metric, SLI/SLO/SLA(s), and making meaningful commitments to customers.
  • History with product behavior, edge cases, failure modes, negative boundary behaviors, load mishaps, etc.., to stop issues before they enter production.
  • A history of building and supporting multiple versions of Linux, and Windows OS.
  • A strong set of programming skills in a preferred language; Go, Python, Java (in that order).
  • An understanding of capacity planning and how to set appropriate limits to optimize resources.
  • Led and participated in blameless RCAs, and have an understanding of the needed outcomes.

As a more senior member on the team, you may:

  • Help to design processes which enable the team to move quickly, and constantly improve speed and quality as a team.
  • Be involved in the hiring process.
  • Knowledge share, or mentoring Jr. teammates.
  • Be accountable to be self aware, handling conflicts and taking feedback in stride.
  • Work to maintain constructive relationships with other teams and de-escalate conflicts when necessary.
  • Lead Reliability Engineers by setting the example we want followed.
  • Drive a culture of ownership and family for the team as we ensure the success of the Company.
Tryouts are open at Fanatics! Our team is passionate, talented, unified, and charged with creating the fan experience of tomorrow. The ball is in your court now.
Job tags: Ansible AWS Docker Elasticsearch Git Gitlab Go Infrastructure design Java Kubernetes Linux Python Streaming Terraform Vault Windows