Site Reliability Engineer

Apply now

We're looking for a Site Reliability Engineer to join our growing team, someone who enjoys working collaboratively and who will be eager to learn about all aspects of our platform. Whether you're a Software Engineer working in a DevOps environment or a full time operations or sysadmin engineer doesn't matter to us, we want someone with experience of managing a cloud native platform, with the ability and enthusiasm to learn new things along the way. We're a small, friendly team with plenty of scope for individuals to make a real impact on the product and the way that we work. The technologies we are using include JavaScript, Ruby, Go, Kubernetes, MySQL, Stackdriver and Elasticsearch, running on AWS and GCP.

Skills & Requirements

Your responsibilities will include:

  • Improving the whole lifecycle of services - from inception and design, through deployment, operation and refinement
  • Support services before they go live through activities such as design reviews and capacity planning
  • Maintain services once they are live by measuring and monitoring availability, latency and overall system health
  • Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity
  • Practice sustainable incident response

The following attributes are critical to the role:

  • You have worked with at least one of AWS, GCP or Azure
  • You have experience of managing containers and Linux servers in a production environment
  • You can debug and troubleshoot problems across a distributed system
  • You exhibit a systematic problem-solving approach
  • You fundamentally will not accept doing things over and over by hand
  • You can communicate well with both engineers and other members of the business
  • You are comfortable coding in one or more of the following: Go, Ruby, Python or shell scripting

Any of the following experiences would be really useful:

  • Working within a Continuous Delivery focused environment
  • Mentoring other engineers
  • Administering MySQL databases
  • Management of an Elasticsearch cluster


For more information or to apply, contact

Check out our Engineering Blog on Medium

To learn more about Kudos, our culture and benefits see here