Site Reliability Engineer (Hybrid)

Site Reliability Engineer (Hybrid)

Our London-based client is hiring an SRE having 3+ years of experience in a similar role.

You will be required to report to an office based in London.

Sponsorship is not provided for this role.

 

Responsibilities

  • Build and maintain software and systems to manage platform infrastructure and applications.
  • Work to implement security, reliability, monitoring and performance improvements across local, test, and production infrastructure
  • Work alongside the Head of Platform and Senior Platform Engineers to take platform engineering projects from inception to completion, providing feedback to the team and business as required.
  • Work closely with our development teams, guiding focus on platform technologies for projects.
  • Work closely with the wider business to provide support for infrastructure services.
  • Manage deployments to all environments.
  • Respond to, diagnose and resolve unexpected platform issues.
  • Participate in the on-call rota and provide support during major incidents / emergencies.
  • Communicate incidents in a timely and clear way internally and externally as required.
  • Assist in IT support across the business as required.

 

Minimum Requirements:

  • A strong understanding of how the web works, and its core concepts.
  • 2+ years of Linux system administration (ideally RedHat / CentOS / Fedora).
  • Good understanding of Bash scripting.
  • Experience using Nginx to manage web servers.
  • Experience writing (My)SQL to query complex databases, and an understanding of MySQL replication principles.
  • Thorough knowledge of source control with Git and GitHub.
  • Configuration management or IaC experience (ideally Ansible or Terraform).
  • Understanding of both caching and event queueing concepts
  • Basic knowledge of networking protocols and concepts, including DNS.

 

Bonus Requirements:

Whilst not required, these skills are a bonus:

  • Development skills in any of JavaScript/PHP/Python/Ruby.
  • Previous exposure to the unique issues created at scale.
  • VMWare or virtualisation experience.
  • Docker or containerisation experience.
  • Previous usage of AWS.
  • Previous usage of Cloudflare.
  • Experience with any of Grafana, Prometheus, Memcached, NAXSI, Redis, Sphinx, Zabbix.
Working Type: Hybrid
Job Location: London
Job Type: Full Time

Allowed Type(s): .pdf, .doc, .docx