Site Reliability Engineer

Apply Now

Mid-Level Site Reliability Engineer (SRE)

Are you an experienced Site Reliability Engineer with a passion for building reliable, scalable systems that empower innovation? Our client is looking for a skilled Mid-Level SRE to join our growing technology team.

In this role, you’ll help ensure our infrastructure is stable, secure, and efficient - supporting the applications that drive support our clients.

The Role

We are seeking a mid-level Site Reliability Engineer (SRE) to join our technology team, helping to ensure the smooth operation and reliability of our infrastructure. You’ll play a vital role in maintaining uptime, managing deployments, and supporting other team members.

This is a hands-on position suited for someone who thrives on problem-solving, process improvement, and cross-team communication.

What You’ll Do:

Maintain & Improve Systems

Ensure the reliability, performance, and availability of production systems.
Perform regular updates, patching, and maintenance across environments.
Manage infrastructure provisioning using Terraform, Ansible, and AWS.

Collaborate & Support

Work closely with the junior SRE to develop their practical experience and technical confidence.
Partner with developers, data scientists, and business users to resolve technical issues.

Automate & Optimise

Contribute to configuration management and automation improvements.
Identify and document standard operating procedures.
Implement proactive monitoring measures to detect and prevent issues.

Monitor & Troubleshoot

Troubleshoot system issues using logs, monitoring tools, and a methodical approach.
Oversee and enhance system monitoring with Nagios, with a transition to Datadog.

Incident Management

Support incident management processes, including post-mortems and follow-up actions.
Communicate outcomes with customers clearly and effectively.

What We’re Looking For:

Experience

Proven experience in an SRE, DevOps, or Operations Engineering role.
Strong working knowledge of AWS, Terraform, and Ansible.

Technical Skills

Linux system administration & shell scripting.
Networking fundamentals, containerization, and infrastructure security best practices.
Version control experience (e.g., Git). Strong troubleshooting and root cause analysis skills.

Desirable Skills

Experience with Kubernetes and/or other cloud platforms.
Familiarity with Nagios, Datadog, or similar monitoring tools.
Exposure to CI/CD systems such as TeamCity, AWS CodeBuild, AWS CodePipeline, or ArgoCD.

Personal Attributes

Proactive, curious, and process-driven.
Enjoys collaboration and mentoring.
Calm under pressure, especially during incidents.
Flexible and adaptable to technical and business priorities.

Nice-to-Have

Experience supporting scientific or data-intensive applications.
Background in post-mortem facilitation and follow-up.
Enthusiasm for observability, performance tuning, and cost optimisation.

Company: BOSS Professional Services LTD
Location: Macclesfield, Cheshire, England, United Kingdom
Employment Type: Full-Time
Salary: £50,000 - £60,000 per annum
Posted: 5 hours ago

Apply Now

Company: BOSS Professional Services LTD
Location: Macclesfield, Cheshire, England, United Kingdom
Employment Type: Full-Time
Salary: £50,000 - £60,000 per annum
Posted: 5 hours ago