Site Reliability Engineer

Service Reliability Engineer (SRE) / DevOps Engineer

A great opportunity for an SRE to work with one of Europe's largest tech companies.

The Service Reliability Engineer combines excellent software engineering and IT operations skills with database expertise. This role helps software development teams deliver applications reliably and consistently by developing standardized automation to control, build, package and deploy managed services.

Roles and responsibilities of the assignment:

  • Automation design & development: Work jointly with stakeholders to propose and design reusable, fault-tolerant and QA-proven automation. Participate in the coding of automation and lead the development of automation artefacts.
  • Work closely with technical leads and developers to assist in the design of databases to support business applications, ensuring system scalability, security, performance, and reliability.
  • Technological evolution: Support projects that aim to introduce improvements to the applications owned. Actively contribute to the migration of applications to the public cloud (Azure).
  • Incident and Problem Management: Drive and support rapid resolution of incidents by involving relevant stakeholders and directing the incident recovery process. Use automation and tooling directly where applicable.
  • Work independently on technical and functional tasks and guide team members in one or more areas of work. Be open-minded about acquiring new skills and able to carry out tasks by soliciting help from relevant groups.

Relevant Work Experience:

  • Solid experience in DevOps with a software development background (Python, C++, Go, GitHub/Bitbucket, Jira, Confluence)
  • Cloud experience and good understanding of Kubernetes and OpenShift
  • Hands-on experience deploying, testing, and building CI/CD pipelines
  • Experience working with monitoring and logging systems, particularly Splunk, Prometheus & Grafana
  • Excellent analysis, debugging, root-cause identification, and troubleshooting skills
  • Hands-on experience with Oracle databases and willingness to increase expertise (OCA or OCP certification is a plus)
  • Strong experience in Linux/Unix-based environments
  • Excellent interpersonal skills, self-motivation and the ability to work in a matrix environment

Skills

Computing: Linux/Unix, Datastore Technology, Azure

Languages: Required: Python, Ansible; Desirable: C++, Go

Specific knowledge: Automation design & automation QA, Public Cloud, monitoring

Other: Performance Optimization & Efficient Algorithms, Agile

Company
mthree
Location
West Drayton, Nottinghamshire, UK
Posted