Site Reliability Engineer
Site Reliability Engineer London (Blackfriars) – 7 monthsCertain Advantage are recruiting on behalf of our prestigious Financial Services client for an SRE Engineer in their AWS DB team who support numerous native DBs like RDS/Aurora/Neptune plus CockroachDB.This is a contract position for 7 months working inside IR35.Lead Site Reliability Engineers (SRE) play an important role in the operations, design and development of our client’s modern public cloud offering. You'll be required to apply your depth of knowledge and expertise to all aspects of the product development lifecycle, as well as partner continuously with your stakeholders on a daily basis to stay focused on common goals. The team embrace a culture of experimentation and constantly strive for improvement and learning. You’ll work in a collaborative, trusting, thought-provoking environment.Responsibilities
- Design, code, test and deliver software to improve our existing systems by adopting DevOps culture.
- Troubleshoot and manage incidents, communicate with stakeholders at all levels, facilitate blameless post-mortems, identify follow-up corrective and preventative actions to ensure permanent closure of incidents
- Actively participate in the development life cycle, ensuring reliability and scalability and operational stability
- Define, create and track application analytics in support of better service level objectives
- Ensure adherence to change management release processes, accelerate automation of these processes
- Run resiliency management planning, scheduling and execution of disaster recovery tests & seek to automate these activities where possible
- Covering on-call schedule when Production support is required outside of working hours
- Participate in enhancing product observability and telemetry, support modernization.
- Brainstorm ideas to simplify and streamline infrastructure by closely working with infrastructure and SRE teams.
- Knowledge of Python / Unix Shell scripting & SQL.
- Good understanding of development tools: source code control software, automated build, automated testing and JIRA.
- Understanding of IaC infrastructure as a code concept is desirable.
- Experience with build automation, test driven development, continuous integration and delivery
- Experience with Relational and non Relational Databases
- Previous SRE experience including knowledge about SLO/SLA/SLI and error budgets, is advantageous
- Experience working or familiarity with one public cloud (AWS, Google or Azure)
- Experience in application and system configuration management of a large fleet (1000's of nodes under management)
- Experience in the use of declarative frameworks (Puppet or Terraform)
- Expertise in leveraging APIs, and security, authentication and data structures
- Expertise in software design using Domain Driven Design, SOLID or GRASP
- Exposure to Micro service architecture, REST API design/development
- Exposure to Docker/ Kubernetes
- Knowledge of source code CI/CD integration
- Company
- Certain Advantage
- Location
- City of London, London, England, United Kingdom
- Employment Type
- Temporary
- Salary
- Salary negotiable
- Posted
- Company
- Certain Advantage
- Location
- City of London, London, England, United Kingdom
- Employment Type
- Temporary
- Salary
- Salary negotiable
- Posted