CD pipeline management and DevOps practices. Strong understanding of disaster recovery and business continuity planning. Experience with performance tuning and capacity planning. Understanding of chaosengineering principles and practices. Skills in cost optimization for cloud infrastructure. Specific Tools and Techniques: Experience in using cloud native monitoring tools like More ❯
CD pipeline management and DevOps practices. Strong understanding of disaster recovery and business continuity planning. Experience with performance tuning and capacity planning. Understanding of chaosengineering principles and practices. Skills in cost optimization for cloud infrastructure. Specific Tools and Techniques: Experience in using cloud native monitoring tools like More ❯
Newcastle Upon Tyne, Tyne And Wear, United Kingdom
慨正橡扯
and automating financial processes so they can thrive in a digital world. We're looking for a QA Engineer to join our Site Reliability Engineering (SRE) team within Cloud Services Engineering & Operations. In this role, you'll play a key part in ensuring the reliability, performance, and resilience … automation frameworks for cloud infrastructure and application reliability. Support incident response efforts by diagnosing, documenting, and resolving production issues. Drive resilience testing strategies, including chaosengineering, to strengthen system robustness. Participate in Agile ceremonies, contributing to continuous improvement initiatives. Coordinate User Acceptance Testing (UAT) to ensure smooth product … with test automation frameworks and scripting languages (Python, Java, JavaScript). Experience with containerized applications (Docker, Kubernetes). Exposure to resilience testing tools (e.g., Chaos Monkey, Gremlin). Familiarity with Infrastructure as Code (IaC) tools (Terraform, CloudFormation). More ❯
take ownership of areas like: Observability tooling Security validation Recovery procedures Performance & load testing Disaster recovery planning Be the voice of risk in the engineering organisation—assessing, articulating, and influencing decisions around testing and risk. You’ll be empowered to draw a hard line when needed. Act as a … including: Coaching and mentoring QA Engineers (4–6 across 2 scrum teams) Running workshops to promote best practices Influencing the wider quality culture and engineering mindset Leading non-functional test coverage improvements Experience Needed You consider yourself a leader first, manager second. You’ve followed a hands-on QA … re confident across the QA spectrum—this isn’t a pure performance testing role, but depth in that area is a bonus. Experience with chaosengineering (especially Azure Chaos Studio) is highly desirable but not essential. Exposure to AI/ML-powered products is a strong plus More ❯
Monitor, Log Analytics, App Insights, and Prometheus/Grafana . Design for high availability (HA), disaster recovery (DR), and business continuity (BCP) . Conduct chaosengineering to test resilience and fault tolerance. Work closely with development, security, and operations teams to drive cloud initiatives. Provide technical mentorship and More ❯
and Amazon EKS. Preferred qualifications include: Prior involvement in the Fintech sector or other regulated industries. Familiarity with the Grafana observability stack. Experience in ChaosEngineering methodologies. About Convera Convera is the largest non-bank B2B cross-border payments company in the world. Formerly Western Union Business Solutions More ❯
and Amazon EKS. Preferred qualifications include: Prior involvement in the Fintech sector or other regulated industries. Familiarity with the Grafana observability stack. Experience in ChaosEngineering methodologies. Your expertise will be instrumental in fortifying our infrastructure and delivering exceptional reliability to our customers. About Convera Convera is the More ❯
hard skills Triage, troubleshoot and resolve issues using golden signals and go past golden signals Go past golden signals with additional principles such as chaosengineering to detect failure points and lead Game days for testing resiliency of team when it comes to incident response and remediations and … synthetic monitoring. Lead SRE team members to create and maintain Recovery Procedures, RCA's in collaboration with other engineering teams. Ensure Incidents assigned to the team are being managed within agreed SLAs Ensure alarms are documented in up to date Knowledge Base Articles. Ensures Production infrastructure is up to … administration in Amazon AWS. Tax/Accounting domain experience Bachelors or Master's in Computer Science discipline. 5+ years' experience focussed on Site Reliability Engineering or related position in AWS Cloud Platform. At least 2 AWS Certifications are must. (AWS Sysops Admin and Architects certifications preferred). Experience working More ❯
to have: Experience in Cloud technologies, particularly the AWS stack. Experience in non-functional testing such as vulnerability, compatibility, usability and performance testing. Also chaos engineering. Experience in RESTful API testing. Experience in writing SQL. Experience in using industry-standard tools and frameworks that can simulate complex multi-system More ❯
looking to broaden their skillset. After developing your specialist skills you are now looking for opportunities to grow and learn more about wider resilience, chaosengineering and cloud services - we will support, provide guidance and mentor you. Nevertheless, we are open to other experiences as we are creating More ❯