Description Data System Reliability Engineer (dSRE) Role Overview: A crucial role in CME's Cloud data transformation, the data SRE will be aligned to data product pods ensuring the our data infrastructure is reliable, scalable, and efficient as the GCP data footprint expands rapidly. Accountabilities: Automate data tasks on GCP Work with data domain owners, data scientists and other … data engineers, data scientists, and software engineers to understand data requirements, troubleshoot issues, and support data-driven initiatives. Continuously assess and improve data infrastructure and data processes to enhance reliability, efficiency, and performance. Maintain clear and up-to-date documentation related to data systems, configurations, and standard operating procedures. Qualifications: Bachelor's or Master's degree in Computer Science … Software Engineering, Data Science or related field, or equivalent practical experience Experience as a Data SiteReliability Engineer or a similar role, with a focus on data infrastructure management Proficiency in data technologies, such as relational databases, data warehousing, big data platforms (e.g., Hadoop, Spark), data streaming (e.g., Kafka), and cloud services (e.g., AWS, GCP, Azure). More ❯
Belfast, Northern Ireland, United Kingdom Hybrid / WFH Options
CME Group
exclusive features. CME Group is the world's leading and most diverse derivatives marketplace, offering futures and options across a wide range of industries. We are seeking a passionate SRE to join our dynamic team. The Application SiteReliability Engineer III - Markets will help ensure the reliability and performance of our Markets trading and real-time post … incidents, identifying and mitigating root causes. Collaborate with cross-functional teams to improve system performance and efficiency Act as a mentor to L2 and L1 resources in the Application SRE department. Skills Experience with Linux-based systems and Cloud-based platform(s). Experience and knowledge of working with distributed systems and working with Docker & Kubernetes Exposure to working with … leader in financial services technology. Work on cutting-edge technology in a collaborative and innovative culture. Competitive compensation and benefits package. Opportunity to grow and advance your career in SRE with an organisation who is transforming to this approach Join CME Group and play a crucial role in ensuring the stability and performance of our Markets applications while contributing to More ❯
Social network you want to login/join with: SiteReliability Engineer (Equity only 0.5%), belfast col-narrow-left Client: Location: belfast, United Kingdom Job Category: Other - EU work permit required: Yes col-narrow-right Job Views: 3 Posted: 10.06.2025 Expiry Date: 25.07.2025 col-wide Job Description: Company Description : Luupli is a social media app that has equity … and the planet. Our team is made up of passionate and dedicated individuals who are committed to making Luupli a success. Role Description : We are seeking an experienced Senior SRE/DevOps Engineer to play a key role in our cloud migration initiative from AWS to Google Cloud Platform (GCP) for our high-traffic social media application. You will be … complies with relevant security standards and policies, implementing practices like VPC configuration, IAM best practices, and Cloud Armor for protection. Required Qualifications : Experience: 5+ years of experience in DevOps, SRE, or cloud operations. Deep expertise in AWS services and hands-on experience with Google Cloud Platform services (Cloud SQL, GKE, Cloud Functions, Cloud Storage, Cloud CDN, etc.). Containerization & Orchestration More ❯
Belfast, Northern Ireland, United Kingdom Hybrid / WFH Options
JR United Kingdom
s leading financial institutions to streamline international payments and ensure compliance at scale - all through smart automation and modern cloud-native infrastructure. They’re looking to bring on a SiteReliability Engineer with deep experience in observability . If you’ve worked with tools like Prometheus in AWS , supported development teams with tracing and performance insights , and thrive … great next step. What You’ll Be Doing: Managing and improving observability tools like Prometheus, Grafana, and CloudWatch Helping product teams with tracing and monitoring to improve performance and reliability Defining and improving … SLIs/SLOs , automating tasks, and reducing operational noise Working with AWS (EKS, EC2, Lambda, RDS), Terraform, and CI/CD tools What They’re Looking For: Experience in SRE or DevOps roles in a production environment Strong knowledge of observability tools , especially Prometheus in AWS Experience with tracing , metrics, and logs to support development teams Skills in Python or More ❯
An amazing Global Investment Client of ours located in Central London is looking for a SiteReliability Engineer to join their team on a permanent basis. This is a rare opportunity and the package offered for this role is up to £300k depending on skills and experience. ABOUT THE COMPANY The company is a leading provider of alternative … think innovatively creating an environment that is fast-paced, dynamic, and successful. ABOUT THE ROLE They are looking for an enthusiastic SiteReliability Engineer to join the SRE team in London. Their team is central to the business as they are responsible for the technology that underpins everything they do; therefore, you will have a direct impact on … be passionate about improving reliability and removing toil by identifying opportunities for automation and building platforms to make the systems more 'reliable by default'. Responsibilities: Evangelise the SRE mindset and implement best practices across the environment. Understand the business and find ways to measure and enhance resilience across the application estate. Eliminate the toil that emerges with complex More ❯
seeking a highly motivated and skilled Application Tools Support Engineer to join our team. This role combines traditional L2 support responsibilities with a SiteReliabilityEngineering (SRE) mindset to ensure the stability, performance, and continuous improvement of our critical DevOps toolchain, including emerging Generative AI (GenAI) tools. Key Responsibilities: The Application Support Senior Analyst provides technical and … source code management systems) and GenAI-powered tools. Troubleshoot and resolve complex technical issues involving in-depth analysis of logs, configurations, system behaviour. Proactively monitor the health, performance, and reliability of supported tools, identifying and addressing potential issues before they impact users. Participate in incident response, performing root cause analysis, and implementing preventative measures to minimize future occurrences across … Experience installing, configuring or supporting business applications. Issue tracking and reporting using tools eg. ServiceNow Knowledge/experience of problem Management Tools. Proven experience in a technical support or SRE role, preferably supporting DevOps tools and technologies. Experience with GenAI or Machine Learning tools is a plus. Strong understanding of DevOps principles and practices, including CI/CD, infrastructure as More ❯
Social network you want to login/join with: SiteReliability Engineer (SRE), Belfast col-narrow-left Client: Location: Belfast, United Kingdom Job Category: Other - EU work permit required: Yes col-narrow-right Job Reference: 8d391ab2c9be Job Views: 5 Posted: 17.06.2025 Expiry Date: 01.08.2025 col-wide Job Description: We are seeking a SiteReliability Engineer (SRE … to join an innovative and fast-growing company in Belfast. This role focuses on ensuring the reliability, scalability, and performance of critical infrastructure and services while working with cutting-edge cloud-native technologies. You'll collaborate with engineering teams to enhance system resilience, streamline deployments, and drive automation. What You'll Do Design, build, and optimise resilient, high … Drive continuous improvements in developer experience (DevEx) and operational efficiency. Participate in incident response, root cause analysis, and post-mortem reviews. What You'll Need 3+ years in an SRE, DevOps, or Infrastructure Engineering role in a high-scale environment. Strong experience with Kubernetes and container orchestration. Deep knowledge of cloud platforms and distributed systems. Proficiency with databases and More ❯
Social network you want to login/join with: Currently for this role we are only able to employ UK citizens Oracle Exadata Database team is building a new cloud native, Autonomous, multi-tenant, database optimized, smart storage cloud service. More ❯
modern technologies, embracing Infrastructure as code at all levels with automation as a core requirement for all projects. We are looking for an Observability Engineer to work within our SRE teams to design, build and iterate on our O11Y platform. This engineer will have to work both hands on and strategically with our architects, global service delivery and product teams … organized in a multi-tasking environment. Self-starter. Outstanding collaboration and communication, and documentation skills with a proven ability to work cross-functionally. • BS/MS in computer science, engineering, or a related technical discipline or equivalent experience. Applicants must have a valid work permit in the UK. In line with Thales' Baseline Security requirements, candidates will be asked More ❯
MCS Group is working with one of their closest clients as they seek to appoint a SiteReliability Engineer to their growing team. An award winning business which has seen exponential growth over the last 2 years off the back of their transformative technology being utilised by organisations across the UK and Ireland and beyond. They've grown … required. Strong knowledge of Linux, Windows, and IP networking, covering routing, DNS, firewalls, and load balancing. Commercial experience with Docker, Kubernetes, and container orchestration. Familiarity with Elasticsearch. Understanding of SRE principles, DevOps, and DevSecOps methodologies. Strong problem-solving skills, attention to detail, and the ability to work autonomously. Full right to work in Ireland or UK. The client is unable More ❯
modern solutions that improve efficiency, speed, and time-to-market. Developer Experience Enhancement: Drive organizational and cultural change to enhance developer productivity through automation, improved workflows, and modern software engineering principles. Value Optimization: Help clients maximize the return on their current IT service management tools and frameworks. Future-Ready Roadmaps: Design progressive strategies that integrate next-generation technologies such … transformation programs—redefining processes, tools, and technical architectures. Experience required: We’re looking for passionate individuals with the following qualities and experience: Expertise in designing lean, automated processes across engineering, operations, and product functions. Strong client-facing experience with the ability to coach and lead agile and … DevOps transformations. Ability to define and track performance metrics (OKRs/KPIs), and implement product-focused operating models. Deep knowledge of cloud-native technologies, DevSecOps, CI/CD pipelines, SRE practices, and observability tools. A collaborative mindset with a strong understanding of cloud architecture in hybrid and multi-cloud environments. #J-18808-Ljbffr More ❯
Belfast, Northern Ireland, United Kingdom Hybrid / WFH Options
Citigroup Inc
and support to the Commodities trading business (Trading, Operations, Middle Office, Downstream partners) . Deliver efficiency and stability through automation, failover testing and incident and problem management lifecycles. Use SiteReliabilityEngineering methods to improve availability and performance of applications. Deliver TOIL reduction via automation, resiliency and observability. Manage, triage, communicate and resolve live business impacting incidents. … and capacity with proposed solutions. Work to implement scalable solutions from a tooling catalogue. What we’ll need from you: Relevant experience in an Application Support role, ideally with SiteReliabilityEngineering experience. Knowledge/experience of Problem Management Tools and the Incident management process. Demonstratable experience in providing Automation and TOIL reducing solutions. Ability to demonstrate More ❯
Belfast, Northern Ireland, United Kingdom Hybrid / WFH Options
Jobs via eFinancialCareers
and support to the Commodities trading business (Trading, Operations, Middle Office, Downstream partners) . Deliver efficiency and stability through automation, failover testing and incident and problem management lifecycles. Use SiteReliabilityEngineering methods to improve availability and performance of applications. Deliver TOIL reduction via automation, resiliency and observability. Manage, triage, communicate and resolve live business impacting incidents. … and capacity with proposed solutions. Work to implement scalable solutions from a tooling catalogue. What we'll need from you: Relevant experience in an Application Support role, ideally with SiteReliabilityEngineering experience. Knowledge/experience of Problem Management Tools and the Incident management process. Demonstratable experience in providing Automation and TOIL reducing solutions. Ability to demonstrate More ❯
Belfast, Northern Ireland, United Kingdom Hybrid / WFH Options
JR United Kingdom
Digital Architecture/Transformations & NextGen AI Solutions, with offices in London, Manchester, Newcastle and Glasgow (offering a 'flexi working model' of remote first or hybrid + ad hoc client site travel when required). Clients include Government, Finance, Retail & Energy sectors (among other private sectors). They are looking for a Managing Consultant - DevOps/Cloud Advisory Consultant to … Proven experience in 'DevOps Advisory', creating DevOps/Cloud Best Practice recommendations, strategies and solutions Proven experience in lean process design and IT/business automation across product management, engineering, and operations. Ability to define OKRs/KPIs … set up processes to measure and improve speed, productivity, and quality, and implement product-centric operating models. Solid understanding of hybrid/multi-cloud environments, DevOps, CI/CD, SRE, DevSecOps models, DevX, build and deployment pipelines, observability, and ITIL. Proven experience leading/managing/mentoring a team of DevOps/SRE/Platform professionals. Presales/Sales experience More ❯
will also interact with customers, share knowledge and mentor those around you. Your key responsibilities will include: Working as part of a team - You'll work alongside colleagues in engineering, testing, consulting, product management and security capabilities to build, test and deploy software of the highest quality, including the operation and continuous improvement of underlying platform services. AWS Expertise … a broad know-how around how applications and services are constructed using the AWS platform with Terraform. Designing, building, testing, automating, monitoring and/or operating, enhancing and improving reliability of modern digital service platform in production environments. Using the latest Continuous Delivery and automation techniques for releasing operationally ready software to production, including platform orchestration using multiple configuration … Building and/or supporting modern digital service platforms to include architecture, design, estimation, planning and iteration on MVP using a modern agile approach. Championing best practice use of SiteReliabilityEngineering tooling, techniques and processes to manage and improve existing platform services. E.g. monitoring, platform modernisation and cost optimisation. Use of AWS cloud platform including SaaS More ❯
Belfast, Northern Ireland, United Kingdom Hybrid / WFH Options
JR United Kingdom
/GitLab) Skilled in multiple programming languages, ideally including: .Net (C#) PowerShell T-SQL Deep understanding of the software development lifecycle and Agile methodologies Proven experience in DevOps/SRE practices, CI/CD pipelines and Infrastructure as Code (Terraform, Bicep, ARM) Solid understanding of API development and modern cloud architecture principles Relevant certifications (Azure, DevOps, Cloud Adoption Framework) Salary More ❯
SiteReliability Engineer - Healthcare Technology Location: Belfast, United Kingdom Job Type: Full-time, Permanent Work Arrangement: Hybrid Client: iO Associates - UK/EU EU Work Permit Required: Yes Job Views: 2 Posted: 06.06.2025 Expiry Date: 21.07.2025 Job Description We're working with a leading healthtech company to find a SiteReliability Engineer to support and … optimise the platforms behind critical clinical systems. This role offers flexibility, technical challenge, and the opportunity to make a direct impact on healthcare delivery. You will join a collaborative SRE team focused on maintaining cloud and on-premise environments, improving deployment pipelines, reducing manual work, and supporting project delivery. You will work closely with internal teams across software development, support … and delivery. Key Technologies Linux Windows Server We are seeking enthusiastic individuals with experience in SRE or DevOps roles, especially in environments utilizing containerised and cloud-based applications. Strong communication skills and the ability to work across teams are essential. Applicants must have the right to live and work in the UK. If you are interested in making a difference More ❯
Belfast, Northern Ireland, United Kingdom Hybrid / WFH Options
A&O Shearman
You will stand out if you bring Experience with the Elite 3e application. Experience with the Intapp suite of applications (Open/Conflicts/Walls). Experience in a SiteReliabilityEngineering or related role. What we can offer you We recognise that our people are our most valuable asset, which is reflected in the wide range More ❯
the opportunity to play an integral role across the whole customer journey. Responsibilities Managing the rollout of Cloudsmith across enterprise businesses Building relationships with devOps, platform engineering, and SRE teams to optimise deployments and adopt industry best practices for supply chain security Working closely with development, operations, and product teams internally to align on customer needs Analysing product usage More ❯