FX or crypto trading; front-end experience with React or similar frameworks is a plus. Collaborate with the team to implement, configure, and manage comprehensive monitoring, logging, alerting, and observability solutions - advocating for security best practices. Deploy, manage, operate, and scale applications and services on AWS - whilst troubleshooting performance issues across the stack. Collaborative, agile approach, passionate about clean architecture More ❯
or DevOps Expertise in microservices and API design Docker, and container runtime platforms such as Kubernetes, EKS, ECS etc Strong understand of operational concepts on AWS, particularly monitoring and observability, FinOps Utilising CI/CD tools, such as Bamboo, Jenkins, TeamCity, Bitbucket, in order to streamline delivery of new features and fixes Continual testing of code using Automated Testing Frameworks More ❯
FX or crypto trading; front-end experience with React or similar frameworks is a plus. Collaborate with the team to implement, configure, and manage comprehensive monitoring, logging, alerting, and observability solutions - advocating for security best practices. Deploy, manage, operate, and scale applications and services on AWS - whilst troubleshooting performance issues across the stack. Collaborative, agile approach, passionate about clean architecture More ❯
Join us as a Cloud Observability Engineer at Barclays, where you will lead our enterprise observability strategy across multi-cloud environments. This senior role combines technical leadership with team management, driving operational excellence while architecting resilient solutions and mentoring high-performing teams. To be successful as a Cloud Observability Engineer, you should have experience with The ability to lead and … scale technical teams in multi-faceted governance environments AWS/Azure cloud platforms and enterprise observability tools (Elastic, Grafana, Splunk, DataDog, or similar) SRE/DevOps methodologies with Python proficiency for automation and infrastructure-as-code practices Some other highly valued skills may include AWS or Azure cloud certifications Experience implementing AI-driven observability and AIOps solutions Background in large More ❯
testers and operations to automate builds, deployment and release of applications running in the cloud and on-premise Provide guidance on industry best practices for software deployment, development, and observability Engineer tooling to implement those practices Assist and architect where appropriate solutions using containerisation and serverless technologies Drive automation for environment management, logging and monitoring Engage with vendors and service … stack CI/CD, GitLab, Jenkins, Sonatype Nexus Knowledge and working experience of containerising application components including writing DockerFiles and deploying to Kubernetes Deep understanding of pipelines as code Observability concepts and tooling; Opensearch, Cribl, Grafana, Prometheus, CloudWatch #J-18808-Ljbffr More ❯
Stoxx's GCP platform infrastructure Ensure the platform's scalability, reliability, and efficiency meets business and client requirements Develop, build and support a robust CI/CD pipeline and observability stack Be the go-to person for the most critical Platform issues, leading cross-functional teams where necessary, to deliver best-in-class engineering solutions. Drive continuous improvement initiatives to … Experience working in a global or multinational team setting Strong documentation, communication and collaboration skills Proven ability to drive innovation and continuous improvement initiatives Focus on simplicity, automation and observability Expertise in Python, GitHub Actions, Apigee, Airflow Expertise in Observability tooling such as Prometheus/Grafana, ELK, Splunk or similar Bachelor's or Master's degree in Computer Science or More ❯
For: 3+ years hands-on experience with Solace PubSub+ in a production environment Strong knowledge of WAN-based distributed systems and networking fundamentals Experience with Prometheus and Grafana for observability and alerting Confident in Linux/Unix systems and scripting (Bash, Python, etc.) Excellent problem-solving instincts and attention to detail Strong communicator who works well across technical teams Bonus More ❯
documents, to implementing clean solutions that address them. Hands-on experience with infrastructure: whether you’ve been part of an on-call rotation or just working day-to-day observability tools, you are comfortable rolling up your sleeves to understand the factors at play in an incident or service degradation. Pragmatism: You are comfortable balancing “perfect” with “good enough,” as More ❯
a bias for Infrastructure (Python, Go, C#) • IAM Policy and Authentication/Authorization schemes • Web Services and REST API • Databases and Storage Systems • Development Build, Test, and Deployment Pipelines • Observability and Monitoring (Open Telemetry, TIG and ELK stacks) #LI-JS2 Together, as owners, let's turn meaningful insights into action. Life at CGI is rooted in ownership, teamwork, respect and More ❯
Locations : Canary Wharf Boston Who We Are Boston Consulting Group partners with leaders in business and society to tackle their most important challenges and capture their greatest opportunities. BCG was the pioneer in business strategy when it was founded in More ❯
our production systems. Key Responsibilities Design, implement, and manage AWS cloud infrastructure. Develop and maintain automation scripts and tooling. Support production systems and ensure high availability and performance. Implement observability and monitoring solutions. Collaborate closely with the PBS (Platform/Backend Services) team. Contribute to infrastructure as code (IaC) and DevOps best practices. Requirements Hands-on experience with AWS. Automation … experience (e.g., Terraform, Ansible, CI/CD tools). Strong understanding of infrastructure and cloud architecture. Experience supporting production environments. Familiarity with observability tools (e.g., Prometheus, Grafana, CloudWatch). Excellent problem-solving and communication skills. Desirable Experience working in a fast-paced or agile development environment. Familiarity with container technologies (e.g., Docker, Kubernetes). Previous experience in a similar role More ❯
our production systems. Key Responsibilities Design, implement, and manage AWS cloud infrastructure. Develop and maintain automation scripts and tooling. Support production systems and ensure high availability and performance. Implement observability and monitoring solutions. Collaborate closely with the PBS (Platform/Backend Services) team. Contribute to infrastructure as code (IaC) and DevOps best practices. Requirements Hands-on experience with AWS. Automation … experience (e.g., Terraform, Ansible, CI/CD tools). Strong understanding of infrastructure and cloud architecture. Experience supporting production environments. Familiarity with observability tools (e.g., Prometheus, Grafana, CloudWatch). Excellent problem-solving and communication skills. Desirable Experience working in a fast-paced or agile development environment. Familiarity with container technologies (e.g., Docker, Kubernetes). Previous experience in a similar role More ❯
both strategic vision and the ability to dive deep into technical challenges. Responsibilities Lead and Manage the Platform Engineering Initiatives Define and execute the technical roadmap for platform infrastructure, observability, and developer experience Drive DevOps, SRE, and Infrastructure initiatives to ensure platform reliability and performance Foster a culture of automation, observability, and continuous improvement Architect and Implement Scalable Solutions Design … optimal performance and scalability across all regions Own Platform Reliability and Operations Define and maintain SLOs/SLIs/SLAs for critical platform services Implement comprehensive monitoring, alerting, and observability solutions Design and maintain disaster recovery and business continuity plans Lead incident response and post-mortem processes Optimize Platform Performance and Costs Implement strategies to optimize infrastructure costs without compromising … in solving complex technical issues Contribute to codebases as needed to drive projects forward Requirements Technical Expertise Proven experience managing Kubernetes clusters and expertise in container orchestration. Experience with observability tools (e.g., DataDog, Prometheus, Grafana) Experience with Infrastructure as Code (IaC) tools like Terraform or CloudFormation Experience in Database optimization and management (especially for multi-tenant architectures) Extensive knowledge of More ❯
the evolution of our platform's microservices ecosystem. What You'll Do Architect, build, and maintain scalable Python microservices deployed in cloud environments Lead architectural decisions focusing on performance, observability, fault tolerance, and scalability Own complex backend features end-to-end-design, implement, test, deploy, and monitor Mentor and guide engineers through code reviews, design discussions, and best practices Collaborate More ❯
multi-tenant SaaS or large enterprise application. Certifications: AWS Certified Solutions Architect, Google Professional Cloud Architect, Azure Solutions Architect Expert. Experience in data architecture, AI/ML integration, and observability frameworks . #J-18808-Ljbffr More ❯
execution of disaster recovery tests & seek to automate these activities where possible Covering on-call schedule when Production support is required outside of working hours Participate in enhancing product observability and telemetry, support modernization. Brainstorm ideas to simplify and streamline infrastructure by closely working with infrastructure and SRE teams. Required qualifications, capabilities and skills Knowledge of Python/Unix Shell More ❯
Architectures (Kafka). Collaborate with DevOps teams to implement CI/CD pipelines and infrastructure as code using tools like Terraform, CloudFormation, and Ansible. Implement and manage monitoring and observability tools such as Datadog. Ensure real-time logging, alerting, and troubleshooting capabilities. Collaboration & Stakeholder Management: Work closely with business units, developers, and IT teams to understand requirements and translate them More ❯
optimize data flow, connectivity, and interoperability Help to implement best practices and process improvements to enhance delivery efficiency and team performance Work with various internal teams to continuously improve observability and supportability capabilities of the company platform Key Requirements A highly motivated, technical and detail-oriented support engineer, able to work autonomously with minimal direction, passionate about learning new things More ❯
optimize data flow, connectivity, and interoperability Help to implement best practices and process improvements to enhance delivery efficiency and team performance Work with various internal teams to continuously improve observability and supportability capabilities of the company platform Key Requirements A highly motivated, technical and detail-oriented support engineer, able to work autonomously with minimal direction, passionate about learning new things More ❯
focus on goals and the ability to balance multiple priorities in a fast-paced environment. DESIRED SKILLS AND EXPERIENCE: Real-time and low latency market data experience Service orchestration, observability and monitoring platform experience Solid understanding of a Programming Language (preferably Python) Agile tools (Jira, GIT among other DevOps principles) LSEG is a leading global financial markets infrastructure and data More ❯
on AWS are key to our next phase of growth, are written to 12-factor principles and fit into our microservices architecture Cloud-related tools, services, and distributed system observability to support these applications, such as Docker, Kubernetes, ElasticSearch, log management systems, and Datadog APM, to name but a few API specifications, conforming to the OpenAPI (Swagger) standard, provide a More ❯
focus on goals and the ability to balance multiple priorities in a fast-paced environment. DESIRED SKILLS AND EXPERIENCE: Real-time and low latency market data experience Service orchestration, observability and monitoring platform experience Solid understanding of a Programming Language (preferably Python) Agile tools (Jira, GIT among other DevOps principles) LSEG is a leading global financial markets infrastructure and data More ❯
US, and India Advanced experience with AWS, Azure, or GCP and large-scale legacy-to-cloud migration programs Proven record implementing DevOps/CloudOps practices, including IaC, autmation, and observability Hands-on experience with AI code-generation tools (e.g. GitHub Copilot, Cursor.io, Windsurf, Devin) Exceptional communication and stakeholder management skills, translating technical strategy into measurable business impact; Able to work More ❯
Site Reliability/DevOp Engineer London - 5 Days Onsite Up to £550 per day (Umbrella, Inside IR35) 12-Month Contract Must hold live and transferrable DV Clearance Are you passionate about reliability, automation, and supporting mission-critical systems? Join this More ❯
Site Reliability/DevOp Engineer London - 5 Days Onsite Up to £550 per day (Umbrella, Inside IR35) 12-Month Contract Must hold live and transferrable DV Clearance Are you passionate about reliability, automation, and supporting mission-critical systems? Join this More ❯