Permanent Observability Job Vacancies

351 to 375 of 400 Permanent Observability Jobs

Senior Software Engineer - NSPE Firewall

London, United Kingdom
Bloomberg L.P
tools to manage a large-scale, multi-vendor network with an emphasis on automation, telemetry, and model-driven infrastructure as code. Automate the full network lifecycle, including provisioning, configuration, observability, testing, troubleshooting, and capacity planning. Collaborate with architecture and design teams and the CTO office to implement new technologies that ensure scalability, efficiency, and operational resilience. Develop tools and platforms … that enhance the observability, reliability, and performance of the production network. Enhance existing monitoring and observability frameworks, integrating intelligent alerting and self-remediation capabilities to reduce manual intervention and improve incident response. Define and measure service-level objectives (SLOs) to track infrastructure performance and reliability. Write software utilizing orchestration systems to automate tasks and interact with other systems. Provide mentorship …
Employment Type: Permanent
Salary: GBP Annual

Software Engineer / SRE

Leeds, West Yorkshire, Yorkshire, United Kingdom
Hybrid / WFH Options
Fruition Group
Software Engineer/SRE JavaScript/TypeScript, Node.js, AWS, Observability Leeds/Hybrid, c. 2x per week Salary up to £65,000 We're looking for a Software Engineer with strong AWS and Observability experience to join a growing engineering team in Leeds. This is a hybrid role, giving you the flexibility to split your time between home and a … improving platform performance and automation, while collaborating with developers, product teams, and operations. What you'll be doing: Building and maintaining scalable cloud infrastructure in AWS Implementing and improving observability tools (monitoring, logging, tracing) Automating deployments and improving CI/CD pipelines Driving reliability, availability and performance across systems Working with developers and SREs to solve complex problems What we … re looking for: Strong experience with AWS (EC2, ECS, Lambda, RDS etc.) Good knowledge of observability tools (Grafana, Prometheus, OpenTelemetry, Datadog, or similar) Background in software engineering (JavaScript/TypeScript & Node.js, although any language is fine) Experience with Infrastructure as Code (Terraform, CloudFormation, or similar) CI/CD pipelines and automation experience What's on offer: Salary up to …
Employment Type: Permanent, Work From Home
Salary: £65,000

Platform Engineer

Caldecotte, Milton Keynes, Buckinghamshire, England, United Kingdom
Connells Group HQ
mindset, working directly with development teams to understand their needs and deliver solutions. You will work across multiple technical domains including orchestration, automation, CI/CD pipelines, cloud services, observability, and security, developing deeper expertise in areas that align with platform priorities and your interests. Experience with Microsoft Azure is essential. You will play your part in operating the platform aligned … with Docker and basic Kubernetes concepts Understanding of cloud networking concepts (VNets, subnets, NSGs) Awareness of cloud security best practices and compliance requirements Basic knowledge of monitoring, logging, and observability tools Understanding of cloud cost management and resource optimisation principles Comfort with troubleshooting and supporting development teams Understanding of service reliability and incident response practices Connells Group UK is an …
Employment Type: Full-Time
Salary: Competitive salary

Lead Test Engineer - HO - G7

London, United Kingdom
Manchester Digital
whole range of tools. The below is a sample snapshot of some tools we use: Frameworks: Serenity BDD (Selenium + RestAssured), Playwright, and several bespoke frameworks. Performance testing: Locust, JMeter. Observability and Monitoring: ELK, Grafana, Dynatrace. What you will do Your main day-to-day responsibilities will be: Ownership of existing test frameworks and actively reviewing them, identifying need for new … parallel tests. Using container and cloud technology to orchestrate faster, scalable tests while optimising costs. Working with teams to promote use of code scan, code quality, accessibility testing and observability tools. Participating in the QAT Test Engineering community as well as QAT recruitment to help grow it. Person specification As a Lead Test Engineer, you will have experience of: Programming using …
Employment Type: Permanent
Salary: GBP Annual

Lead Test Engineer - HO - G7

Croydon, London, United Kingdom
Manchester Digital
whole range of tools. The below is a sample snapshot of some tools we use: Frameworks: Serenity BDD (Selenium + RestAssured), Playwright, and several bespoke frameworks. Performance testing: Locust, JMeter. Observability and Monitoring: ELK, Grafana, Dynatrace. What you will do Your main day-to-day responsibilities will be: Ownership of existing test frameworks and actively reviewing them, identifying need for new … parallel tests. Using container and cloud technology to orchestrate faster, scalable tests while optimising costs. Working with teams to promote use of code scan, code quality, accessibility testing and observability tools. Participating in the QAT Test Engineering community as well as QAT recruitment to help grow it. Person specification As a Lead Test Engineer, you will have experience of: Programming using …
Employment Type: Permanent
Salary: GBP Annual

Lead Test Engineer - HO - G7

Glasgow, United Kingdom
Manchester Digital
whole range of tools. The below is a sample snapshot of some tools we use: Frameworks: Serenity BDD (Selenium + RestAssured), Playwright, and several bespoke frameworks. Performance testing: Locust, JMeter. Observability and Monitoring: ELK, Grafana, Dynatrace. What you will do Your main day-to-day responsibilities will be: Ownership of existing test frameworks and actively reviewing them, identifying need for new … parallel tests. Using container and cloud technology to orchestrate faster, scalable tests while optimising costs. Working with teams to promote use of code scan, code quality, accessibility testing and observability tools. Participating in the QAT Test Engineering community as well as QAT recruitment to help grow it. Person specification As a Lead Test Engineer, you will have experience of: Programming using …
Employment Type: Permanent
Salary: GBP Annual

Lead Test Engineer - HO - G7

Sheffield, Yorkshire, United Kingdom
Manchester Digital
whole range of tools. The below is a sample snapshot of some tools we use: Frameworks: Serenity BDD (Selenium + RestAssured), Playwright, and several bespoke frameworks. Performance testing: Locust, JMeter. Observability and Monitoring: ELK, Grafana, Dynatrace. What you will do Your main day-to-day responsibilities will be: Ownership of existing test frameworks and actively reviewing them, identifying need for new … parallel tests. Using container and cloud technology to orchestrate faster, scalable tests while optimising costs. Working with teams to promote use of code scan, code quality, accessibility testing and observability tools. Participating in the QAT Test Engineering community as well as QAT recruitment to help grow it. Person specification As a Lead Test Engineer, you will have experience of: Programming using …
Employment Type: Permanent
Salary: GBP Annual

Lead Test Engineer - HO - G7

Manchester, Lancashire, United Kingdom
Manchester Digital
whole range of tools. The below is a sample snapshot of some tools we use: Frameworks: Serenity BDD (Selenium + RestAssured), Playwright, and several bespoke frameworks. Performance testing: Locust, JMeter. Observability and Monitoring: ELK, Grafana, Dynatrace. What you will do Your main day-to-day responsibilities will be: Ownership of existing test frameworks and actively reviewing them, identifying need for new … parallel tests. Using container and cloud technology to orchestrate faster, scalable tests while optimising costs. Working with teams to promote use of code scan, code quality, accessibility testing and observability tools. Participating in the QAT Test Engineering community as well as QAT recruitment to help grow it. Person specification As a Lead Test Engineer, you will have experience of: Programming using …
Employment Type: Permanent
Salary: GBP Annual

Senior Software Engineer in Test

London, United Kingdom
Hybrid / WFH Options
LinuxRecruit
This is a fast-expanding company at the forefront of odds comparison, where innovation converges with excitement. Here you can experience the best of both worlds, working within a close-knit team with autonomy while enjoying substantial financial backing from …
Employment Type: Permanent
Salary: GBP Annual

AppSec Lead

Central London, London, United Kingdom
Hybrid / WFH Options
Halian Technology Limited
A leading fintech company is seeking a Lead AppSec Engineer to join their established team. You'll be instrumental in embedding security into every stage of the software development lifecycle, guiding engineers, shaping best practices, and driving secure, scalable solutions across our …
Employment Type: Permanent, Work From Home

Data Scientist (Machine Learning Observability & Governance)

Manchester, Lancashire, United Kingdom
Hybrid / WFH Options
Starling Bank
problems and challenges, who can work across teams do great things here at Starling, to continue changing banking for good. Responsibilities: As a Data Scientist in the Machine Learning Observability & Governance team, you will play a crucial role in enabling Starling Bank to maximally exploit AI in line with its risk appetite, while ensuring ethical and responsible AI practices. Your … responsible. Stakeholder Communication & Visibility: Ensure clear communication and good visibility with stakeholders such as risk teams, regarding how data scientists at Starling observe and manage ML and AI models. Observability Centre of Excellence: Support colleagues in enhancing their observability work by maintaining existing observability tooling, assisting in identifying key metrics to monitor, and providing expert advice on internally-developed model …
Employment Type: Permanent
Salary: GBP Annual

Lead DevOps Engineer (Data)

London, United Kingdom
Hybrid / WFH Options
LGBT Great
a key role in scaling and supporting our data systems, which leverage a modern AWS stack and Snowflake. This is a high-impact role with direct influence over reliability, observability, and the DevOps maturity of our data engineering function. Key Responsibilities Platform Ownership Own and manage the data platform infrastructure built on AWS services (EventBridge, Lambda, EC2, MWAA, S3). … Snowflake, and support its integration into the broader data ecosystem. Infrastructure and System Reliability Ensure platform reliability, availability, and scalability across environments. Design and maintain robust monitoring, alerting, and observability frameworks to reduce MTTR and improve visibility. Lead and manage initiatives related to data lineage, platform health, and alert hygiene. CI/CD and Automation Enhance and expand our CI … and operating production data platforms within AWS. Strong understanding of AWS core services: EventBridge, Lambda, EC2, S3, and MWAA (Managed Workflows for Apache Airflow). Experience with infrastructure reliability, observability tooling, and platform automation. Solid experience with CI/CD pipelines, preferably Bitbucket Pipelines. Familiarity with Snowflake administration and deployment practices. Comfortable working through ambiguity and in cross-functional, collaborative …
Employment Type: Permanent
Salary: GBP Annual

Staff Infrastructure Engineer - Long Term Project - Los Angeles (Hybrid)

Los Angeles, California, United States
Hybrid / WFH Options
INSPYR Solutions
on reliability engineering to deliver robust and maintainable systems. You will work on network design, traffic analysis and engineering, maintaining CI/CD pipeline and creating tools to enhance observability and streamline troubleshooting for core infrastructure services. Your role will include: Designing, deploying, and operating the global network: Plan, build, and maintain both new and existing infrastructure to deliver the … system reliability, and enable rapid scaling. Developing customer-centric tooling: Build tools to simplify and streamline the consumption of cloud resources for internal teams, empowering them to innovate faster. Observability and troubleshooting: Enhance monitoring and logging systems to quickly detect, debug, and resolve issues across our infrastructure. Mentorship and continuous learning: Guide and mentor junior and senior engineers in systems … engineers across various time zones to maximize coverage, responsiveness, and global reach. Responsibilities: Solve complex challenges independently, diagnosing and resolving production issues across globally distributed systems. Advance our monitoring and observability platforms, driving innovation that keeps our infrastructure visible, actionable, and resilient. Troubleshoot live incidents (on-call rotation) and design resilient solutions to maintain uptime and meet SLAs, continually evolving our …
Employment Type: Permanent
Salary: USD Annual

Senior Software Engineer, Front End

Cambridge, Cambridgeshire, United Kingdom
Futureshaper.com
reusability. Implement responsive, accessible, and performant UIs optimized for data-rich and interactive workflows. Implement DevOps/GitOps practices for automated testing, deployment, and monitoring. Ensure security, scalability, and observability of front-end services in cloud environments (AWS). Ensure robust unit, integration, and end-to-end test coverage to maintain long-term code quality. An eye for optimal and … suites (Jest, React Testing Library, Cypress). Experience with core AWS services (e.g., EC2, S3, Lambda) and infrastructure-as-code using AWS CDK. Experience with system design, performance optimization, observability, and operational excellence during parallel LLM streams. Strong intuition for UX design and a demonstrated commitment to building delightful, workflow-first products. Excellent communication skills and collaborative mindset, especially in … fast-moving, cross-functional environments. Preferred Qualifications Background in scientific domains such as biology, chemistry, or complex systems is a plus but not required. Familiarity with system evaluation and observability tools (e.g., Grafana, Langfuse, Kibana, CloudWatch) and managing SLAs in production environments. Why Join Us? By joining this initiative within Flagship's Pioneering Intelligence group, you will: Help define a …
Employment Type: Permanent
Salary: GBP Annual

Staff Software Engineer - Platform Strategy for Expansion

London, United Kingdom
Burns Sheehan
to solve complex challenges. Drive innovation around cloud-native technologies and platform automation. Balance strategic vision with 30% hands-on coding and design work. Promote best practice in reliability, observability, and scalability. The Ideal Staff Software Engineer Proven experience operating at Staff+ level within a fast-paced engineering organisation. Strong background in cloud platforms (AWS or GCP) and deep knowledge … ability to build operators. Strong coding skills in Golang, Java, or C#, with experience in distributed systems. Demonstrated leadership across multiple squads and technical roadmaps. Expertise in operational excellence: observability, reliability, automation. This is an outstanding opportunity for a Staff Software Engineer to join a rapidly scaling company where you'll play a pivotal role in shaping the technical foundations of …
Employment Type: Permanent
Salary: GBP Annual

Engineering - Senior Backend Engineer

London, United Kingdom
Hybrid / WFH Options
quench.ai
enhancing our proprietary search engine, indexing and querying structured and unstructured data. Collaborate closely with the AI team to deliver intelligent, contextual responses to user queries. Ensure high performance, observability, and resilience across all backend services. Contribute to technical strategy, code reviews, and overall engineering best practices. You may be suited for this role if you meet the following criteria … 5+ years of backend development experience. Expertise in Python and cloud-based architectures (preferably GCP). Strong understanding of modern software development best practices, including CI/CD, containerization, observability, and microservices. Experience with data integrations and APIs, particularly across enterprise tools. Familiarity with search indexing and large-scale data pipelines is a strong plus. Strong understanding of system …
Employment Type: Permanent
Salary: GBP Annual

Senior Director - Network Operations

London, United Kingdom
Hybrid / WFH Options
Planet Payment
other relevant compliance frameworks. Cost & Budget Ownership: Optimise network OPEX and vendor spend through strategic negotiations, vendor consolidation, and improved service efficiency. Automation & Monitoring: Drive adoption of network monitoring, observability, and automation tools to proactively detect, predict, and resolve issues. Team Leadership: Build and mentor a high-performing global Network Operations team, instilling a culture of accountability, service excellence, and … switching, firewalls (FortiGate, Check Point), load balancers (F5), and SD-WAN technologies. Experience in managing ISP peering, public IP allocations, redundancy, and DDoS protection strategies. Strong understanding of automation, observability, and network monitoring tools to drive proactive operations. Skilled in incident, change, and problem management processes within ITIL or similar frameworks. Excellent communicator with the ability to engage both executive …
Employment Type: Permanent
Salary: GBP Annual

Senior Data Management Professional - Data Engineering - Equity Corporate Actions

New York, United States
Bloomberg
passion for finance, data, and technology who has extensive experience building data management solutions. You'll be responsible for strategizing, designing, and implementing data pipelines and remediation workflows, ensuring observability, transparency, and continuous improvement of pipeline performance and quality of output. In this role, you'll also act as a technical leader, guiding design decisions, mentoring team members, and owning … loading both structured and unstructured data from diverse and numerous sources, leveraging Bloomberg's technology stack. Lead the development and implementation of proactive programmatic data quality strategies with enhanced observability, transparency, and robust remediation workflows, enabling rapid identification and resolution of data issues with minimal client disruption. Use your analytical experience to analyze internal processes to identify gaps and opportunities …
Employment Type: Permanent
Salary: USD Annual

Machine Learning Engineer

London, United Kingdom
Hybrid / WFH Options
Ravelin
Who are we? Hi! We are Ravelin! We're a fraud detection company using advanced machine learning and network analysis technology to solve big problems. Our goal is to make online transactions safer and help our clients feel confident serving …
Employment Type: Permanent
Salary: GBP Annual

Monitoring Engineer

Manchester, Lancashire, England, United Kingdom
Hybrid / WFH Options
Eutopia Solutions ltd
availability, performance, vulnerability, and compliance monitoring, you will operate, administer, and engineer monitoring solutions within this complex environment. You will also play a part in monitoring and observability platform technical design and architectural decisions and changes. This includes the design and implementation of new monitoring systems integrations. This is an exciting time to join them as they continue … their journey with Azure and look to build out automated functions within the operation. They are looking for an individual who has proven experience with a range of observability tools, both on-prem and in the Cloud, with a good understanding of observability frameworks. Offering excellent ongoing professional development opportunities, they are keen to engage with professionals who will embrace this. … with 2 days a week required onsite. Key skills and experience: Experience of working in a similar role within a large-scale corporate environment Proven knowledge of working with observability tools to evaluate application/system state to provide a healthy and stable platform A firm and proven understanding of Monitoring and Observability tools and core concepts Experience of working …
Employment Type: Full-Time
Salary: £50,000 - £60,000 per annum, Negotiable, Inc benefits

Lead Azure Security Engineer

Potters Bar, Hertfordshire, South East, United Kingdom
Searchstone Ltd
/Insurance (or other regulated industry) Are you a hands-on Security Engineer with strong Azure expertise? We're looking for an all-rounder who can bring deep knowledge of observability, IDAM, and SIEM/SOC integrations into a complex, regulated environment. This role sits at the heart of our cloud security strategy, working with modern tools and protocols to keep … SOC/NOC, and IDAM integrations Driving best practice with Microsoft Entra (Azure AD/IDAM protocols) Using and optimising tools like Sentinel, Splunk, Datadog, OpenTelemetry, and ELK Supporting observability, scanning, and incident response capabilities Tackling complex security challenges in a financial services context What we're looking for Proven experience as a Security Engineer, ideally in financial services or insurance … Strong knowledge of Azure security and cloud-native security tools Hands-on with SIEM, SOC/NOC, IDAM, and observability frameworks Experience with Microsoft Entra, Sentinel, Splunk, Datadog, OpenTelemetry, and ELK Excellent problem-solving skills and the ability to work in a complex, regulated environment Bonus points for: Microsoft security certifications, CISSP, CISM, or similar. If you're a technically strong …
Employment Type: Permanent
Salary: £95,000

Senior AI Engineer

London, United Kingdom
Colossyan
to deployment and monitoring, balancing cutting-edge techniques with pragmatism to deliver measurable impact. • Apply strong software engineering principles, such as modularity, testing, code reviews, CI/CD and observability, to ensure AI systems are reliable, maintainable, production-ready and can be readily adapted to future developments. • Choose the right approach for the problem at hand, evaluating classical ML and … focused teams, collaborating with designers, engineers, and PMs, to scope and ship AI features iteratively. • Ability to reason about system behavior end-to-end, including model performance, latency, and observability, and how these impact user experience. • Clear, structured communicator, comfortable documenting and defending architectural decisions and engaging in thoughtful technical debate. Not required, but it's a plus if you …
Employment Type: Permanent
Salary: GBP Annual

Remediation Engineer with Security Clearance

Maryland Heights, Missouri, United States
Innova Solutions, Inc
A client of Innova Solutions is immediately hiring a Remediation Engineer. Position Type: Full-Time, Contract Duration: 6+ months Location: Maryland Heights, MO 63043 (Hybrid) As a Remediation Engineer, you will: • This person will spend most of their time analyzing …
Employment Type: Permanent
Salary: USD 65 Hourly

Lead Frontend Engineer

London, United Kingdom
Hybrid / WFH Options
Fruition Group
Lead Software Engineer London (Hybrid 1x Per Month) Salary: Up to £100k + Benefits + Stock Options About Us Our client is an Insurtech unicorn looking to expand their engineering team. With the opportunity to work on existing products which …
Employment Type: Permanent, Work From Home

Senior Software Engineer - Storage Distributed Upstream

New York, United States
Bloomberg
Senior Software Engineer - Storage Distributed Upstream Location New York Business Area Engineering and CTO Ref # Description & Requirements About the Team The Storage Distributed Upstream team is laser-focused on advancing Ceph, the leading open-source distributed storage platform. Our …
Employment Type: Permanent
Salary: USD Annual
Observability
10th Percentile: £57,500
25th Percentile: £67,500
Median: £80,000
75th Percentile: £100,000
90th Percentile: £130,000