301 to 325 of 1,260 Observability Jobs

Senior Software Engineer (Payments), Frontend

Hiring Organisation: GoodLeap
Location: Palm Beach, Florida, United States
Employment Type: Permanent
Salary: USD Annual

performance, scalability, resiliency, and security, particularly within payments and financial workflows. Establish and uphold best practices in frontend architecture, code quality, testing, and observability, including writing comprehensive unit and integration tests. Contribute to and evolve shared frontend systems (e.g., design systems, component libraries, micro-frontend architecture) to enable high team ...

Senior Software Engineer - Clinical Background

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

your code into the hands of users independently. Rigorous testing in production: You understand that "works on my machine" isn't enough; you implement observability and feedback loops to monitor how your AI features perform in the wild. Medical degree with clinical experience , and ideally experience working on clinical ...

Lead Engineer - Connectivity LAN

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

hybrid network environments.Operational Excellence* Lead troubleshooting and root cause analysis activities across complex LAN, campus, and data centre connectivity environments.* Contribute to improvements in observability, monitoring, network telemetry, and performance diagnostics.* Evaluate emerging LAN technologies, vendors, and design innovations, presenting findings to senior engineering stakeholders.Ways of Working* Work effectively within ...

Senior Cloud Engineer (Multiple Levels) - 28329

Hiring Organisation: HII Mission Technologies Division
Location: Hanover, Maryland, United States
Employment Type: Permanent
Salary: USD Annual

Management (ICAM) Generate Dev, Staging, and Production environments Automate application deployment and configuration Maintain shared enterprise services (ICAM, data stores, security scanners, etc) Manage observability, logging, and alerting Establish and maintain Service Level Agreements Minimum Qualifications Level 3: 5 years experience with Bachelors in related field; 3 years experience with ...

Technical Project Manager

Hiring Organisation: Jobleads-UK
Location: United Kingdom

with production requirements and service levels. Coordinate release readiness across testing, evaluation, and go‐live plans. Infrastructure & Operations Align engineering and DevOps on environments, observability, and incident response, ensuring deployments are auditable, cost‐aware, and reversible. Oversee vendor and subcontractor deliverables, ensuring integrations meet contractual, performance, and compliance requirements. ...

Machine Learning Engineer, Senior Manager

Hiring Organisation: Credit Acceptance Corporation
Location: United States
Employment Type: Permanent
Salary: USD 270,386 Annual

such infrastructure Hands-on expertise in scaling and maintaining production-grade ML services, with a strong focus on ML/LLM Operations (versioning, automation, observability, automated training and monitoring, etc.) and ability to balance ML model complexity with production requirements Passion for identifying new business opportunities and experience of using ...

Infrastructure Engineering, AVP-2

Hiring Organisation: State Street
Location: Greater London, United Kingdom
Employment Type: Full Time

workloads Develop Infrastructure as Code solutions using Terraform (preferred) for Azure resources and AKS provisioning Automate operational workflows including cluster provisioning, scaling, and recovery Observability, Monitoring & Reliability Engineering Design and maintain platform observability frameworks using: Prometheus, Grafana, Dynatrace, Elasticsearch Azure Monitor, Log Analytics OpenTelemetry (where applicable) Ensure proactive monitoring … cloud (AWS/GCP) and cloud-native architectures. Hands-on experience with DevOps Platform Tooling (i.e) ArgoCD, Terraform, Azure Devops, scripting Operational experience with observability tools Dynatrace, Prometheus, Grafana, OpenTelemetry Experience influencing or owning platform/product roadmaps in partnership with Product Management. Solid background in cloud native engineering concepts ...

Engineering Team Lead

Hiring Organisation: IRIS Software Group
Location: United Kingdom

ensure adoption of deployment best practices across the team Infrastructure as Code : Infrastructure as Code with Terraform, ARM Templates or AWS CloudFormation/CDK Observability Strategy : DataDog, Application Insights or Amazon CloudWatch implementation with comprehensive observability strategy and cloud governance Cloud Strategy : Strategic SAAS using Azure (Functions, Queue, Blob Storage … team, and measure team adoption with continuous improvement initiatives Application Security: Confident vulnerability management, thread modelling and tracking Production Support : Knowledge of observability and production support practices Experience Requirements 6+ years in software engineering, minimum 1 year of formal people management experience OR 2+ years of technical leadership with mentoring ...

Engineering Lead

Hiring Organisation: IRIS Software Group
Location: United Kingdom

ensure adoption of deployment best practices across the team Infrastructure as Code : Infrastructure as Code with Terraform, ARM Templates or AWS CloudFormation/CDK Observability Strategy : DataDog, Application Insights or Amazon CloudWatch implementation with comprehensive observability strategy and cloud governance Cloud Strategy : Strategic SAAS using Azure (Functions, Queue, Blob Storage … team, and measure team adoption with continuous improvement initiatives Application Security: Confident vulnerability management, thread modelling and tracking Production Support : Knowledge of observability and production support practices Experience Requirements 8+ years in software engineering, minimum 1 year of formal people management experience OR 2+ years of technical leadership with mentoring ...

DevOps Engineer - Defence

Hiring Organisation: Anson Mccade
Location: Bristol, Avon, South West, United Kingdom
Employment Type: Permanent, Work From Home
Salary: £90,000

similar Build and manage containerised environments using Docker and Kubernetes Support deployment and operations across cloud platforms (AWS, Azure, GCP) Implement monitoring and observability solutions using ELK, Grafana, or similar tools Manage artifacts and quality processes using Artifactory, SonarQube, or equivalent Support system administration across Linux and Windows environments , including … experience with: Infrastructure as Code (Terraform, Ansible, Packer) Containerisation and orchestration (Docker, Kubernetes) CI/CD pipelines and automation Experience with monitoring, logging, and observability tooling Experience working with cloud platforms (AWS, Azure, or GCP) Strong understanding of system administration across Linux and/or Windows Experience working in complex ...

Site Reliability Engineer — AWS & Observability

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

place for you. What You'll Do Own reliability – Maintain and improve our AWS infrastructure using Terraform, bringing your expertise and best practices Champion observability – Partner with developers to implement effective monitoring, logging, and tracing strategies Strengthen security – Work closely with the CISO to implement security best practices and ensure … compliance Optimise costs – Monitor cloud spend and implement FinOps best practices Maintain CI/CD pipelines – Implement and maintain reliability and observability aspects of GitHub workflows and deployment pipelines Incident response – Lead incidents, run blameless post-mortems, and drive continuous improvement Enable developers – Mentor teams on SRE and observability practices ...

Principal AI Architect

Hiring Organisation: MBN Solutions
Location: England, United Kingdom

experiments) Core stack LangGraph · CrewAI · Python · FastAPI · Docker/E2B · AWS/Azure · Kubernetes Nice to have LLM orchestration experience Security/guardrails frameworks Observability (LangSmith) Model routing/LiteLLM/tiering strategies Early-stage startup experience Enterprise client exposure ...

Cloud Native Architect

Hiring Organisation: Lawrence Harvey
Location: Leeds, West Yorkshire, United Kingdom
Employment Type: Permanent
Salary: £75000 - £85000/annum

/CD design and implementation using tools such as GitHub, Terraform, or similar Solid knowledge of cloud security, IAM, networking, data flows, and observability This is an excellent opportunity for a Cloud Architect who enjoys building scalable cloud foundations, driving automation, and establishing strong engineering standards. ...

Senior Software Engineer, Data Platform

Hiring Organisation: Jobleads-UK
Location: Washington, England, United Kingdom

documents, improving code quality, and improving team performance and processes; Acting as a senior technical owner in debugging complex production issues and improving system observability; Collaborating with cross-functional teams to support their data infrastructure needs. Must-have qualifications: Extensive hands-on software engineering experience, with a strong track record ...

Oracle Cloud Architect

Hiring Organisation: Neotecra, Inc
Location: New York, United States
Employment Type: Permanent
Salary: USD Annual

ExaDB-D, Exascale, DBaaS, OEM Architecture: High Availability, Disaster Recovery, Multi-Cloud Architecture. Automation: Terraform, Ansible, API/CLI automation SRE and DevOps: Monitoring, Observability, SLAs/SLOs/SLIs Key Skills OCI: 6 9 years Azure/AWS/Google Cloud Platform: 1 2 years each (nice to have ...

Lead Software Engineer - Privileged Access Management

Hiring Organisation: Arcus Search
Location: City of London, London, United Kingdom

Help shape technical direction and engineering standards across the platform Work across Python, Go, and TypeScript — choosing the right tool for the job Improve observability, reliability, and operational excellence across critical systems Collaborate closely with engineers, security, and infrastructure teams Solve hard problems around access control, automation, and cloud-scale ...

Senior Data Engineer

Hiring Organisation: Signify Technology
Location: Farringdon, Devon, UK

Strong SQL and Python skills, with hands-on experience in dbt, Databricks, Airflow or Looker Solid analytics engineering fundamentals — data modelling, warehousing, performance optimisation, observability and governance Experience in a marketplace or e-commerce environment, with comfort around metrics like GMV, conversion rates and retention Background working with financial data ...

Senior Data Engineer

Hiring Organisation: Signify Technology
Location: Farringdon, England, United Kingdom

Senior Software Engineer

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

will: Architect and build core backend systems powering our agents Own integrations with real clinical systems and scheduling platforms Improve performance, reliability, and observability across the platform Contribute to agent reasoning flows, memory layers, and workflow design Collaborate directly with founders, product, and customers Shape our engineering culture and long ...

Software Engineer

Hiring Organisation: 5V Video
Location: City of London, London, United Kingdom

Lambda, API Gateway, S3, DynamoDB, IAM) Event-driven systems (Kafka, SNS/SQS) Databases (Postgres, DynamoDB, Couchbase) CI/CD (Concourse, Git workflows) Observability (Prometheus, Grafana, CloudWatch) What You’ll Bring Strong backend engineering experience (Python preferred) Experience building and supporting microservices in production Good understanding of event-driven architectures ...

Backend Software Engineer Python LLM - Finance

Hiring Organisation: Client Server
Location: East London, London, United Kingdom
Employment Type: Permanent, Work From Home
Salary: £90,000

development You have hands-on experience deploying LLMs and multi-modal models at scale in production You have a strong understanding of scalable MLOps, observability and cloud-native AI deployment You're collaborative and pragmatic with advanced stakeholder communication, problem-solving and project management skills in Agile environments Experience with ...

Platform Engineer

Hiring Organisation: Rebel Recruitment Limited
Location: Sheffield, South Yorkshire, United Kingdom
Employment Type: Permanent
Salary: £40000 - £50000/annum

infrastructure services Creating reusable, automated solutions using Infrastructure as Code (Bicep) Building and maintaining CI/CD pipelines for consistent, reliable delivery Improving platform observability, monitoring, and operational resilience Supporting engineering teams with best practices and self-service capabilities Contributing to the development of an Internal Developer Platform What ...

Platform Engineer

Hiring Organisation: Prism Digital
Location: Milton Keynes, England, United Kingdom

availability Own incident resolution, root cause analysis, and continuous improvement Collaborate with engineers and third-party providers to mature the platform Contribute to monitoring, observability, and cost optimisation strategies Support projects and business initiatives through robust platform delivery What They’re Looking For: Microsoft Fabric experience Terraform experience Cloud platform ...

Senior Software Engineer

Hiring Organisation: ZEREN
Location: London Area, United Kingdom

common SaaS platforms Designing data pipelines that ingest provider APIs into our analytics datastore Improving the agent platform and executor with better error handling, observability, and speed Making LLM interactions more efficient through prompts, snippets, and tooling Improving internal developer experience and platform tooling Helping take an early MVP into ...

Site Reliability Engineer (DataCosmos)

Hiring Organisation: Jobleads-UK
Location: East Hagbourne, England, United Kingdom

work with Linux systems and cloud platforms (AWS, GCP or Azure) Solid Kubernetes knowledge and ability to run production systems A clear understanding of observability (monitoring, logging, tracing) Capable of designing or operating high-availability, distributed systems A mindset focused on automation, scalability, and continuous improvement Confidence working in fast ...