376 to 400 of 1,199 Permanent Observability Jobs

Senior Cloud Architect

Hiring Organisation
Quorum Network Resources
Location
Edinburgh, Midlothian, Scotland, United Kingdom
Employment Type
Permanent, Work From Home
balancing, segmentation, private connectivity); Security (IAM, MFA, encryption, policy enforcement); Storage & data services (object/block/file, backup, replication); Automation (IaC, pipelines, scripting); Observability (logging, metrics, tracing, SIEM integration). Why Join Quorum? Quorum is an employee-owned IT consultancy, delivering managed services, projects, and professional services to organisations across ...

Head of Software Engineering

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
customer demand grows. Lead on engineering security practices, particularly in the context of UK Government and defence data handling. Work with Ops to improve observability, incident response, and the reliability of a 24/7 operational system. Essential Skills and Experience: Strong full-stack background: comfortable with cloud infrastructure. Proven experience ...

Solution Architect (Manchester)

Hiring Organisation
Jobleads-UK
Location
Greater Manchester, England, United Kingdom
driven solutions where appropriate Contribute to the evolution of end to end data management capabilities, embedding discovery, governance, lifecycle management, operability, resilience, auditability, and observability into solution designs by default Collaborate with architects, platform, security, and operations teams to ensure solutions meet production readiness, compliance, and assurance standards, and align ...

Solution Architect (Manchester)

Hiring Organisation
Jobleads-UK
Location
Manchester, England, United Kingdom
driven solutions where appropriate Contribute to the evolution of end‐to‐end data management capabilities, embedding discovery, governance, lifecycle management, operability, resilience, auditability, and observability into solution designs by default Collaborate with architects, platform, security, and operations teams to ensure solutions meet production readiness, compliance, and assurance standards, and align ...

Senior Product Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
frontend features and review frontend PRs Nice to Have Experience with voice agents or real-time audio pipelines Familiarity with evals, guardrails, or agent observability Familiarity with GCP/cloud infrastructure Experience in recruiting tech, marketplaces, or matching systems The Stack A modern Python stack: FastAPI, Postgres, Redis, Docker, Pydantic ...

Staff Software Engineer

Hiring Organisation
Stepstone UK
Location
South East London, London, United Kingdom
Employment Type
Permanent
APIs (e.g. OpenAI, Bedrock), prompt orchestration, tool usage and interaction patterns. Expertise in AI system design including state management, memory strategies, evaluation approaches, guardrails, observability and performance optimisation for LLM-powered systems. Strong communication and influencing skills, with experience mentoring engineers and driving high standards across teams while engaging both ...

Staff Cloud Support Engineer

Hiring Organisation
Crusoe
Location
San Francisco, California, United States
Employment Type
Permanent
Salary
USD Annual
Infrastructure Expertise Troubleshoot NCCL, IB, GPU driver/firmware issues, distributed training failures. Support complex AI workloads (training + inference) with performance tuning and observability improvements. Customer-Facing Authority Act as technical advisor during high-risk customer incidents. Deliver executive-ready RCAs with clarity and confidence. Drive trust through transparency ...

Engineering Manager - Billing & Payments (f/m/d)

Hiring Organisation
Awin
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
Competitive salary
engineering best practices Consider long-term technical strategy, trade-offs, and system interactions A data-informed approach to decision-making, with a bias toward observability and learning A proactive mindset toward AI and emerging technologies, with the curiosity and willingness to experiment, learn, and leverage them to drive team efficiency ...

Principal Architect - GenAI

Hiring Organisation
Jobleads-UK
Location
Isleworth, England, United Kingdom
frameworks (e.g., TensorFlow, PyTorch) and cloud platforms (AWS, Azure, GCP). Experience with AI infrastructure components including model gateways, orchestration layers, and observability tools. Familiarity with AI governance, data privacy, international regulation and ethical AI frameworks. Excellent leadership, communication, and interpersonal skills. Strong analytical and problem‐solving abilities. Ability ...

Senior Software Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
Software Engineer, you will be responsible for the design, development, deployment, and operation of business‐critical features that add customer value. Operational excellence, metrics, observability and best practices, evangelization, and mentoring in your team and across the whole of Engineering will be part of your day‐to‐day job. Quality ...

AI Architect – Enterprise Strategy

Hiring Organisation
Jobleads-UK
Location
Chelmsford, England, United Kingdom
scalable MLOps roadmaps. Govern & Scale Responsibly: Enterprise AI demands bulletproof reliability. You will implement cutting‐edge MLOps best practices, focusing on observability, rigorous testing, and QA to mitigate bias and hallucinations, ensuring AI outputs are safe, ethical, and value‐driven. Integrate & Innovate: You’ll deploy sophisticated models within major enterprise ...

Head of AI Infrastructure & Machine Learning Operations

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
runtime management across varied environments (batch, real-time, REST, edge), using tools like MLflow, SageMaker, Databricks, or other equivalent. Strong background in monitoring, observability, and incident response, including drift detection, fairness tracking, latency alerts, and recovery protocols. Skilled in building secure, compliant AI infrastructure aligned with regulatory standards (e.g., GDPR ...

Head of Digital Delivery

Hiring Organisation
Pikl
Location
Norwich, England, United Kingdom
organisational design, workforce planning, and hiring strategy Standards, Quality & Technical Excellence Establish and uphold best practices across SDLC, code quality, testing, observability, and incident management Partner with engineering leadership to ensure systems are scalable, secure, and maintainable Drive the adoption of modern development practices (Agile, DevOps, CI/ ...

Senior Software Engineer - Policy & Control

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
e.g., E2E/Cypress). Backend Excellence: Engineers sophisticated backend solutions involving API versioning, caching strategies, and complex data migration plans. Operational Maturity: Leads observability and SRE practices; defines SLOs, manages incident responses, and conducts blameless post‐mortems. Security & Risk: Oversees operational security, including secrets hygiene and dependency risk management ...

Lead Engineer (Routing Squad)

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
Technologies we use: Backend languages: Python, Go. Tech infrastructure: AWS, CDK TypeScript, Lambda, SQS, EventBridge, RDS, DynamoDB. Data tooling: GCP, BigQuery, Looker, Looker Studio. Observability: Loki, Tempo, Grafana, Prometheus. Event-driven architecture and domain-driven design. How we reward our team: Dynamic hybrid working environment with a diverse and driven ...

Senior Software Engineer - Flights

Hiring Organisation
Jobleads-UK
Location
Birmingham, England, United Kingdom
e.g., E2E/Cypress). Backend Excellence: Engineers sophisticated backend solutions involving API versioning, caching strategies, and complex data migration plans. Operational Maturity: Leads observability and SRE practices; defines SLOs, manages incident responses, and conducts blameless post‐mortems. Security & Risk: Oversees operational security, including secrets hygiene and dependency risk management ...

Senior Software Engineer - Flights

Hiring Organisation
Jobleads-UK
Location
City of Edinburgh, Scotland, United Kingdom
e.g., E2E/Cypress). Backend Excellence: Engineers sophisticated backend solutions involving API versioning, caching strategies, and complex data migration plans. Operational Maturity: Leads observability and SRE practices; defines SLOs, manages incident responses, and conducts blameless post‐mortems. Security & Risk: Oversees operational security, including secrets hygiene and dependency risk management ...

Senior Software Engineer - Flights

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
e.g., E2E/Cypress). Backend Excellence: Engineers sophisticated backend solutions involving API versioning, caching strategies, and complex data migration plans. Operational Maturity: Leads observability and SRE practices; defines SLOs, manages incident responses, and conducts blameless post‐mortems. Security & Risk: Oversees operational security, including secrets hygiene and dependency risk management ...

Field Solutions Engineer, Spacetime Integrations

Hiring Organisation
Jobleads-UK
Location
United Kingdom
Active U.S. SECRET clearance (or higher). Experience with networking/infrastructure platforms (routing/switching concepts, SDN/traffic engineering, network telemetry/observability). Experience integrating telecom, satellite, aerospace, or defense systems (terminals/modems, gateways, NMS/OSS/BSS, mission networks). Experience with system bring ...

Senior Software Engineer - Billing (VAT & Invoicing)

Hiring Organisation
Jobleads-UK
Location
City of Edinburgh, Scotland, United Kingdom
e.g., E2E/Cypress). Backend Excellence: Engineers sophisticated backend solutions involving API versioning, caching strategies, and complex data migration plans. Operational Maturity: Leads observability and SRE practices; defines SLOs, manages incident responses, and conducts blameless post‐mortems. Security & Risk: Oversees operational security, including secrets hygiene and dependency risk management ...

Senior Software Engineer - Flights

Hiring Organisation
Jobleads-UK
Location
Birmingham, England, United Kingdom
e.g., E2E/Cypress). Backend Excellence: Engineers sophisticated backend solutions involving API versioning, caching strategies, and complex data migration plans. Operational Maturity: Leads observability and SRE practices; defines SLOs, manages incident responses, and conducts blameless post-mortems. Security & Risk: Oversees operational security, including secrets hygiene and dependency risk management ...

Core Network Engineer 5G

Hiring Organisation
Eclaro
Location
Richardson, Texas, United States
Employment Type
Permanent
Salary
USD 125,000 Annual
supporting both internal network operations and contracted wholesale customer deployments across U.S. and international environments. Innovation is delivered through practical improvements in automation, reliability, observability, and deployment efficiency rather than strategic product ownership. Responsibilities: Core Network Design, Deployment and Management: Design, deploy, operate, and maintain 5G Core Network (5GC) components ...

Senior Software Engineer - Billing (VAT & Invoicing)

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
performance budgets, and comprehensive testing. Backend Excellence: Engineers sophisticated backend solutions involving API versioning, caching strategies, and complex data migration plans. Operational Maturity: Leads observability and SRE practices; defines SLOs, manages incident responses, and conducts blameless post‐mortems. Security & Risk: Oversees operational security, including secrets hygiene and dependency risk management ...

DoW Linux System Administrator (Mid/Senior) - 28244

Hiring Organisation
HII Mission Technologies Division
Location
Roanoke, Virginia, United States
Employment Type
Permanent
Salary
USD Annual
with virtualization platforms (VMware, Nutanix, or similar) Familiarity with DevSecOps toolchains and CI/CD pipelines (GitLab, Jenkins, Nexus, Artifactory) Strong background in monitoring, observability, and performance tuning Prior experience in a secure DoD, IC, or defense contractor environment Proven ability to lead incident response and root cause analysis ...

Senior Frontend Engineer II - eCommerce

Hiring Organisation
Prenuvo
Location
Bodega Bay, California, United States
Employment Type
Permanent
Salary
USD Annual
using a hook-first architecture (separating business logic from presentational components) and modern state patterns (optimistic updates, single-flight request patterns). Ensure Reliability & Observability: Set up and maintain Datadog RUM dashboards, implement comprehensive error tracking, and debug production issues to maintain high system reliability. Champion Testing & Quality: Write ...