76 to 100 of 496 Permanent Observability Jobs

Senior DevOps Engineer (Product)

Hiring Organisation
Hive Science
Location
London Area, United Kingdom
monitoring for our AI/ML products. • Automate infrastructure provisioning (Terraform), configuration management, and deployment processes using scripting (Bash, Python) and automation tools. Monitoring, Observability & Reliability: • Implement comprehensive monitoring, logging, and alerting systems (Prometheus, Grafana, CloudWatch, Datadog, Sentry) to ensure system reliability and rapid incident response. • Establish SLOs/SLIs … implement observability best practices to maintain high availability and performance. • Lead incident response, root cause analysis, and implement preventive measures to improve system resilience. Security & Governance: • Implement and maintain security best practices including network security, firewalls, role-based access control (IAM), encryption at rest and in transit, and secrets management ...

Senior DevOps Engineer | AI‐Driven SaaS | Azure | Tech Scale Up | Oxfordshire

Hiring Organisation
Opus Recruitment Solutions
Location
Oxfordshire, England, United Kingdom
highly collaborative engineering team, scaling modern cloud infrastructure for a global SaaS platform. You’ll work across Azure and Kubernetes, working on platform observability, automation, and secure, compliant cloud systems powering advanced voice‐driven products used by thousands of professionals. What’s on offer: competitive salary based on experience, flexible ...

Node.js developer

Hiring Organisation
act digital
Location
Portugal
Employment Type
Permanent
Salary
EUR Annual
ones You are motivated by solving complex, real-world engineering problems Nice to have: Strong experience with testing & QA practices (unit, integration, E2E) Observability and monitoring (logging, metrics, tracing) Experience working in platform teams or internal product teams Experience modernising legacy systems Experience with cloud-native architectures ...

GCP Engineer

Hiring Organisation
Hamilton Barnes 🌳
Location
England, United Kingdom
using Terraform Build and maintain CI/CD pipelines for Kubernetes-based deployments Implement and manage Istio service mesh for traffic management, security, and observability Support containerized microservices at scale Required Skills & Experience (Must-Have) Strong hands-on experience with GKE in production Solid Kubernetes fundamentals (networking, RBAC, scaling, troubleshooting ...

Software Engineering Manager

Hiring Organisation
Halian Technology Limited
Location
Central London, London, United Kingdom
Employment Type
Permanent
8-15 engineers depending on level). Build a strong engineering culture centered on ownership, quality, and continuous improvement. Conduct performance reviews, career development planning, and succession planning. Hire and retain top engineering talent. Technical Oversight (Close to the Code) Guide architectural decisions for distributed, high-availability payment systems. Ensure high standards of code quality, system reliability, scalability, and security. Review design documents and participate in critical code reviews when necessary. Partner with Staff and Principal Engineers to evolve platform architecture. Drive engineering best practices (CI/CD, observability, test automation, DevSecOps). Delivery & Execution Own roadmap execution in collaboration with Product and Design. Balance speed of delivery with regulatory, security, and operational requirements. Ensure predictable delivery of complex, cross-functional initiatives. Manage technical debt strategically. Payments & Regulatory Focus Oversee development of payment processing systems, APIs, ledger systems, fraud prevention tools, and compliance-related services. Ensure systems meet financial regulatory and data security requirements (e.g., PCI DSS ...

Senior Developer React

Hiring Organisation
Get2Talent
Location
Cambridge, Cambridgeshire, East Anglia, United Kingdom
Employment Type
Permanent
Salary
£85,000
Work primarily with: TypeScript (Node.js & React) Monorepo tooling, GitHub, GitHub Actions Jest, Playwright Redis, MSSQL, WebSockets Docker, Kubernetes Observability tools (Grafana, Prometheus, SonarQube) Take end-to-end ownership of features from design to production. Collaborate closely with platform and DevOps engineers on build pipelines, observability, and operational concerns. Communicate directly with clients to clarify requirements and propose solutions. Contribute to and improve automated testing practices. Participate in peer code reviews and maintain high engineering standards. Leverage LLM/AI-enabled development tools as part of day-to-day development. Requirements 8+ years of professional software development experience. 3+ years hands-on experience with TypeScript, Node.js, and React. Good experience building and maintaining production systems. ...

DevOps Engineer

Hiring Organisation
Anson Mccade
Location
Newcastle Upon Tyne, Tyne and Wear, North East, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£50,000
cloud). Skilled with Infrastructure as Code and deployment automation. Comfortable in Linux environments and working with scripting languages. Familiar with monitoring, logging, and observability tools. Knowledge of Agile and Lean software delivery methodologies. Confident, proactive, and able to work collaboratively within a technical team. Nice to Have: Experience leading ...

Staff Machine Learning Engineer - AI Imaging

Hiring Organisation
Discover International
Location
United States
Employment Type
Permanent
Salary
USD Annual
stores, and automation tooling to accelerate ML experimentation and deployment. Build and maintain pipelines for data ingestion, training, evaluation, and model serving. Implement monitoring, observability, and reliability frameworks to ensure production-grade performance. Optimise performance across the ML stack - balancing latency, scalability, and cost-efficiency. Requirements: Experience building and deploying ...

Senior Platform Engineer

Hiring Organisation
Fruition Group
Location
City of London, London, United Kingdom
Designing and operating agent-based and multi-model workflows Building and scaling RAG pipelines, retrieval services, and vector search infrastructure Improving the reliability, latency, observability, and cost-efficiency of AI workloads Putting the right security, governance, and operational controls in place Automating deployment and platform operations using Kubernetes, Terraform ...

Senior Software Engineer (PHP)

Hiring Organisation
Iris Recruitment
Location
Slough, Berkshire, South East, United Kingdom
Employment Type
Permanent
feature delivery Partnering with Product on roadmap and prioritisation Mentoring and supporting junior engineers Leading best practice adoption (testing, CI/CD, observability) Contributing to architectural decisions Managing technical debt and platform health Supporting incident resolution and continuous improvement Our Tech Stack Core: PHP (Laravel) ReactJS JavaScript Relational databases Kubernetes ...

Senior AI Engineer (Platform)

Hiring Organisation
Harnham
Location
New Malden, England, United Kingdom
Garden) Strong Python and API development experience Hands‐on with LLMs, embeddings, RAG, vector databases Proven track record building production‐ready AI systems with observability and testing Solid DevOps capability: CI/CD, Docker, Kubernetes Work Environment & Location 1–2 days per week on‐site in either: South West London ...

Senior Java Software Engineer

Hiring Organisation
Synchro
Location
London Area, United Kingdom
Java/Kotlin (JVM ecosystem) Microservices architecture Cloud-native (AWS/GCP) Spring/Spring Boot Docker & Kubernetes Messaging & streaming technologies Distributed tracing and observability tooling What you’ll be doing Designing and building scalable, resilient distributed systems Contributing to architectural decisions across greenfield platforms Writing clean, secure, and maintainable ...

Senior Data Engineer

Hiring Organisation
Harnham - Data & Analytics Recruitment
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£80,000 - £90,000 per annum, Inc benefits
pipelines within a large scale AWS environment. * Build well structured data models and curated layers to support reporting and analytics. * Improve data quality, observability, governance, and overall platform reliability. * Lead architectural decisions and support a review of existing workflows and pipelines. * Contribute to ongoing transformation and migration projects. * Provide technical ...

Site Reliability Engineer

Hiring Organisation
Anson Mccade
Location
Gloucester, Gloucestershire, South West, United Kingdom
Employment Type
Permanent
Salary
£65,000
Replace repetitive manual labor with innovative automated solutions. Consultative Engineering: Work alongside product teams to advise on best practices for system design and resilience. Observability: Instrument applications to improve monitoring and use data-driven insights to demonstrate daily system improvements. Systems Architecture: Leverage your understanding of the relationship between software ...

Software Engineer (Frontend)

Hiring Organisation
Orchestra
Location
City of London, London, United Kingdom
least five years in commercial software engineering 🚀 About Orchestra Orchestra is an AI-native data pipelining platform building the future of data orchestration and observability. We have built a declarative framework allowing data engineers to explicitly declare how their data pipelines behave. Think CI/CD but for data ...

Senior Cloud Engineer (Azure, Terraform, Kubernetes)

Hiring Organisation
Method-Resourcing
Location
City of London, London, United Kingdom
Employment Type
Permanent
into Azure. Delivering a blend of lift-and-shift for Windows-based virtual machines and containerisation into AKS. Post-migration work focused on improving observability, resilience, disaster recovery, and platform scalability. Working closely with developers to design and operate reliable, cloud-native solutions. What we're looking for: Production experience ...

Artificial Intelligence Engineer

Hiring Organisation
LT Harper Recruitment Group
Location
City of London, London, United Kingdom
tooling and infrastructure. What you’ll be doing Designing, building, and maintaining production-grade AI applications using modern engineering practices (CI/CD, testing, observability, cloud-native design). Developing reusable AI platforms and tools (e.g. conversational bots, AI-powered search, unstructured data processing, GenAI solutions). Working in cross ...

Senior Cloud Engineer (Azure, Terraform, Kubernetes)

Hiring Organisation
Method Resourcing
Location
London, South East, England, United Kingdom
Employment Type
Full-Time
Salary
£90,000 - £100,000 per annum
into Azure. Delivering a blend of lift-and-shift for Windows-based virtual machines and containerisation into AKS. Post-migration work focused on improving observability, resilience, disaster recovery, and platform scalability. Working closely with developers to design and operate reliable, cloud-native solutions. What we're looking for: Production experience ...

Platform Engineer

Hiring Organisation
Hireful
Location
Peterlee, County Durham, North East, United Kingdom
Employment Type
Permanent
Salary
£60,000
container configuration and Helm charts Code written in C#/.NET Experience with scripting tools such as PowerShell, Bash, Python, or Bicep Understanding of observability & monitoring platforms If you’re looking to progress your DevOps career within a collaborative, forward-thinking engineering team, this is an excellent opportunity to make ...

Platform Engineer

Hiring Organisation
hireful
Location
Peterlee / Work from home, County Durham, United Kingdom
Employment Type
Permanent
Salary
£50,000 - £63,000/annum, £50K - £63K Basic (DoE) + Bonus + Ex
container configuration and Helm charts Code written in C#/.NET Experience with scripting tools such as PowerShell, Bash, Python, or Bicep Understanding of observability & monitoring platforms If you’re looking to progress your DevOps career within a collaborative, forward-thinking engineering team, this is an excellent opportunity to make ...

Senior Data Engineer

Hiring Organisation
ISR Recruitment Ltd
Location
Manchester, Lancashire, United Kingdom
Employment Type
Permanent
Salary
GBP 60,000 - 75,000 Annual
data architecture principles across data lakes, warehouses and event-driven solutions Develop and maintain streaming pipelines using technologies such as Kafka Implement monitoring and observability solutions using tooling such as Prometheus and Grafana Ensure data quality, validation and governance processes are built into engineering workflows Act as a trusted technical ...

DevOps Engineer

Hiring Organisation
Synoptix
Location
United Kingdom
Employment Type
Permanent
Salary
GBP 40,000 - 50,000 Annual
improvement of key systems. Essential Skills: Knowledge of DevOps practices including: CI/CD pipeline design and automation Containerisation and orchestration Monitoring and observability tools Experience in the defence or advanced technology sector Experience with GPU based computer environments Experience with MLOps and associated tooling Experience with data pipelines Experience ...

DevOps Engineer

Hiring Organisation
Synoptix Limited
Location
Bristol, Avon, South West, United Kingdom
Employment Type
Permanent, Work From Home
Salary
£50,000
improvement of key systems. Essential Skills: Knowledge of DevOps practices including: CI/CD pipeline design and automation Containerisation and orchestration Monitoring and observability tools Experience in the defence or advanced technology sector Experience with GPU based computer environments Experience with MLOps and associated tooling Experience with data pipelines Experience ...

Senior Principal Architect — IaaS & Compute Platforms

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
consumption of compute infrastructure. Conducts architecture and design reviews; document decisions, risks, and exceptions; present in governance forums and lead technical discussions. Drives platform observability and capacity planning; define approaches for resource utilization, performance monitoring, and cost optimization. Promotes effective ways of working and team productivity; support backlog management ...

AI Engineer

Hiring Organisation
Stackstudio Digital Ltd
Location
United Kingdom
Employment Type
Permanent
maintain MLOps/LLMOps workflows: CI/CD for models and prompts, model registry/versioning, feature stores, and automated promotion across environments Instrument observability for data, models, and prompts (telemetry, metrics, traces, dashboards, alerts); implement A/B tests and online/offline evaluation Embed Responsible AI considerations (fairness ...