Observability Jobs

1 to 25 of 147 Observability Jobs

Lead Site Reliability Engineer

Leeds, England, United Kingdom
Fruition IT
/SRE team. The Lead Site Reliability Engineer will lead the charge in selecting, configuring, and supporting Cloud Platform components and tooling. Proficiency in observability tech such as Grafana and Prometheus is essential. An ability to self-manage in both Agile and traditional delivery approaches is a key asset The … will be paramount for collaborating with stakeholders and mentoring team members. Key Skills Experience with GCP, AWS or Azure Leadership/management experience Terraform Observability tech such as Grafana/Prometheus Background in software engineering is an advantage If you are interested in the role please apply! We are an more »
Posted:

Lead Infrastructure Engineer

Cheltenham, England, United Kingdom
Yolk Recruitment Ltd
doing: Designing, building & maintaining secure infrastructure systems. Employ and adapt industry-leading techniques and best practices to suit customer challenges. Work within existing monitoring, observability and security frameworks. Engage regularly with customers to ensure projects are delivered on time & to agreed specifications. Leading & managing team members including on-boarding more »
Posted:

1st Sales Director in EMEA

Greater London, England, United Kingdom
Pivotal Partners
to calls, ultimately enhancing our service to its customers. Qualifications: 5+ years of experience in sales-oriented role MUST have sold a DevOps or Observability tool/product. (Observability, CloudNative, Kubernetes, CICD, DevSecOps) Exposure to customer service. Eagerness and aptitude for quickly grasping technical concepts. Aspiration to build a successful more »
Posted:

Site Reliability Engineer

London Area, United Kingdom
Acquire Me
similar frameworks/tech. Strong Automation & Config management tooling experience - preferred Ansible/Puppet/Terraform. Strong Linux troubleshooting skills Excellent practical experience with Observability systems. Excellent communicator - well versed in working directly with stakeholders. Excellent P&L linked bonus pay, generous base salary and comprehensive suite of benefits on more »
Posted:

Full Stack Engineer

United Kingdom
Templeton and Partners - Tech Recruitment
teams, engineers love to work with you - Experience developing high performance, highly available & scalable applications with a micro-services architecture and an understanding of observability - Design data models and define optimized tables/views in BigQuery and optimize SQL queries to improve performances or reduce costs - Actively contribute to the more »
Posted:

Observability Engineer

London Area, United Kingdom
Hybrid / WFH Options
Anaplan
that’s dedicated to creating opportunities for our customers, partners, and employees. We hope you’ll join us. Let’s create something incredible together! Observability Engineer At Anaplan we are looking for a self-motivated Observability Engineer to join our dedicated Observability Infrastructure team. Anaplan is a high-growth company … working people who believe in simplicity, agility and performance and can choose and use the best tools for the job. In the role of Observability Engineer, you will be working on the tools used to collect and analyse Observability telemetry (Logs, Metrics and Traces). You will enable engineers across … What you’ll be doing: In this role, working a minimum of 2 days a week in our London Office, you will be: Administering observability infrastructure. Deploying and configuring OTEL agents to collect telemetry, and to visualise this data in Grafana. Pairing with your colleagues to build everything from rapid more »
Posted:

Site Reliability Engineer

London Area, United Kingdom
TravelPerk
normal—and that’s where you come in! We are seeking a skilled Site Reliability Engineer (SRE) with experience in AWS, Serverless, Monitoring, and Observability to join our team. Responsibilities: Design, build and maintain scalable, and reliable cloud infrastructure in AWS Monitor and manage the performance, reliability, and security of more »
Posted:

Observability Engineer

London Area, United Kingdom
Atyeti Inc
About the job : We are seeking a dynamic and experienced Observability Engineer with expertise in any cloud, Grafana/Prometheus/Datadog Role & Responsibilities * Develop and improve instrumentation for monitoring and logging the health and availability of services. * Proactively monitor systems, networks, and applications to provide input in improving the more »
Posted:

Systems Engineer - Azure Cloud

Reading, Pennsylvania, United States
Penske Truck Leasing
environments. In this role, you will be working with Principal Cloud Architects, Cybersecurity Teams, and other Cloud Engineers to implement Governance, Automation, and Observability Solutions which will maintain Penske's Enterprise Standards & Cybersecurity for Azure and Microsoft 365. This will include the use of Azure Active Directory, Azure Policy, Automation … Work with Cybersecurity Team to adopt and enforce Cybersecurity Standards such as CIS, NIST • Work with Cybersecurity, Engineering, Development, and Operations Teams to enhance Observability with Azure Log Analytics, Azure Monitor, Azure Data Explorer, Synapse, Microsoft Sentinel • Use Azure Security Tools such as Microsoft Defender for Cloud to review environments more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Platinion Associate Director or Principal, Enterprise Solutions

Indonesia
Boston Consulting Group
vs. open source software. Approaches to managing Architectural debt, Architecture governance and evolution in practice Micro services topologies, including operational concerns such as resiliency, observability, discovery and routing, security etc. more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Platinion Associate Director or Principal, Enterprise Solutions

Singapore
Boston Consulting Group
vs. open source software. Approaches to managing Architectural debt, Architecture governance and evolution in practice Micro services topologies, including operational concerns such as resiliency, observability, discovery and routing, security etc. more »
Employment Type: Permanent
Salary: SGD Annual
Posted:

Platinion Associate Director or Principal, Enterprise Solutions

Kuala Lumpur, Federal Territories, Malaysia
Boston Consulting Group
vs. open source software. Approaches to managing Architectural debt, Architecture governance and evolution in practice Micro services topologies, including operational concerns such as resiliency, observability, discovery and routing, security etc. more »
Employment Type: Permanent
Salary: MYR Annual
Posted:

Infrastructure Engineer (WebSphere)

Illinois, United States
Discover Financial Services
the same) by using informal leadership & highly developed communication skills and contributes to or led technology communities Uses automation, system tools, open-source solutions, observability and 'security first' principles in daily work Achieves product commitments (and influences others to do the same) by using informal leadership & highly developed communication skills … and contributes to or led technology communities Uses automation, system tools, open-source solutions, observability and 'security first' principles in daily work Contributes to team agile ceremonies, leads demos and presentations, helps new engineers learn established norms Initiates high level solution design approaches, and guides team to achieve desired key … software delivery capabilities using automated, coded enterprise and observability Participates in internal speaking and advocacy events Supports research activities to adopt new technology solutions in ways of developing new capabilities Continues professional education and creates opportunities for core product teams to learn engineering best practices Coaches immediate chapter and actively more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Application Engineer (SRE)

Houston, Texas, United States
Discover Financial Services
our Application Develop teams to implement service level objectives Partner with our Application Development teams and other SREs to build out end to end observability Implement monitoring, alerting and dashboards needed for our apps Automate operational processes Help to develop our capacity management and performance management tools Help to define … mission critical environment Experience with programming and/or scripting languages (TAL, TACL, C, C++, Cobol 85, Python, Java, bash) Experience with monitoring and observability tools/technologies (i.e., Prognosis, Web ViewPoint, Grafana, Kibana, Datadog, AppDynamics) Creation of standardized monitoring dashboards in cloud platforms for proactive monitoring of application and more »
Employment Type: Permanent
Salary: USD Annual
Posted:

senior engineer, DPS AI/ML Platforms - ST

Seattle, Washington, United States
Starbucks
ARM Demonstrated expertise in Kubernetes Cluster Design, administration and operations Demonstrated expertise in CICD tooling such as Jenkins, Azure Dev Ops Demonstrated expertise in Observability and Proactive Alerting/Monitoring platforms like Datadog, PagerDuty and ServiceNow. Exposure to Azure services like Azure Databricks and Azure Machine Learning desirable This role … handbooks for all production models. Ability to perform Root Cause Analysis for production issues of ML Models. Ensuring platform and (model+app+data) health monitoring and observability framework modules/utils are developed and published for integration in ML Pipelines. Develop and enable self-serve model observability(model monitoring, drift monitoring , perf … SLA's) dashboards for platform and business stakeholders. Develop and maintain Data Quality and Data observability framework for integrations in ML pipelines. Build , develop and standardize integrations needed for Model Governance framework implementations. Demonstrate ability to collaborate with a geo-graphically spread vendor team supporting MLOps. Bring a passion to more »
Employment Type: Permanent
Salary: USD Annual
Posted:

principal production engineer- Retail Hardware

Seattle, Washington, United States
Starbucks
other connected devices to the retail environment, we need powerful new capabilities to deploy and manage them. The standard model of wiring up some observability tool and then letting the helpdesk respond to alerts barely scales to the store of today, and when one new thing is deployed to just … variety of devices, contributing and collaborating across domains including wireless and wired network connectivity, third-party platform integration, install planning and validation, automation and observability at hardware/OS/app layers, security, and day 2 fleet management & operational support Lead the production engineering team through the solution design process more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Lead Engineer SRE

Nottingham, Nottinghamshire, East Midlands, United Kingdom
Microlise
strong technical background, then we want to hear from you! As our new Lead Site Reliability Engineer , you will be key to maximising the observability of our infrastructure and applications, and to resolving error-prone manual processes through automation. Technologies you will be using include: Powershell, Python, Ansible, ELK Stack more »
Employment Type: Permanent
Salary: £55,000
Posted:

Monitoring and Observability Engineer

Oxfordshire, South East, United Kingdom
Hybrid / WFH Options
La Fosse Associates Ltd
Monitoring and Observability Engineer Salary - £50,000 - £55,000 - Fully remote role! Principal Accountabilities Design, implement, and manage monitoring solutions to ensure the availability, performance, and reliability of our systems. Collaborate with cross-functional teams to understand system requirements and implement effective monitoring strategies. Utilise expertise in Logic Monitor, OpenSearch … Proficient experience with other monitoring tools such as Dynatrace, New Relic, Splunk, Datadog, Nagios, Prometheus etc. Take ownership of the development of monitoring and observability practices Benefits include: 25 days holiday + statutory Competitive pension match Car allowance Family health care more »
Employment Type: Permanent, Work From Home
Salary: £55,000
Posted:

Senior Manager - SQL Server & Cloud Database Administration

Houston, Texas, United States
United Airlines
as well as with business partners and other technical leaders. Have laser sharp focus on our modernization agenda like cloud migration journey, resiliency, security, observability etc. Have broad decision-making authority and support to guide your team to success. Have the opportunity to grow and explore emerging technologies to help more »
Employment Type: Permanent
Salary: USD Annual
Posted:

IT Architect, Platinion (DigitalBCG)

Colombia
Boston Consulting Group
vs. open source software. • Approaches to managing Architectural debt, Architecture governance and evolution in practice • Micro services topologies, including operational concerns such as resiliency, observability, discovery and routing, security etc. • Have experience with, and understand how to lead, legacy integration and remediation (facades, strangler approaches, et. al.) • Deep understanding of more »
Employment Type: Permanent
Salary: USD Annual
Posted:

Kubernetes Platform Engineer | On-Prem/Public Cloud CloudNative

Greater London, England, United Kingdom
Selby Jennings
applications, with a specific emphasis on Linux RedHat and hands-on experience with RedHat Satellite. Familiarity with Grafana, CI/CD monitoring, and the observability stack, particularly Ansible. Strong understanding of Agile, Site Reliability Engineering (SRE), and DevOps principles and practices. Excellent communication and interpersonal skills for effective collaboration within more »
Posted:

DevOps Consultant

London Area, United Kingdom
Hybrid / WFH Options
McCabe & Barton
infrastructure as code. Implement and maintain CI/CD pipelines using GitLab CI/CD and Jenkins. Manage and monitor SRE systems, including log observability, Application Performance Monitoring (APM), infrastructure monitoring, and security. Proficient in working with Kubernetes for container orchestration and management. Experienced with AWS Cloud services and infrastructure more »
Posted:

Site Reliability Engineer

London Area, United Kingdom
Durlston Partners
the transition from design and construction to maintenance and monitoring has left you yearning for more. Requirements Proficient Python coder. Kubernetes experience. Expertise in Observability platforms and tooling such as Grafana and Prometheus. At least 5 years of experience, ideally in a start up or scale up. We understand that more »
Posted:

Technology Architect

Leeds, England, United Kingdom
Infosys
.Own and manage Architecture evolution + progressive feature elaboration of an application/product (Kinisi) Strongly preferred skills: OpenSearch, Prometheus, Grafana, log management and observability tools/technologies, Java - API/Microservices, UI/UX, Kubernetes, Temporal, Apache Nifi Soft Skills: .Strong skills in communicating architecture, pros and cons of more »
Posted:

Principal Software Engineer

Manchester Area, United Kingdom
Candour Solutions
navigating cloud-based environments effectively. Proficiency in both Python and TypeScript languages. Versatility in employing diverse testing methodologies tailored to specific contexts. Familiarity with observability concepts and terminology essential for ensuring operational software efficiency. Robust understanding of cloud security principles and best practices. Experience with infrastructure as code (e.g. terraform more »
Posted:
Observability
10th Percentile
£50,000
25th Percentile
£64,500
Median
£80,000
75th Percentile
£91,250
90th Percentile
£110,000