Middlesex, south east england, united kingdom Hybrid/Remote Options
Sky
networking and security standards, protocols and best practices Proven experience in logging systems (e.g. ELK stack ) Proven experience in monitoring systems (e.g. Prometheus ) Proven experience in tracing systems (e.g. OpenTelemetry , Jaeger) Experience in performance optimization and resource management Relevant certifications (AWS, Google) Understanding of Agile methodologies Ability to diagnose and resolve service- affecting issues in a Broadcast/Livestream environment More ❯
Sheffield, South Yorkshire, Yorkshire, United Kingdom Hybrid/Remote Options
Vallum Associates Limited
Preferred Qualifications: OpenShift certifications (e.g., Red Hat Certified Specialist in OpenShift Administration). Experience with multi-cluster and hybrid cloud OpenShift deployments. Familiarity with monitoring and logging tools (e.g., oTel, Grafana, Splunk stack). Knowledge of OpenShift Operators and Helm charts. Experience with large-scale migration projects. More ❯
development. Familiarity with testing frameworks (Vitest, Playwright) for both API and end-to-end testing. Experience with Docker, Helm, YAML, Kubernetes, and cloud-native deployments. Telemetry tools; Prometheus, Grafana, OpenTelemetry, DataDog, APM tools Understanding of infrastructure-as-code and CI/CD pipelines. Ability to improve codebases and influence architectural direction. Experience mentoring or coaching engineers. Please send updated CV More ❯
Wigan, Lancashire, England, United Kingdom Hybrid/Remote Options
Searchability
or .NET preferred) * Cloud experience, ideally AWS, and knowledge of container orchestration (Kubernetes) and Infrastructure as Code (Terraform) * Experience with monitoring and observability tools such as Grafana, Prometheus or OpenTelemetry * Strong understanding of networking fundamentals and distributed systems* Ability to collaborate effectively with engineering, operations and product teams TO BE CONSIDERED: Please either apply through this advert or email me More ❯
Wigan, Greater Manchester, North West, United Kingdom Hybrid/Remote Options
Searchability (UK) Ltd
or .NET preferred) * Cloud experience, ideally AWS, and knowledge of container orchestration (Kubernetes) and Infrastructure as Code (Terraform) * Experience with monitoring and observability tools such as Grafana, Prometheus or OpenTelemetry * Strong understanding of networking fundamentals and distributed systems * Ability to collaborate effectively with engineering, operations and product teams TO BE CONSIDERED: Please either apply through this advert or email me More ❯
Lambda, Glue, Redshift, OpenSearch) • Hands-on experience deploying AI/LLM-based systems into production • Experience using dbt Cloud for transformation pipelines • Familiarity with tracing and observability (e.g., Langfuse, OpenTelemetry) • Experience preparing datasets and running supervised fine-tuning (SFT) of LLMs • Exposure to reverse ETL tools (e.g., Census, Hightouch) or building custom syncs to HubSpot, Slack, APIs Responsibilities: AI & Application More ❯
Warwick, Warwickshire, West Midlands, United Kingdom Hybrid/Remote Options
Sanderson Government and Defence
ElasticSearch clusters, Kibana dashboards, and Logstash pipelines. Integrate SIEM with cloud-native observability tools (AWS CloudWatch, Azure Monitor, GCP Operations Suite). Automate log collection and enrichment using Beats, OpenTelemetry, and scripting. Security Use Cases & Threat Detection Build and maintain SIEM use cases, alerts, and dashboards for threat detection. Map detection rules to frameworks like MITRE ATT&CK, STRIDE, and More ❯
Austin, Texas, United States Hybrid/Remote Options
Charles Schwab
on site in the specified location(s). This role is responsible for supporting and maintaining enterprise monitoring and telemetry platforms; Confluent Enterprise Platform (i.e., Kafka), ITRS Geneos, and OpenTelemetry telemetry pipeline as a member of the Enterprise Telemetry team. Activities include supporting Kafka producers and consumers, ITRS agent administration, OTEL pipeline management, troubleshooting and resolving issues, identifying opportunities for … include: On-boarding new Kafka producer and consumer use cases. Engineering and supporting the enterprise telemetry pipeline Testing and deploying software upgrades. Managing and supporting telemetry agents. Support of OpenTelemetry collectors Issue troubleshooting and resolution. What you have Deep understanding of the Confluent Enterprise Platform component: Brokers, Topics, Partitions, Producers, Consumers, Zookeeper, KRaft. Ability to setup and configure on-prem … Kafka components, replication factors, and partitioning. E xperience engineering logging platforms Understanding of telemetry monitoring platforms and concepts, like ITRS Geneos, OpenTelemetry agents like Grafana Alloy. Grafana Cloud and Datadog. Deep understanding of security protocols: SSL/TLS, SASL, LDAP, etc. and role-based authentication. Experience working in telemetry monitoring (alerts, events, logs, metrics, and traces). Experience working in More ❯
on site in the specified location(s). This role is responsible for supporting and maintaining enterprise monitoring and telemetry platforms; Confluent Enterprise Platform (i.e., Kafka), ITRS Geneos, and OpenTelemetry telemetry pipeline as a member of the Enterprise Telemetry team. Activities include supporting Kafka producers and consumers, ITRS agent administration, OTEL pipeline management, troubleshooting and resolving issues, identifying opportunities for … include: On-boarding new Kafka producer and consumer use cases. Engineering and supporting the enterprise telemetry pipeline Testing and deploying software upgrades. Managing and supporting telemetry agents. Support of OpenTelemetry collectors Issue troubleshooting and resolution. What you have Deep understanding of the Confluent Enterprise Platform component: Brokers, Topics, Partitions, Producers, Consumers, Zookeeper, KRaft. Ability to setup and configure on-prem … Kafka components, replication factors, and partitioning. E xperience engineering logging platforms Understanding of telemetry monitoring platforms and concepts, like ITRS Geneos, OpenTelemetry agents like Grafana Alloy. Grafana Cloud and Datadog. Deep understanding of security protocols: SSL/TLS, SASL, LDAP, etc. and role-based authentication. Experience working in telemetry monitoring (alerts, events, logs, metrics, and traces). Experience working in More ❯
Bellevue, Iowa, United States Hybrid/Remote Options
Charles Schwab
on site in the specified location(s). This role is responsible for supporting and maintaining enterprise monitoring and telemetry platforms; Confluent Enterprise Platform (i.e., Kafka), ITRS Geneos, and OpenTelemetry telemetry pipeline as a member of the Enterprise Telemetry team. Activities include supporting Kafka producers and consumers, ITRS agent administration, OTEL pipeline management, troubleshooting and resolving issues, identifying opportunities for … include: On-boarding new Kafka producer and consumer use cases. Engineering and supporting the enterprise telemetry pipeline Testing and deploying software upgrades. Managing and supporting telemetry agents. Support of OpenTelemetry collectors Issue troubleshooting and resolution. What you have Deep understanding of the Confluent Enterprise Platform component: Brokers, Topics, Partitions, Producers, Consumers, Zookeeper, KRaft. Ability to setup and configure on-prem … Kafka components, replication factors, and partitioning. E xperience engineering logging platforms Understanding of telemetry monitoring platforms and concepts, like ITRS Geneos, OpenTelemetry agents like Grafana Alloy. Grafana Cloud and Datadog. Deep understanding of security protocols: SSL/TLS, SASL, LDAP, etc. and role-based authentication. Experience working in telemetry monitoring (alerts, events, logs, metrics, and traces). Experience working in More ❯
Pflugerville, Texas, United States Hybrid/Remote Options
Charles Schwab
on site in the specified location(s). This role is responsible for supporting and maintaining enterprise monitoring and telemetry platforms; Confluent Enterprise Platform (i.e., Kafka), ITRS Geneos, and OpenTelemetry telemetry pipeline as a member of the Enterprise Telemetry team. Activities include supporting Kafka producers and consumers, ITRS agent administration, OTEL pipeline management, troubleshooting and resolving issues, identifying opportunities for … include: On-boarding new Kafka producer and consumer use cases. Engineering and supporting the enterprise telemetry pipeline Testing and deploying software upgrades. Managing and supporting telemetry agents. Support of OpenTelemetry collectors Issue troubleshooting and resolution. What you have Deep understanding of the Confluent Enterprise Platform component: Brokers, Topics, Partitions, Producers, Consumers, Zookeeper, KRaft. Ability to setup and configure on-prem … Kafka components, replication factors, and partitioning. E xperience engineering logging platforms Understanding of telemetry monitoring platforms and concepts, like ITRS Geneos, OpenTelemetry agents like Grafana Alloy. Grafana Cloud and Datadog. Deep understanding of security protocols: SSL/TLS, SASL, LDAP, etc. and role-based authentication. Experience working in telemetry monitoring (alerts, events, logs, metrics, and traces). Experience working in More ❯
Papillion, Nebraska, United States Hybrid/Remote Options
Charles Schwab
on site in the specified location(s). This role is responsible for supporting and maintaining enterprise monitoring and telemetry platforms; Confluent Enterprise Platform (i.e., Kafka), ITRS Geneos, and OpenTelemetry telemetry pipeline as a member of the Enterprise Telemetry team. Activities include supporting Kafka producers and consumers, ITRS agent administration, OTEL pipeline management, troubleshooting and resolving issues, identifying opportunities for … include: On-boarding new Kafka producer and consumer use cases. Engineering and supporting the enterprise telemetry pipeline Testing and deploying software upgrades. Managing and supporting telemetry agents. Support of OpenTelemetry collectors Issue troubleshooting and resolution. What you have Deep understanding of the Confluent Enterprise Platform component: Brokers, Topics, Partitions, Producers, Consumers, Zookeeper, KRaft. Ability to setup and configure on-prem … Kafka components, replication factors, and partitioning. E xperience engineering logging platforms Understanding of telemetry monitoring platforms and concepts, like ITRS Geneos, OpenTelemetry agents like Grafana Alloy. Grafana Cloud and Datadog. Deep understanding of security protocols: SSL/TLS, SASL, LDAP, etc. and role-based authentication. Experience working in telemetry monitoring (alerts, events, logs, metrics, and traces). Experience working in More ❯
Omaha, Nebraska, United States Hybrid/Remote Options
Charles Schwab
on site in the specified location(s). This role is responsible for supporting and maintaining enterprise monitoring and telemetry platforms; Confluent Enterprise Platform (i.e., Kafka), ITRS Geneos, and OpenTelemetry telemetry pipeline as a member of the Enterprise Telemetry team. Activities include supporting Kafka producers and consumers, ITRS agent administration, OTEL pipeline management, troubleshooting and resolving issues, identifying opportunities for … include: On-boarding new Kafka producer and consumer use cases. Engineering and supporting the enterprise telemetry pipeline Testing and deploying software upgrades. Managing and supporting telemetry agents. Support of OpenTelemetry collectors Issue troubleshooting and resolution. What you have Deep understanding of the Confluent Enterprise Platform component: Brokers, Topics, Partitions, Producers, Consumers, Zookeeper, KRaft. Ability to setup and configure on-prem … Kafka components, replication factors, and partitioning. E xperience engineering logging platforms Understanding of telemetry monitoring platforms and concepts, like ITRS Geneos, OpenTelemetry agents like Grafana Alloy. Grafana Cloud and Datadog. Deep understanding of security protocols: SSL/TLS, SASL, LDAP, etc. and role-based authentication. Experience working in telemetry monitoring (alerts, events, logs, metrics, and traces). Experience working in More ❯
Taylor, Texas, United States Hybrid/Remote Options
Charles Schwab
on site in the specified location(s). This role is responsible for supporting and maintaining enterprise monitoring and telemetry platforms; Confluent Enterprise Platform (i.e., Kafka), ITRS Geneos, and OpenTelemetry telemetry pipeline as a member of the Enterprise Telemetry team. Activities include supporting Kafka producers and consumers, ITRS agent administration, OTEL pipeline management, troubleshooting and resolving issues, identifying opportunities for … include: On-boarding new Kafka producer and consumer use cases. Engineering and supporting the enterprise telemetry pipeline Testing and deploying software upgrades. Managing and supporting telemetry agents. Support of OpenTelemetry collectors Issue troubleshooting and resolution. What you have Deep understanding of the Confluent Enterprise Platform component: Brokers, Topics, Partitions, Producers, Consumers, Zookeeper, KRaft. Ability to setup and configure on-prem … Kafka components, replication factors, and partitioning. E xperience engineering logging platforms Understanding of telemetry monitoring platforms and concepts, like ITRS Geneos, OpenTelemetry agents like Grafana Alloy. Grafana Cloud and Datadog. Deep understanding of security protocols: SSL/TLS, SASL, LDAP, etc. and role-based authentication. Experience working in telemetry monitoring (alerts, events, logs, metrics, and traces). Experience working in More ❯
Cedar Park, Texas, United States Hybrid/Remote Options
Charles Schwab
on site in the specified location(s). This role is responsible for supporting and maintaining enterprise monitoring and telemetry platforms; Confluent Enterprise Platform (i.e., Kafka), ITRS Geneos, and OpenTelemetry telemetry pipeline as a member of the Enterprise Telemetry team. Activities include supporting Kafka producers and consumers, ITRS agent administration, OTEL pipeline management, troubleshooting and resolving issues, identifying opportunities for … include: On-boarding new Kafka producer and consumer use cases. Engineering and supporting the enterprise telemetry pipeline Testing and deploying software upgrades. Managing and supporting telemetry agents. Support of OpenTelemetry collectors Issue troubleshooting and resolution. What you have Deep understanding of the Confluent Enterprise Platform component: Brokers, Topics, Partitions, Producers, Consumers, Zookeeper, KRaft. Ability to setup and configure on-prem … Kafka components, replication factors, and partitioning. E xperience engineering logging platforms Understanding of telemetry monitoring platforms and concepts, like ITRS Geneos, OpenTelemetry agents like Grafana Alloy. Grafana Cloud and Datadog. Deep understanding of security protocols: SSL/TLS, SASL, LDAP, etc. and role-based authentication. Experience working in telemetry monitoring (alerts, events, logs, metrics, and traces). Experience working in More ❯
Round Rock, Texas, United States Hybrid/Remote Options
Charles Schwab
on site in the specified location(s). This role is responsible for supporting and maintaining enterprise monitoring and telemetry platforms; Confluent Enterprise Platform (i.e., Kafka), ITRS Geneos, and OpenTelemetry telemetry pipeline as a member of the Enterprise Telemetry team. Activities include supporting Kafka producers and consumers, ITRS agent administration, OTEL pipeline management, troubleshooting and resolving issues, identifying opportunities for … include: On-boarding new Kafka producer and consumer use cases. Engineering and supporting the enterprise telemetry pipeline Testing and deploying software upgrades. Managing and supporting telemetry agents. Support of OpenTelemetry collectors Issue troubleshooting and resolution. What you have Deep understanding of the Confluent Enterprise Platform component: Brokers, Topics, Partitions, Producers, Consumers, Zookeeper, KRaft. Ability to setup and configure on-prem … Kafka components, replication factors, and partitioning. E xperience engineering logging platforms Understanding of telemetry monitoring platforms and concepts, like ITRS Geneos, OpenTelemetry agents like Grafana Alloy. Grafana Cloud and Datadog. Deep understanding of security protocols: SSL/TLS, SASL, LDAP, etc. and role-based authentication. Experience working in telemetry monitoring (alerts, events, logs, metrics, and traces). Experience working in More ❯
La Vista, Nebraska, United States Hybrid/Remote Options
Charles Schwab
on site in the specified location(s). This role is responsible for supporting and maintaining enterprise monitoring and telemetry platforms; Confluent Enterprise Platform (i.e., Kafka), ITRS Geneos, and OpenTelemetry telemetry pipeline as a member of the Enterprise Telemetry team. Activities include supporting Kafka producers and consumers, ITRS agent administration, OTEL pipeline management, troubleshooting and resolving issues, identifying opportunities for … include: On-boarding new Kafka producer and consumer use cases. Engineering and supporting the enterprise telemetry pipeline Testing and deploying software upgrades. Managing and supporting telemetry agents. Support of OpenTelemetry collectors Issue troubleshooting and resolution. What you have Deep understanding of the Confluent Enterprise Platform component: Brokers, Topics, Partitions, Producers, Consumers, Zookeeper, KRaft. Ability to setup and configure on-prem … Kafka components, replication factors, and partitioning. E xperience engineering logging platforms Understanding of telemetry monitoring platforms and concepts, like ITRS Geneos, OpenTelemetry agents like Grafana Alloy. Grafana Cloud and Datadog. Deep understanding of security protocols: SSL/TLS, SASL, LDAP, etc. and role-based authentication. Experience working in telemetry monitoring (alerts, events, logs, metrics, and traces). Experience working in More ❯
Council Bluffs, Iowa, United States Hybrid/Remote Options
Charles Schwab
on site in the specified location(s). This role is responsible for supporting and maintaining enterprise monitoring and telemetry platforms; Confluent Enterprise Platform (i.e., Kafka), ITRS Geneos, and OpenTelemetry telemetry pipeline as a member of the Enterprise Telemetry team. Activities include supporting Kafka producers and consumers, ITRS agent administration, OTEL pipeline management, troubleshooting and resolving issues, identifying opportunities for … include: On-boarding new Kafka producer and consumer use cases. Engineering and supporting the enterprise telemetry pipeline Testing and deploying software upgrades. Managing and supporting telemetry agents. Support of OpenTelemetry collectors Issue troubleshooting and resolution. What you have Deep understanding of the Confluent Enterprise Platform component: Brokers, Topics, Partitions, Producers, Consumers, Zookeeper, KRaft. Ability to setup and configure on-prem … Kafka components, replication factors, and partitioning. E xperience engineering logging platforms Understanding of telemetry monitoring platforms and concepts, like ITRS Geneos, OpenTelemetry agents like Grafana Alloy. Grafana Cloud and Datadog. Deep understanding of security protocols: SSL/TLS, SASL, LDAP, etc. and role-based authentication. Experience working in telemetry monitoring (alerts, events, logs, metrics, and traces). Experience working in More ❯
San Marcos, Texas, United States Hybrid/Remote Options
Charles Schwab
on site in the specified location(s). This role is responsible for supporting and maintaining enterprise monitoring and telemetry platforms; Confluent Enterprise Platform (i.e., Kafka), ITRS Geneos, and OpenTelemetry telemetry pipeline as a member of the Enterprise Telemetry team. Activities include supporting Kafka producers and consumers, ITRS agent administration, OTEL pipeline management, troubleshooting and resolving issues, identifying opportunities for … include: On-boarding new Kafka producer and consumer use cases. Engineering and supporting the enterprise telemetry pipeline Testing and deploying software upgrades. Managing and supporting telemetry agents. Support of OpenTelemetry collectors Issue troubleshooting and resolution. What you have Deep understanding of the Confluent Enterprise Platform component: Brokers, Topics, Partitions, Producers, Consumers, Zookeeper, KRaft. Ability to setup and configure on-prem … Kafka components, replication factors, and partitioning. E xperience engineering logging platforms Understanding of telemetry monitoring platforms and concepts, like ITRS Geneos, OpenTelemetry agents like Grafana Alloy. Grafana Cloud and Datadog. Deep understanding of security protocols: SSL/TLS, SASL, LDAP, etc. and role-based authentication. Experience working in telemetry monitoring (alerts, events, logs, metrics, and traces). Experience working in More ❯
Wigan, Lancashire, England, United Kingdom Hybrid/Remote Options
Searchability
SITE RELIABILITY ENGINEER ESSENTIAL SKILLS At least 2 years' experience working as an SRE Deep understanding of system reliability, scalability and performance tuning Experience with observability tools (Grafana, Prometheus, OpenTelemetry) Proficiency in a programming language such as Go or .NET for automation and debugging Hands-on experience with AWS or another major cloud platform Knowledge of Kubernetes, Terraform, and Infrastructure … process and submit (subject to required skills) your application to our client in conjunction with this vacancy only. KEY SKILLS SRE, Site Reliability Engineering, AWS, Kubernetes, Terraform, Grafana, Prometheus, OpenTelemetry, Go, .NET, Cloud Infrastructure, Observability, CI/CD, DevOps, Automation, Performance Tuning, Incident Management More ❯
Wigan, Greater Manchester, United Kingdom Hybrid/Remote Options
Searchability (UK) Ltd
SITE RELIABILITY ENGINEER ESSENTIAL SKILLS At least 2 years' experience working as an SRE Deep understanding of system reliability, scalability and performance tuning Experience with observability tools (Grafana, Prometheus, OpenTelemetry) Proficiency in a programming language such as Go or .NET for automation and debugging Hands-on experience with AWS or another major cloud platform Knowledge of Kubernetes, Terraform, and Infrastructure … process and submit (subject to required skills) your application to our client in conjunction with this vacancy only. KEY SKILLS SRE, Site Reliability Engineering, AWS, Kubernetes, Terraform, Grafana, Prometheus, OpenTelemetry, Go, .NET, Cloud Infrastructure, Observability, CI/CD, DevOps, Automation, Performance Tuning, Incident Management More ❯
rebuilding nearly every component of our observability platform, from data collection to real-time analytics. You will drive core initiatives that move Twilio from fragmented tooling to a unified, OpenTelemetry-first observability stack built for scale. You'll lead technically and strategically-designing platform components, influencing architectural decisions, mentoring engineers, and engaging with teams across Platform Engineering and R&D. … workflows. Design and build developer-friendly tooling and APIs to support incident response, performance analysis, and platform debugging at scale. Leverage (and optionally contribute to) open-source standards like OpenTelemetry to ensure interoperability and extensibility. Champion a pragmatic approach to observability-balancing performance, cost, and user value across diverse engineering teams. Qualifications Twilio values diverse experiences from all kinds of … logging platforms, metrics pipelines, tracing infrastructure, or profiling tools). Lead technical execution for major components of Twilio's observability overhaul, including shift to centralized S3-based data lakes, OpenTelemetry instrumentation, and ClickHouse-backed query engines. Deep proficiency in at least one modern programming language (e.g., Go, Python, Java). Familiarity with high-cardinality data challenges and telemetry correlation techniques. More ❯