Implement secure architecture and platform hardening aligned with defence-grade standards, supporting identity, access control, encryption, and system resilience. Monitoring & Continuous Improvement Setup and maintain monitoring solutions (eg ELK, OpenTelemetry, Prometheus), troubleshoot performance, and deliver root cause analysis and remediation. What We're Looking For DV Clearance : Active Developed Vetting clearance is essential . Systems Engineering Experience : 2nd/3rd More ❯
Implement secure architecture and platform hardening aligned with defence-grade standards, supporting identity, access control, encryption, and system resilience. Monitoring & Continuous Improvement Setup and maintain monitoring solutions (e.g. ELK, OpenTelemetry, Prometheus), troubleshoot performance, and deliver root cause analysis and remediation. What We're Looking For DV Clearance : Active Developed Vetting clearance is essential . Systems Engineering Experience : 2nd/3rd More ❯
Implement secure architecture and platform hardening aligned with defence-grade standards, supporting identity, access control, encryption, and system resilience. Monitoring & Continuous Improvement Setup and maintain monitoring solutions (e.g. ELK, OpenTelemetry, Prometheus), troubleshoot performance, and deliver root cause analysis and remediation. What We're Looking For DV Clearance : Active Developed Vetting clearance is essential . Systems Engineering Experience : 2nd/3rd More ❯
Implement secure architecture and platform hardening aligned with defence-grade standards, supporting identity, access control, encryption, and system resilience. Monitoring & Continuous Improvement Setup and maintain monitoring solutions (e.g. ELK, OpenTelemetry, Prometheus), troubleshoot performance, and deliver root cause analysis and remediation. What We're Looking For DV Clearance : Active Developed Vetting clearance is essential . Systems Engineering Experience : 2nd/3rd More ❯
Implement secure architecture and platform hardening aligned with defence-grade standards, supporting identity, access control, encryption, and system resilience. Monitoring & Continuous Improvement Setup and maintain monitoring solutions (e.g. ELK, OpenTelemetry, Prometheus), troubleshoot performance, and deliver root cause analysis and remediation. What We're Looking For DV Clearance : Active Developed Vetting clearance is essential . Systems Engineering Experience : 2nd/3rd More ❯
to translate complexity into clarity Experience with Terraform, Helm, or GitOps tooling Familiarity with front-end technologies such as React and TypeScript Exposure to GraphQL, observability stacks (e.g., Prometheus, OpenTelemetry), or large-scale data platforms Prior work in regulated industries (BFSI, telecom, public sector) To succeed in this role, you'll bring more than just technical knowledge. You'll demonstrate More ❯
lifecycle tools, model monitoring, and versioning Exposure to tools like KServe, Ray Serve, Triton, or vLLM is a big plus Bonus Points Experience with observability frameworks like Prometheus or OpenTelemetry Knowledge of ML libraries: TensorFlow, PyTorch, HuggingFace Exposure to Azure or GCP Passion for financial services Qualifications Degree in Computer Science, Engineering, Data Science, or similar What We Offer A More ❯
infrastructure, CI/CD pipelines, and cloud networking Demonstrated ability to manage application observability, triage and monitoring Technical Competencies (Desirable): Familiarity with Auth0, AWS Cognito, Helm, Prometheus/Grafana, OpenTelemetry or Honeycomb Experience with CI/CD pipelines for containerised and serverless environments Knowledge of additional cloud platforms such as GCP or Azure Benefits Market-leading salary More ❯
experience in technical integrations and POCs Comfortable coding in any high-level programming language (Java, Go, Python) Strong hands-on knowledge of Kubernetes, AWS, Azure, GCP, Docker, Prometheus, and OpenTelemetry Industry knowledge and opinions on Monitoring, Observability, Log Management, SIEM Engineering/DevOps Background - advantage Experience in Technical Sales of Log Analytics/Monitoring/APM/SIEM - advantage Cultural More ❯
/Accounts - AWS Control Tower, GCP Resource Manager, etc. Network - AWS Transit Gateway, GCP Shared VPC, AWS Route53, GCP Cloud DNS, etc. Observability - AWS OpenSearch, GCP Monitoring/Traces, OpenTelemetry, Grafana, Prometheus, etc. Automation Prowess: Hands-on experience with modern Infrastructure as Code (IaC) automation tools and frameworks (Terraform, Jenkins, Ansible, etc.). Software Development Acumen: A software development background More ❯
e.g., Jenkins, GitLab CI/CD). Fluent in containerisation and orchestration tools (e.g., Docker, Kubernetes). Experience with Helm Charts for managing Kubernetes clusters (EKS). Experience with OpenTelemetry and the LGTM stack of observability tools Experience working in agile environments and a strong agile mindset. Attention to detail and the ability to work under pressure without being distracted More ❯
experience, some of which should have focus on Observability. Excellent knowledge and hands-on experience with monitoring, logging, and tracing tools such as Prometheus, VictoriaMetrics, Grafana, Datadog, New Relic, OpenTelemetry, ELK Stack, or similar. Experience with high volume data storage (Structured and unstructured). A strong technical background, with current capabilities and willingness to get hands on when needed. Excellent More ❯
cloud (preferably Azure) using Terraform and Kubernetes. Manage CI/CD pipelines using GitHub Actions and ensure smooth delivery to production. Own monitoring, alerting, and observability, using tools like OpenTelemetry and Dynatrace. Security & Compliance: Champion secure coding practices and data protection across services. Collaboration & Mentoring: Work closely with product owners, engineering leads, and other stakeholders to shape technical solutions. Mentor More ❯
access management in Azure Help integrate platform components with client environments, participating in deployments, troubleshooting, and documentation Drive observability and resilience across environments using tools like Prometheus, Grafana, and OpenTelemetry Troubleshoot and resolve issues across infrastructure, containers, and deployment pipelines Contribute to internal and client-facing infrastructure documentation and support materials Cloud Platforms: Azure (primary), with some AWS, GCP exposure More ❯
systems by design Nice-to-Haves: Exposure to regulated environments (e.g., BFSI, healthcare, public sector) Experience with performance, security, or chaos testing Familiarity with observability tooling (e.g., Prometheus, Grafana, OpenTelemetry) Knowledge of contract testing, mocking, or service virtualization Mindset & Cultural Fit A builder's mindset, focused on enabling early, frequent, and safe delivery through automated confidence A belief that quality More ❯
Linux fundamentals. Curiosity and the confidence to ask questions in a fast-moving team. Nice-to-haves Exposure to Kubernetes, Docker or Terraform. Experience with observability stacks (Grafana, Prometheus, OpenTelemetry). Familiarity with Postgres. Interest in data-privacy, AdTech/MarTech or large-scale data processing. Familiarity with Kafka, gRPC or Apache Spark. As well as working as part of More ❯
technical experience in Cloud DevOps, SaaS, or observability, with 5+ years in leadership roles. Strong hands-on experience with AWS, GCP, Azure, K8S, Terraform and observability tools: Prometheus, Grafana, OpenTelemetry, ELK, Splunk, Datadog, and similar. Proficiency with metrics, logs, traces and APM. Leadership & Global Operations Proven success leading multi-regional or global technical teams with direct management of managers. Demonstrated More ❯
below) is advantageous: Kotlin Cloud Technologies (Kubernetes, Open Shift) Messaging Technologies (Kafka, Solace, TIBCO) Database/Data Store/Data Query Technologies (SQL Server, Trino, Mongo, S3) Observability Technologies (OpenTelemetry, Elastic Stack/ELK, Grafana) This job description provides a high-level review of the types of work performed. Other job-related duties may be assigned as required. Job Family More ❯
debugging issues in client Windows environments Refactoring components to improve system quality - performance optimizations and network improvements Helping us build up our tools for observability and distributed tracing (using OpenTelemetry and Grafana) Keeping the Mimica platform up-to-date with the latest framework developments and devising innovative solutions in the Intelligent Automation space Documenting procedures and guides to facilitate knowledge More ❯
patterns, and packaging. Familiarity with building performant and reliable Python systems, including low-level C/C++ extensions (e.g., using pybind11, Cython) and instrumentation for production telemetry (e.g., Prometheus, OpenTelemetry). A proactive ownership mindset and the ability to navigate ambiguity. Excellent collaboration and communication skills for working effectively with teams and stakeholders. Ideally Professional experience GPGPU programming (e.g., CUDA More ❯