applications. Releasing functional software regularly, utilizing trunk-based development, automated testing, and infrastructure-as-code principles. Ensuring the software meets requirements for performance, resilience, observability, maintainability, security, and accessibility. Collaborating with cross-disciplinary teams to build effective working relationships. Aligning team efforts with user needs and Kooth's commercial and More ❯
App Service, Functions, Managed SQL, Redis, Blob Storage). Experience with SQL Server, Atlas MongoDB, Elasticsearch, xUnit, Selenium, and web accessibility testing. Familiarity with observability platforms like Azure Application Insights or DataDog. Proficient with NuGet, GIT, Swagger, OpenAPI, and Terraform. Nice to have: Experience with AWS or GCP. Contributions to More ❯
of all engineering teams. As an Engineering Manager, you will lead and expand two core components: the Feature Flags Service and the Engineering Internal Observability Platform (E360). You'll ensure that these systems serve as the foundation for seamless and efficient engineering workflows, allowing us to deliver top-tier More ❯
Peterborough, Cambridgeshire, United Kingdom Hybrid / WFH Options
BGL Group
Management to support the goals and objectives on your team. You will have a focus on end-to-end responsibility for the development, quality, observability, and testing of the software you build. This role will offer you the opportunity to get hands on with a number of different technologies, however More ❯
Sunderland, Tyne And Wear, United Kingdom Hybrid / WFH Options
Tombola
other teams to foster collaborative practices, streamline processes, and enhance product quality, ultimately improving delivery timelines. Focus on enhancing our technical ecosystem by emphasizing observability practices, ensuring robustness and reliability. Leverage your technical expertise as an integral part of our development team to tackle intricate challenges and construct reliable, well More ❯
Sunderland, Tyne And Wear, United Kingdom Hybrid / WFH Options
Tombola
other teams to foster collaborative practices, streamline processes, and enhance product quality, ultimately improving delivery timelines. Focus on enhancing our technical ecosystem by emphasizing observability practices, ensuring robustness and reliability. Leverage your technical expertise as an integral part of our development team to tackle intricate challenges and construct reliable, well More ❯
data and AI systems and applications in a cloud-first environment. Skilled in engineering ways of working such as CI/CD, release lifecycle, observability, testing, and continuous model validation with a tangible track record of instituting change. Programming experience - ideally in Python or open to using Python. Familiarity with More ❯
projects and products Continuously apply and improve our cloud engineering best practices and standards, and support colleagues in their adoption Implement and maintain modern observability tooling to support the debugging and improvement of workloads hosted in public cloud, private cloud, and on-premise environments What the ideal candidate looks like More ❯
Cloud Operations. Good working technical knowledge (certificates are very welcome) in different cloud technologies and Azure and AWS Cloud Platforms. Experience managing monitoring, alerting, observability, and dashboarding platforms (such as AWS Monitor, Prometheus, Grafana, and Elasticsearch). Good understanding of NOC and DevOps practices. Experience and in-depth knowledge of More ❯
Uxbridge, Middlesex, United Kingdom Hybrid / WFH Options
Avature
monthly for our themed culture days. Plus all our giffgaffers come together at our legendary giffgaff summer, birthday and Christmas celebrations. The Must Haves: Observability, "you build it you run it" attitude Mentoring, good communication, giving and receiving feedback The Other Stuff We Are Looking For: Event-Driven Architecture; SOLID More ❯
in delivery assurance for analytics and AI programmes. Desired Skills: Experience in data ingestion, integration, governance, and solution design. Familiarity with data quality frameworks, observability tools, and automation. WHAT YOU'LL LOVE ABOUT WORKING HERE? As a Senior Manager, you will have the opportunity to work at the forefront of More ❯
such as Dagster , Airflow , or Prefect Knowledge of CI/CD best practices and tools (e.g. GitHub Actions, Jenkins, CodePipeline) Exposure to monitoring and observability tools for ML systems (e.g. Prometheus, Grafana, DataDog, WhyLabs, Evidently, etc.) Experience in building parallelised or distributed model inference pipelines Nice-to-Have Skills Familiarity More ❯
of trading processes, including FIX connectivity, order management, pricing, and market making. Familiarity with ITIL framework processes for incident and problem management. Knowledge of observability and monitoring tools (e.g., ELK, Grafana). Understanding of object-oriented programming languages such as C# .NET and Python is a plus. Why Join Us More ❯
SAP Sovereign Cloud Automation & Tooling team and contribute to the Quality Engineering efforts for the Operations Control Plane (OCP) -a foundational capability responsible for observability, compliance, and policy enforcement in SAP's regulated cloud environments. You will support the test automation strategy for a WebApp architecture powered by Node.js and More ❯
to fault find and identify the root cause. Core Responsibilities involved: 2nd and 3rd line support Incident management Application design and development Site reliability (observability, alerting, high-availability, self-healing systems etc.) Database administration Infrastructure provisioning Process automation Respond to change requests Skills & Experience Oracle DB Powershell SQL Docker (with More ❯
Establish clear API change management processes to minimize disruption for internal and external consumers. API Monitoring & Performance: Set up and oversee API monitoring and observability tools to ensure high availability and performance. Define and track key performance indicators (KPIs) for APIs, such as response times, uptime, and error rates. Analyze More ❯
with rapid feedback and safe deploys. Writes automation and monitoring code with a focus on speed of delivery and maintainability. Drives the adoption of observability tools and practices for real-time issue detection and resolution. Understands and improves distributed systems, storage, and networking in production. Maintains a strong delivery mindset More ❯
For Delivering robust, fully tested, maintainable software that impacts end users Designing and implementing production-ready scalable NLP applications and APIs Developing monitoring and observability solutions and integration testing frameworks Conducting code reviews and providing constructive feedback to team members Ensuring the scalability, performance, and reliability of AI applications Staying More ❯
data-driven decision-making, risk assessment, KPI framework development, or value tracking and value analysis Experience in ingestion, integration, data engineering, data quality and observability, dataset identification Experience in high-level architecture, solution design or platform licensing Experience in business case development and financial modelling WHAT YOU'LL LOVE ABOUT More ❯
CI/CD and DevOps Practices Experience with CI/CD tools (e.g., Octopus Deploy, GitHub Actions, TeamCity, Jenkins, Azure DevOps). Monitoring and Observability Familiarity with observability concepts and ensuring system reliability. Experience with monitoring tools like Prometheus, Grafana and Sumo Logic. System Configuration and Documentation Experience designing, documenting More ❯
CI/CD and DevOps Practices Experience with CI/CD tools (e.g., Octopus Deploy, GitHub Actions, TeamCity, Jenkins, Azure DevOps). Monitoring and Observability Familiarity with observability concepts and ensuring system reliability. Experience with monitoring tools like Prometheus, Grafana and Sumo Logic. System Configuration and Documentation Experience designing, documenting More ❯
management team, to deliver industry-leading DevOps and Infrastructure products that provide Infrastructure-as-code abstractions and operating principles, leading cloud computing capability, automation, observability, operability, and developer experience. You will drive the product roadmap, guide product development initiatives, and ensure the successful launch and adoption of DevOps and Infrastructure … be a plus: Strong understanding of modern infrastructure and site reliability engineering practice, including Infrastructure-as-code tools (e.g. Terraform, Ansible ) and metrics and observability tools (e.g. Prometheus, Grafana ). Strong understanding of modern DevOps practice, including DevOps stacks (e.g. Jenkins, GitLab, CircleCI ). Cloud experience (e.g. AWS, Google Cloud More ❯
of title, we are committed to achieving ambitious goals and we have fun celebrating our wins. We are looking for a self-motivated Senior Observability Engineer to join our dedicated Observability Infrastructure team. Anaplan is a high-growth company that is leading the way in enterprise planning. We look for … people who believe in simplicity, agility and performance and can choose and use the best tools for the job. In the role of Senior Observability Engineer, you will be designing and improving our approach to collecting and analyzing Observability telemetry (Logs, Metrics and Traces) and visualizing it in Grafana Cloud. … You will implement best observability practices to enable engineers across the business to track service performance and interaction in a scalable, performant, and cost-effective manner. What you'll be doing: In this role, working a minimum of 2 days a week in our York Office, you will be: Work More ❯