in GitHub as a Source Code Management (SCM) platform, with hands-on experience in version control and collaborative coding. Experience with Kubernetes. Experience with observability tools such as Datadog, Grafana, and Elastic for monitoring, analytics, and proactive issue identification. Experience in incident management, showcasing the ability to respond promptly and more »
Solid understanding of the processes involved in release, integration, and deployment, along with an awareness of the promotion pathways within these processes. Expertise in observability principles and practices, encompassing monitoring, logging, tracing, and alerting systems to ensure transparency and actionable insights into system performance and health. Tools: Dynatrace, Datadog. Familiarity more »
responsibilities include: Monitoring the performance and reliability of the company's global online platforms; Troubleshooting issues via proactive and reactive monitoring and alerts using observability tooling and logging service requests; Enhancing the tech stack/configurations of existing services to improve site performance and reduce issues; Exploring and delivering on more »
the lead on projects to improve our DevOps: CI/CD pipeline (vulnerability scanning, static analysis, tests), blue/green deploys, auto load balancing, observability & instrumentation, infrastructure as code (e.g. Terraform), etc. Take the lead on projects to refactor our codebase, separating domain-specific logic, application logic and UI code more »
using Python and TypeScript/JavaScript. Strong API design skills and a familiarity with building web applications. A proponent of great testing, first-class observability and automating everything. Familiarity with security principles and practices, including authentication, authorization, and encryption. Experience with cloud technologies (e.g. Docker, Kubernetes, Helm & Terraform) and platforms more »
you'll do: Working as part of a team or as the engagement lead, with experience in multicloud networking, SASE, datacentre networking, network simplification, automation, observability and application networking, and a strong understanding and alignment of security as it relates to the previous technology areas. Leveraging this experience to deliver: Enterprise more »
Company Description: Dynatrace exists to make software work perfectly. Our platform combines broad and deep observability and continuous runtime application security with advanced AIOps to provide answers and intelligent automation from data. This enables innovators to modernize and automate cloud operations, deliver software faster and more securely, and ensure flawless digital more »
and upgrading data pipelines and self-service tooling to provide clean, efficient results; Writing automated tests to validate requirements; Promoting data governance through documentation, observability and controls; Using version control and performing code reviews; Promoting the adoption of tools and best practices across the team. Skills and experience: Essential skills more »
Dynatrace and has demonstrable experience (at least 3 years) as a Site Reliability Engineer. The Site Reliability Engineer (SRE) will take ownership of the observability suite, leveraging deep DevOps skills and experience to proactively enhance the performance and stability of APIs and applications. This role will play a crucial part … ensuring reliability and scalability including managing APM tools such as Dynatrace or New Relic. Main Responsibilities as Site Reliability Engineer: Take ownership of the observability suite, including monitoring, logging, and alerting tools, to ensure comprehensive and holistic visibility into system performance and health. Configure and manage APM tools such as … expertise in monitoring, logging, and alerting tools such as Prometheus, ELK stack, Grafana, Azure Monitor etc., with the ability to take ownership of the observability suite. Experience managing APM tools such as Dynatrace or New Relic, utilizing their capabilities to monitor application performance effectively. Deep understanding of DevOps principles and more »
Reigate, England, United Kingdom Hybrid / WFH Options
Client Server
collaborate across product-focussed Agile engineering teams to ensure the reliability, availability and performance of client-facing services. Responsibilities will include managing and configuring observability platforms such as Datadog and PagerDuty to provide proactive monitoring of production (and other) environments, design and implementation of automation processes to drive efficiencies, leading … similar SRE/Site Reliability Engineer position You have experience of running 24x7 services in the public cloud - Azure preferred You have experience with observability tools such as Datadog and PagerDuty You have a good knowledge of Containerisation - Kubernetes, AKS You have strong scripting skills for automation, PowerShell or Python more »
Life on the team: You will be engaging with some of the UK's largest and most high-profile enterprise organisations around our Cloud and Application solutions portfolio. You will be supporting customers in their respective evolutions, identifying the natural solution more »
Hatfield, Hertfordshire, South East, United Kingdom
COMPUTACENTER (UK) LIMITED
Life on the team: You will be engaging with some of the UK's largest and most high-profile enterprise organisations around our Cloud and Application solutions portfolio. You will be supporting customers in their respective evolutions, identifying the natural solution more »