tools; JIRA, Confluence; * Experience in monitoring/reporting tools such as Splunk, Grafana/Prometheus etc * Experience in Agile practices * Working knowledge of environment monitoring tools such as GCO, NewRelic, Prometheus, Grafana. * Collaboration Skills: Proactive can-do attitude; A creative approach towards solving technical problems; Able to work efficiently with colleagues in multiple locations; Willing to collaborate across domains, for More ❯
tools; JIRA, Confluence; * Experience in monitoring/reporting tools such as Splunk, Grafana/Prometheus etc * Experience in Agile practices * Working knowledge of environment monitoring tools such as GCO, NewRelic, Prometheus, Grafana. * Collaboration Skills: Proactive can-do attitude; A creative approach towards solving technical problems; Able to work efficiently with colleagues in multiple locations; Willing to collaborate across domains, for More ❯
Location : Colchester (x3 days in the office) Reports to: Head of IT and Support We are seeking a proactive and skilled DevOps Engineer/Analyst to join our growing team in a dynamic hybrid role that blends client consultancy with More ❯
different platforms and types of software, from software engineers and DevOps all the way to the C-suite. Responsibilities Solve customer's technical problems by adopting the platform, integrating new data and existing integrations Understanding customers technical requirements and business goals to consistently create new artifacts and deliver value Lead the onboarding process, from new integrations, creation … with Azure and GCP Background knowledge and hands-on practice in Observability, specifically experience working with one or more of the following tools - Kibana, Open-Search, Grafana, Datadog, Sumologic, NewRelic, AppDynamics, Dynatrace, Prometheus, Logz.io, SignalFX, Instana, Splunk, Honeycomb, Jaeger Hands-on experience with Infrastructure as a Code (Terraform/Ansible) Hands-on experience in technical integrations (OpenTelemetry/fluentd/ More ❯
Other duties as needed About You 5+ years' experience in Site Reliability Engineer roles Expert+ level Linux administration, scripting, and troubleshooting Demonstrable knowledge of Observability tools (Prometheus/Grafana, NewRelic, Splunk, DataDog) Comprehensive experience with AWS (Amazon Web Services) and its core capabilities (VPC, EC2, ECS, Route53, Fargate, ALB/NLB distributions, etc) Extensive experience with cloud More ❯
Crewe, Cheshire, United Kingdom Hybrid / WFH Options
Manchester Digital
Cognito, Firebase) - Understanding of Site Reliability Engineering and key concepts - Proficient in Infrastructure as Code pipeline deployments and pipeline version control within Terraform or CloudFormation. - Observability Systems, e.g., Nagios, NewRelic - Able to troubleshoot/work under pressure, meet deadlines. - Previous experience in a cloud engineering role. - AWS certified as SysOps Administrator/Solutions Architect/DevOps Engineer. … as Code principles and related tech such as Terraform or CloudFormation - Enhanced experience of AWS cloud technologies, e.g., ECS, EC2, VPC, Lambda, CFS. Ideally AWS certified. - Observability Systems, e.g., NewRelic, CloudWatch, SquadCast - ITIL Qualified or awareness of the framework. Bonus Qualifications: -Experience with Linux system administration and troubleshooting. -Basic knowledge of AWS cloud technologies such as EC2 … S3, RDS, and ECS. -Exposure to databases such as MySQL, MariaDB, or Cassandra. -Understanding of monitoring tools (NewRelic, CloudWatch, Nagios). -Awareness of ITIL frameworks and structured service operations. -NISC2 or similar security qualifications or knowledge Why should you join us? Brand new collaborative, open-concept office for those days when you want to work collaboratively More ❯
Site Reliability Engineer roles 3+ years' experience with an object-oriented language (preferably Java, .NET or C++) Expert+ level Linux administration, scripting, and troubleshooting Demonstrable knowledge of Observability tools (NewRelic, Splunk, DataDog) Comprehensive experience with AWS (Amazon Web Services) and its core capabilities (VPC, EC2, ECS, Route53, Fargate, ALB/NLB distributions, etc) Extensive experience with cloud More ❯
Site Reliability Engineer roles 3+ years' experience with an object-oriented language (preferably Java, .NET or C++) Expert+ level Linux administration, scripting, and troubleshooting Demonstratable knowledge of Observability tools (NewRelic, Splunk, DataDog) Comprehensive experience with AWS (Amazon Web Services) and its core capabilities (VPC, EC2, ECS, Route53, Fargate, ALB/NLB distributions, etc) Extensive experience with cloud More ❯
on large-scale production systems, delivering highly impactful products that make a difference to our millions of users. As a MLOps Engineering Manager at Trainline you will Build a new team of MLOps Engineers working alongside ML Engineers, Data Engineers, Software Engineers, Data Scientists and Product Managers Define MLOps processes and steer tooling and infrastructure choices across the technology … like MLFlow and Airflow, or on common problems such as model and API monitoring, data drift and validation, autoscaling, access permissions Have previously worked with monitoring tools such as NewRelic or Grafana Understand the use of feature stores and related data technologies for operational machine learning products Are proficient with Python and have Spark knowledge. Have leadership …/DS libraries (scikit-learn, numpy, pandas, LightGBM, LangChain/LangGraph, TensorFlow, etc ) PySpark AWS cloud infrastructure: EMR, ECS, ECR, Athena, etc. MLOps: Terraform, Docker, Spacelift, Airflow, MLFlow Monitoring: NewRelic CI/CD: Jenkins, Github Actions More information: Enjoy fantastic perks like private healthcare & dental insurance, a generous work from abroad policy, 2-for-1 share purchase More ❯
Java, Swift, and Xamarin for our mobile apps. Responsibilities Will be a core member of the team to design, architect, develop, code reviews and test our key applications Design new application features and integrations in collaboration with team members to deliver complex changes Design and implement scalable and resilient cloud solutions with security and disaster recovery in mind Help … or equivalent): EC2, S3, CloudFront, Elastic Beanstalk, Dynamo DB Basic networking knowledge and troubleshooting Experience with any of the following tools and technologies: Atlassian Jira, GitHub, Azure DevOps, Aha!, NewRelic, Sumo Logic More ❯
should demonstrate that you have: The ambition for creating and nurturing a culture of DevOps across a Technology organisation; working with existing infrastructure and Software Engineering teams to define new practices and evolve ways of working. Experience with the various forms of Cloud infrastructure, hosting and services e.g. IaaS, PaaS, Serverless computing and all-in-one cloud-based solutions. … knowledge of containerisation and orchestration tools such as Docker and Kubernetes, ideally running on Azure (AKS). Experience integrating and configuring various logging, monitoring, and alerting tools (e.g. Splunk, NewRelic) that provide operational insight into the health of live applications and systems. Experience in applying a range of cloud security tools and techniques (e.g. threat modelling, vulnerability More ❯
Proven expertise and experience with database technologies including NoSQL databases like MongoDB, RDBMS such as Postgres and MySQL Exposure to Docker, Kubernetes, AWS, Helm, Terraform, Vault, Grafana, ELK Stack, NewRelic Relevant experience in the maintenance of data APIs and data lake architectures, including experience with Apache Iceberg, Trino/Presto, Clickhouse, Snowflake, BigQuery. Master's degree in More ❯
Leeds, Yorkshire, United Kingdom Hybrid / WFH Options
William Hill PLC
preferably Terraform. A good working knowledge of configuration management, provisioning and orchestration tools with an understanding of use and value of centralised Monitoring/Logging systems like Splunk and New Relic. Good understanding of networking fundamentals, firewalls, load balancers, and across AWS account communication with experience of working in an Agile development environment. What we offer: Our roles offer … catch-up! More about evoke: We're a business that embraces change and progress. The power behind big name brands William Hill, 888 and Mr Green, evoke is the new name for 888 Holdings. Marking a new sense of purpose, direction and ambition for the business, there couldn't be a more exciting time to join us as More ❯
and observability. We are looking for solution-oriented, commercially minded, customer-focused individuals, used to working in an agile environment who want to be a part of building something new from the ground up within a diverse and inclusive team. Culture is as important to us and we are looking for intellectually curious, new technology passionate individuals who … would like to expand their skills whilst working on a new exciting venture for the firm. Your work will have a massive impact, both on us as a company, as well as our clients and our business partners around the world. As a Site Reliability Engineer III at JPMorgan Chase within the Corporate Technology - Market Risk, you will solve … in learning technology that facilitates innovation Experience with continuous integration and continuous delivery tools like Jenkins, GitLab, Terraform Experience in at least one observability tool such as Dynatrace, Datadog, NewRelic, CloudWatch, AppDynamics, Splunk, Geneos. More ❯
and container orchestration. Support multi-tenancy and environment rationalization to reduce duplication and inefficiency. Define and implement observability standards, including logging, metrics, tracing, and alerting . Use tools like NewRelic , Prometheus , and Grafana , alongside building custom instrumentation for key platform services. Drive incident readiness and operational resilience by enabling actionable monitoring and alerting. Drive cloud cost visibility … in building and operating developer platforms and enablement frameworks. Experience with cloud-native technologies, Kubernetes, and Infrastructure as Code (Terraform, Helm, etc.). Strong understanding of observability tooling (especially NewRelic, Prometheus, Grafana) and incident response best practices. Familiarity with FinOps, platform cost tracking, and infrastructure efficiency techniques. Excellent communication, leadership, and stakeholder management skills. Attract, hire, and More ❯
and container orchestration. Support multi-tenancy and environment rationalization to reduce duplication and inefficiency. Define and implement observability standards, including logging, metrics, tracing, and alerting . Use tools like NewRelic , Prometheus , and Grafana , alongside building custom instrumentation for key platform services. Drive incident readiness and operational resilience by enabling actionable monitoring and alerting. Drive cloud cost visibility … in building and operating developer platforms and enablement frameworks. Experience with cloud-native technologies, Kubernetes, and Infrastructure as Code (Terraform, Helm, etc.). Strong understanding of observability tooling (especially NewRelic, Prometheus, Grafana) and incident response best practices. Familiarity with FinOps, platform cost tracking, and infrastructure efficiency techniques. Excellent communication, leadership, and stakeholder management skills. Attract, hire, and More ❯
Leeds, Yorkshire, United Kingdom Hybrid / WFH Options
William Hill PLC
or similar Infrastructure as Code tools for streamlined management Containerization:Skilled in Kubernetes administration and orchestration Developer Experience:Experienced in developing SDLC pipelines with GitOps Observability:Familiar with Prometheus, NewRelic, Splunk, or similar monitoring tools Security First:Demonstrates an understanding of security best practices in every workflow with an Agile Mindset you'll be an effective communicator … catch-up! More about evoke: We're a business that embraces change and progress. The power behind big name brands William Hill, 888 and Mr Green, evoke is the new name for 888 Holdings. Marking a new sense of purpose, direction and ambition for the business, there couldn't be a more exciting time to join us as More ❯
Sheffield, Yorkshire, United Kingdom Hybrid / WFH Options
Reach Studios Limited
Rules, Rate Limiting, Workers, Access, and Bot Management Zero Trust and tunnel configurations Automating infrastructure and configuration with Ansible, Terraform, or similar Monitoring and improving performance with tools like NewRelic, Sentry, TideWays Implementing high-availability and scalable solutions for LAMP/LEMP stacks Managing secure, repeatable deployments for Magento 2, Laravel, and React apps Championing DevOps culture More ❯
we expect you to be a key contributor in promoting this mindset. What can you expect from working with us? Contribute to our technological direction - We have lots of new systems to design and build along with existing platforms to maintain and operate so there are plenty of opportunities for you to get involved. We need your help to … push the boundaries of quality, increase our test coverage and automation, quality awareness and try new things. Agile, cross-functional working - We work in autonomous teams consisting of Product Owner, UI/UX Designers, QA, and Front and Back End Engineers. Depending on the undertaking, we also embed or collaborate with others from across the business such as Infrastructure … Driven Development (BDD) to define and capture acceptance criteria Excellent analytical thinking and problem-solving skills Great communication and test coordination skills Debugging and analysis of issues (we use NewRelic and AWS Cloudwatch) Visual difference testing (we use Percy) API testing (we use Postman) Front end testing using Javascript (we use Cypress) Cross browser/Device testing (we use BrowserStack More ❯
areas include: Video: Continuing the Mimir rollout and addressing an extensive list of feature requests. Print: Tackling a significant challenge-simplifying our print publishing processes and technology by leveraging new automation tools. Your Role We're looking for a passionate Mid-Level Software/DevOps Engineer to join our team with the prospect of leading the development, management, and … tools across MySQL/MariaDB along with SQL queries and procedures. The role will include utilisation of AWS Cloud technologies and infrastructure, including monitoring tools such as AWS CloudWatch, NewRelic and Zabbix. It will encompass AWS services and technologies particularly related to hosting and scaling applications, such as EC2, S3, Lambda, and IAM, and knowledge of concepts … as required. Systems Maintenance and Optimisation: Develop, Configure and/or maintain various systems, in-house integrations and applications within the print ecosystem to facilitate the publishing workflow (inc. new features and workflows, as required). Deployment of configuration and software changes to all environments (inc. documentation of releases for users and technical resources). Perform capacity planning, and More ❯
and Cloud team is staffed with senior leadership from Netflix, Cloudflare, Amazon, Fastly and other major corporations. Our mission is to rebuild Imperva's pops and core infrastructure with new modern technologies, embracing Infrastructure as code at all levels with automation as a core requirement for all projects. We are looking for an Observability Engineer to work within our … of professional experience, some of which should have focus on Observability. Excellent knowledge and hands-on experience with monitoring, logging, and tracing tools such as Prometheus, VictoriaMetrics, Grafana, Datadog, NewRelic, OpenTelemetry, ELK Stack, or similar. Experience with high volume data storage (Structured and unstructured). A strong technical background, with current capabilities and willingness to get hands … countries our mobility policy enables thousands of employees each year to develop their careers at home and abroad, in their existing areas of expertise or by branching out into new fields. Together we believe that embracing flexibility is a smarter way of working. Thales UK is committed to providing an inclusive and barrier-free recruitment process. We will provide More ❯
is a plus Knowledge of Redis and log queries is a plus Experience in automations/AI would be an advantage Experience administering multiple monitoring systems such as Datadog, NewRelic, Kubernetes, Grafana and Elastic Cloud Experience with Cloud Computing, AWS, Microservices Architecture, Unix and Linux Systems Life @ Empowered to think big. Try new opportunities while working with a talented … ambitious and supportive team. Transformational and proactive working environment. Elevate employees to find thoughtful and innovative solutions. Growth from within. We help to develop new skill-sets that would impact the shaping of your personal and professional growth. Work Culture. Our colleagues are some of the best in the industry; we are all here to help and support one More ❯
in implementing changes while following ITIL change management processes. Understanding of basic security principles and best practices for securing infrastructure. Optional but advantageous technical skills: Proficient using observability tools (NewRelic and Thousand Eyes), BI platform and data visualisation tools (such as Tableau and Power BI) and technology tools (Jira, Confluence). System Administration: Proficiency in Linux/Unix and Windows More ❯
the work has only just begun! The role We are looking for a Senior Platform Engineer to join Rated Labs. You will be spearheading the effort to integrate a new and exciting product with a series of APM stacks. You will be part of a talent-dense, multidisciplinary team, with strong product sensibility and user-mindedness, backed by some … aligned. ️ The skills Proven experience as a DevOps Engineer, Site Reliability Engineer, Platform Engineer or similar role. Ideally in an entreprise-grade Experience with APM stacks such as Datadog, NewRelic, Prometheus or similar. Experience with handling telemetry, tracing and logging data, at scale, in multiple different environments. Familiarity with low-level telemetry daemons and aggregators such as … and growing!), working in a hybrid/remote set-up, with a home base in London, UK. The offer Competitive salary ($110,000-$150,000) & equity package Flexible PTO New MacBook Budget for conferences and learning Hybrid/remote work set-up with a homebase in London We are also offering visa sponsorship for candidates who wish to relocate More ❯
There are several key areas that require constant development: Automation ?" Enhancements to the automation build process. Enhancements are continuously added to the base repository as well as additions of new technologies. Scripting and Tools ?" There are many processes, tasks that can be made easier using scripts or custom designed tools in a managed services environment. We are looking for … sure the wheels are greased and the ship is humming along Automation, Packaging, Deployments (Custom Hypervisor, Puppet, Git, scripting, CI services) Source Code version control systems: Git & SVN Monitoring: NewRelic, PagerDuty, etc Programming Languages: Bash, PHP, GoLang Virtualisation: We have a custom hypervisor (ZPF) but things like VMWare ESXi/KVM/oVirt/RHEV Classics: Apache, MySQL, Redis, Varnish … NTP, SMTP, Jenkins A solid background in Linux System Administration Functions and responsibilities This is a dynamic and varied role, which includes things like: Build, deployment and maintenance of new and existing environments Designing and maintaining highly available architectures Security (patch management, vulnerability scanning & monitoring) Uptime ?" design and maintain the systems and networking for uptime, resilience, architecting for failure. More ❯