tools; JIRA, Confluence; * Experience in monitoring/reporting tools such as Splunk, Grafana/Prometheus, etc. * Experience in Agile practices * Working knowledge of environment monitoring tools such as GCO, NewRelic, Prometheus, Grafana. * Collaboration Skills: Proactive can-do attitude; A creative approach towards solving technical problems; Able to work efficiently with colleagues in multiple locations; Willing to collaborate across domains, for …
Proven track record of successfully managing complex technical products, preferably in software development, IT operations, or cloud infrastructure environments. Experience working with modern telemetry tools (e.g., Prometheus, Grafana, Datadog, NewRelic, etc.) and automation platforms (e.g., Ansible, Terraform, Jenkins, etc.). Strong understanding of Agile and Scrum methodologies. Strong analytical, problem-solving, and communication skills. Ability to translate … you to bring your true self to work so you can help enrich our diverse workforce. You will be part of a collaborative and creative culture where we encourage new ideas and are committed to sustainability across our global business. You will experience the critical role we have in helping to re-engineer the financial ecosystem to support and More ❯
Location: Colchester (3 days in the office). Reports to: Head of IT and Support. We are seeking a proactive and skilled DevOps Engineer/Analyst to join our growing team in a dynamic hybrid role that blends client consultancy with …
different platforms and types of software, from software engineers and DevOps all the way to the C-suite. Responsibilities Solve customer's technical problems by adopting the platform, integrating new data and existing integrations Understanding customers technical requirements and business goals to consistently create new artifacts and deliver value Lead the onboarding process, from new integrations, creation … with Azure and GCP Background knowledge and hands-on practice in Observability, specifically experience working with one or more of the following tools - Kibana, Open-Search, Grafana, Datadog, Sumologic, NewRelic, AppDynamics, Dynatrace, Prometheus, Logz.io, SignalFX, Instana, Splunk, Honeycomb, Jaeger Hands-on experience with Infrastructure as a Code (Terraform/Ansible) Hands-on experience in technical integrations (OpenTelemetry/fluentd/ More ❯
Other duties as needed. About You: 5+ years' experience in Site Reliability Engineer roles; Expert+ level Linux administration, scripting, and troubleshooting; Demonstrable knowledge of Observability tools (Prometheus/Grafana, NewRelic, Splunk, DataDog); Comprehensive experience with AWS (Amazon Web Services) and its core capabilities (VPC, EC2, ECS, Route53, Fargate, ALB/NLB distributions, etc.); Extensive experience with cloud …
Crewe, Cheshire, United Kingdom Hybrid / WFH Options
Manchester Digital
Cognito, Firebase) - Understanding of Site Reliability Engineering and key concepts - Proficient in Infrastructure as Code pipeline deployments and pipeline version control within Terraform or CloudFormation. - Observability Systems, e.g., Nagios, NewRelic - Able to troubleshoot/work under pressure, meet deadlines. - Previous experience in a cloud engineering role. - AWS certified as SysOps Administrator/Solutions Architect/DevOps Engineer. … as Code principles and related tech such as Terraform or CloudFormation - Enhanced experience of AWS cloud technologies, e.g., ECS, EC2, VPC, Lambda, CFS. Ideally AWS certified. - Observability Systems, e.g., NewRelic, CloudWatch, SquadCast - ITIL Qualified or awareness of the framework. Bonus Qualifications: -Experience with Linux system administration and troubleshooting. -Basic knowledge of AWS cloud technologies such as EC2 … S3, RDS, and ECS. -Exposure to databases such as MySQL, MariaDB, or Cassandra. -Understanding of monitoring tools (NewRelic, CloudWatch, Nagios). -Awareness of ITIL frameworks and structured service operations. -NISC2 or similar security qualifications or knowledge Why should you join us? Brand new collaborative, open-concept office for those days when you want to work collaboratively More ❯
Java, Swift, and Xamarin for our mobile apps. Responsibilities Will be a core member of the team to design, architect, develop, code reviews and test our key applications Design new application features and integrations in collaboration with team members to deliver complex changes Design and implement scalable and resilient cloud solutions with security and disaster recovery in mind Help … or equivalent): EC2, S3, CloudFront, Elastic Beanstalk, Dynamo DB Basic networking knowledge and troubleshooting Experience with any of the following tools and technologies: Atlassian Jira, GitHub, Azure DevOps, Aha!, NewRelic, Sumo Logic More ❯
SNS, RDS, DynamoDB, KMS, SSM, ELB, VPC, or Route53 Experience with hardened AMIs and Container Images, such as DoD STIGs and CIS Benchmarks Experience with APM, such as Datadog, NewRelic, or Splunk Experience in Rancher services, including RKE2 Experience in Identity Authentication and Authorization, including Single-Sign-On, SAML, or OpenID Experience in Active Directory and GPO More ❯
Site Reliability Engineer roles; 3+ years' experience with an object-oriented language (preferably Java, .NET or C++); Expert+ level Linux administration, scripting, and troubleshooting; Demonstrable knowledge of Observability tools (NewRelic, Splunk, DataDog); Comprehensive experience with AWS (Amazon Web Services) and its core capabilities (VPC, EC2, ECS, Route53, Fargate, ALB/NLB distributions, etc.); Extensive experience with cloud …
Proven expertise and experience with database technologies including NoSQL databases like MongoDB, RDBMS such as Postgres and MySQL Exposure to Docker, Kubernetes, AWS, Helm, Terraform, Vault, Grafana, ELK Stack, NewRelic Relevant experience in the maintenance of data APIs and data lake architectures, including experience with Apache Iceberg, Trino/Presto, Clickhouse, Snowflake, BigQuery. Master's degree in More ❯
on large-scale production systems, delivering highly impactful products that make a difference to our millions of users. As a MLOps Engineering Manager at Trainline you will Build a new team of MLOps Engineers working alongside ML Engineers, Data Engineers, Software Engineers, Data Scientists and Product Managers Define MLOps processes and steer tooling and infrastructure choices across the technology … like MLFlow and Airflow, or on common problems such as model and API monitoring, data drift and validation, autoscaling, access permissions Have previously worked with monitoring tools such as NewRelic or Grafana Understand the use of feature stores and related data technologies for operational machine learning products Are proficient with Python and have Spark knowledge. Have leadership …/DS libraries (scikit-learn, numpy, pandas, LightGBM, LangChain/LangGraph, TensorFlow, etc ) PySpark AWS cloud infrastructure: EMR, ECS, ECR, Athena, etc. MLOps: Terraform, Docker, Spacelift, Airflow, MLFlow Monitoring: NewRelic CI/CD: Jenkins, Github Actions More information: Enjoy fantastic perks like private healthcare & dental insurance, a generous work from abroad policy, 2-for-1 share purchase More ❯
workflows and integrating AI capabilities into applications; Experience with cloud deployment services (AWS preferred, Azure, GCP); Knowledge of containerization and orchestration (Docker, Kubernetes); Experience with observability tools (Prometheus, Grafana, NewRelic, Datadog …
Leeds, Yorkshire, United Kingdom Hybrid / WFH Options
ASDA
contributes to the delivery of our mission through a combination of design, build & implementation activity together with working alongside multiple engineering teams across the department, supporting them to adopt new ways of working. The role will involve strong collaboration with others - be it learning from examples of existing great engineering work, collaborating with others to create it or mentoring … use of automation and the CICD toolchain Have experience working to an Agile methodology (e.g. XP, Kanban, Scrum, SAFe) Knowledge of monitoring and alerting tools: Knowledge of tools like NewRelic or Azure Monitor and Azure App Insights Cloud platform knowledge: Experience with container, serverless and other cloud technologies Infrastructure as Code (IaC): Knowledge of tools such as More ❯
languages such as Python, Bash, TypeScript, and PowerShell. Familiarity with DevOps & Site Reliability Engineering (SRE) principles, practices, and tools. Hands-on experience with monitoring and logging solutions (e.g., NewRelic, Coralogix, AWS CloudWatch, Azure Monitor). Strong problem-solving, stakeholder management, written, and verbal communication skills. Proven experience managing multiple projects and topics simultaneously. Work … Days Ago Elsevier is a global information analytics business that helps institutions and professionals advance healthcare, open science and improve performance for the benefit of humanity. We help researchers make new discoveries, collaborate with their colleagues, and give them the knowledge they need to find funding. We help governments and universities evaluate and improve their research strategies. We help doctors …
Sheffield, Yorkshire, United Kingdom Hybrid / WFH Options
Reach Studios Limited
Rules, Rate Limiting, Workers, Access, and Bot Management Zero Trust and tunnel configurations Automating infrastructure and configuration with Ansible, Terraform, or similar Monitoring and improving performance with tools like NewRelic, Sentry, TideWays Implementing high-availability and scalable solutions for LAMP/LEMP stacks Managing secure, repeatable deployments for Magento 2, Laravel, and React apps Championing DevOps culture More ❯
and observability. We are looking for solution-oriented, commercially minded, customer-focused individuals, used to working in an agile environment who want to be a part of building something new from the ground up within a diverse and inclusive team. Culture is as important to us and we are looking for intellectually curious, new technology passionate individuals who … would like to expand their skills whilst working on a new exciting venture for the firm. Your work will have a massive impact, both on us as a company, as well as our clients and our business partners around the world. As a Site Reliability Engineer III at JPMorgan Chase within the Corporate Technology - Market Risk, you will solve … in learning technology that facilitates innovation Experience with continuous integration and continuous delivery tools like Jenkins, GitLab, Terraform Experience in at least one observability tool such as Dynatrace, Datadog, NewRelic, CloudWatch, AppDynamics, Splunk, Geneos. More ❯
and container orchestration. Support multi-tenancy and environment rationalization to reduce duplication and inefficiency. Define and implement observability standards, including logging, metrics, tracing, and alerting. Use tools like NewRelic, Prometheus, and Grafana, alongside building custom instrumentation for key platform services. Drive incident readiness and operational resilience by enabling actionable monitoring and alerting. Drive cloud cost visibility … in building and operating developer platforms and enablement frameworks. Experience with cloud-native technologies, Kubernetes, and Infrastructure as Code (Terraform, Helm, etc.). Strong understanding of observability tooling (especially NewRelic, Prometheus, Grafana) and incident response best practices. Familiarity with FinOps, platform cost tracking, and infrastructure efficiency techniques. Excellent communication, leadership, and stakeholder management skills. Attract, hire, and …
new monitoring queries to drive our alerting, or coordinating across multiple teams to manage the response to an incident. Our technology stack: AWS (including ECS and RDS), OpenTelemetry, NewRelic, Python, Postgres, Liquibase, Angular, Docker Who you are: Four or more years professional experience in a customer-facing technical support or engineering role Excellent verbal and written communication skills, with … internal and client-facing platforms Coordinating any response required to issues with the platform, taking ultimate responsibility for seeing incidents through to resolution Assisting with the technical onboarding of new B2B clients, helping them get up and running with our API Working to improve our ability to effectively support our platform, including improving our monitoring and alerting capabilities Innovating More ❯
should demonstrate that you have: The ambition for creating and nurturing a culture of DevOps across a Technology organisation; working with existing infrastructure and Software Engineering teams to define new practices and evolve ways of working. Experience with the various forms of Cloud infrastructure, hosting and services e.g. IaaS, PaaS, Serverless computing and all-in-one cloud-based solutions. … knowledge of containerisation and orchestration tools such as Docker and Kubernetes, ideally running on Azure (AKS). Experience integrating and configuring various logging, monitoring, and alerting tools (e.g. Splunk, NewRelic) that provide operational insight into the health of live applications and systems. Experience in applying a range of cloud security tools and techniques (e.g. threat modelling, vulnerability More ❯
we expect you to be a key contributor in promoting this mindset. What can you expect from working with us? Contribute to our technological direction - We have lots of new systems to design and build along with existing platforms to maintain and operate so there are plenty of opportunities for you to get involved. We need your help to … push the boundaries of quality, increase our test coverage and automation, quality awareness and try new things. Agile, cross-functional working - We work in autonomous teams consisting of Product Owner, UI/UX Designers, QA, and Front and Back End Engineers. Depending on the undertaking, we also embed or collaborate with others from across the business such as Infrastructure … Driven Development (BDD) to define and capture acceptance criteria Excellent analytical thinking and problem-solving skills Great communication and test coordination skills Debugging and analysis of issues (we use NewRelic and AWS Cloudwatch) Visual difference testing (we use Percy) API testing (we use Postman) Front end testing using Javascript (we use Cypress) Cross browser/Device testing (we use BrowserStack More ❯
Skills & Qualifications: Experience with NoSQL databases (e.g., MongoDB, Cassandra, Redis) is a plus. Familiarity with containerization and orchestration technologies (e.g., Docker, Kubernetes). Experience with performance monitoring tools (e.g., NewRelic, Datadog, Prometheus). Knowledge of data warehousing concepts and ETL processes. Certification in database administration (e.g., Microsoft SQL Server, Oracle DBA, MySQL Certification) is a plus. SCA …