Code (IaC) using tools like Terraform and ARM templates. Hands-on experience and understanding of containerization and orchestration with Azure Kubernetes and Docker. Design and implement monitoring and observability solutions to ensure the health and performance of cloud resources and applications. Identify opportunities to optimize cloud resources, improve performance, and reduce costs through monitoring and analysis of system metrics.
recognize roadblocks and demonstrates interest in learning technology that facilitates innovation. Experience with continuous integration and continuous delivery tools like Jenkins, GitLab, Terraform. Experience in at least one observability tool such as Dynatrace, Datadog, New Relic, CloudWatch, AppDynamics, Splunk. Preferred Qualification: Experience a plus in common SRE toolchains: Grafana, Prometheus, Elasticsearch, Kibana, Jaeger.
the business succeed. Provide timely and effective technical support for end users of a designated set of DevOps tools, encompassing traditional tools (e.g., CI/CD platforms, monitoring and observability tools, source code management systems) and GenAI-powered tools. Troubleshoot and resolve complex technical issues involving in-depth analysis of logs, configurations, and system behaviour. Proactively monitor the health, performance, and
Bristol, Gloucestershire, United Kingdom Hybrid / WFH Options
Curo Resourcing Ltd
domain adjacent technologies/services, such as: Docker, OpenShift, Kubernetes etc. Infrastructure as Code and CI/CD paradigms and systems such as: Ansible, Terraform, Jenkins, Bamboo, Concourse etc. Observability - SRE. Big Data solutions (ecosystems) and technologies such as: Apache Spark and the Hadoop Ecosystem. Excellent knowledge of YAML or similar languages. The following Technical Skills & Experience would be desirable
architectures, as described by thought leaders like Martin Fowler. Hands-on experience building and maintaining complex CI/CD pipelines, preferably with GitHub Actions. Familiarity with monitoring and observability tools (e.g., Prometheus, Grafana, Google Cloud's operations suite). A solid understanding of networking principles and cloud security best practices. Experience with other cloud platforms like Amazon Web Services
London, England, United Kingdom Hybrid / WFH Options
Wallet in Telegram
balancers (we use Nginx/Traefik, AWS ELB/NLB). Skilled in container orchestration using Docker and Kubernetes. Experience with CI/CD processes, specifically with GitLab. Knowledge of observability tools like Prometheus/VictoriaMetrics, Grafana, and ELK/EFK/OpenSearch. Experience with Infrastructure as Code (IaC) using Ansible and Terraform. Scripting abilities in Shell and Python. English proficiency
Slack bots and integrations) to streamline IT operations and business processes. Monitoring and Maintenance: Manage and maintain network security systems through system patches and periodic maintenance tasks. Establish comprehensive observability and proactive issue-resolution strategies using tools like SNMP, Syslog, NetFlow, Elasticsearch (ELK Stack), and Grafana. Collaboration and Communication: Work with Cyber Energia teams to identify functional needs, develop secure
Belfast, Northern Ireland, United Kingdom Hybrid / WFH Options
CME Group
both independently and collaboratively. Key Responsibilities: Collaborate with senior SREs and Product engineering teams to monitor, maintain, and troubleshoot our Markets systems. Collaborate with Product teams to continuously improve observability and alerting of our applications to enable data-driven business decisions, faster issue detection and incident resolution. Take accountability for delivery of moderately complex features. Lead technical discussions for own
London, England, United Kingdom Hybrid / WFH Options
BBC
Code with AWS CDK, CloudFormation to provision and manage cloud environments. Build and maintain CI/CD pipelines using GitHub Actions, AWS CodePipeline, CodeBuild, Jenkins. Integrate monitoring and observability tools such as AWS CloudWatch, Prometheus, Grafana for infrastructure and model health tracking. Ensure software quality through Test-Driven Development (TDD), unit testing frameworks (e.g., pytest, unittest), and automated integration
London, England, United Kingdom Hybrid / WFH Options
9fin
as possible. Designing and implementing a developer portal (e.g. Backstage) to provide a service catalog to the engineering team, and authoring many other useful DevOps plugins. Contributing to observability best practices and providing key SLI/SLO metric reporting, so that the engineering team can balance velocity and reliability. Develop inner/open source projects to help provide a
automated deployments. Familiarity with Helm charts. Experience with Infrastructure as Code (IaC) tools like Terraform. Knowledge of container build and deployment automation using CI/CD pipelines. Experience in observability tools for both MSK and Kubernetes, including Prometheus, Grafana, and AWS CloudWatch for metrics and logs. Deep understanding of Kafka and Kubernetes security practices, including network policies and IAM roles
mentoring engineers and collaborating with stakeholders. Proven ability to resolve technical incidents in unfamiliar production systems. Technical and process documentation champion. Experience of operationally managing production software components, including observability, logging, metrics, error reporting, debugging, and live incident management. Your time will be spent roughly as follows: 60% - Proactive technical work (e.g. migrating DB hosting provider, new message bus system
Python/Java/C Go), with practical experience in automating tasks, building scripts, and solving problems. - Proven track record of successful implementation of DevOps practices and culture (think observability, code reviews, source control management, building feedback loops, removing organization silos...). - Basic understanding of Machine Learning concepts related to model training, deployment, inference, MLOps, etc. PREFERRED QUALIFICATIONS - Experience designing
FX or crypto trading; front-end experience with React or similar frameworks is a plus. Collaborate with the team to implement, configure, and manage comprehensive monitoring, logging, alerting, and observability solutions - advocating for security best practices. Deploy, manage, operate, and scale applications and services on AWS - whilst troubleshooting performance issues across the stack. Collaborative, agile approach, passionate about clean architecture
or DevOps. Expertise in microservices and API design. Docker, and container runtime platforms such as Kubernetes, EKS, ECS etc. Strong understanding of operational concepts on AWS, particularly monitoring and observability, FinOps. Utilising CI/CD tools, such as Bamboo, Jenkins, TeamCity, Bitbucket, in order to streamline delivery of new features and fixes. Continual testing of code using Automated Testing Frameworks
London, England, United Kingdom Hybrid / WFH Options
Smartcat Platform Inc
familiar with DevOps tools and processes. Confidently navigate through Platform Infrastructure. Day 60 Join the process of being on duty in a team, be able to analyze problems, use observability/monitoring tools and handle investigations. Support Production releases and address blockers of CI/CD process. Day 90 Complete two quarter deliverable in alignment with Outcomes. WHAT YOU’VE
London, England, United Kingdom Hybrid / WFH Options
Take-Two Interactive
to ask for help when needed. Strong proficiency in AWS, Azure, or GCP, with hands-on experience with Terraform. Experience with configuration management tools like Ansible or Puppet, and observability tools like Prometheus, Grafana, and Datadog, etc. Design, develop, automate testing, and deploy custom tools using languages like Python or C#. Great to Have: Experience with database administration and performance
Bristol, England, United Kingdom Hybrid / WFH Options
Lloyds Banking Group
collaboration skills, with the ability to influence and align diverse teams on a shared vision. Knowledge of DevOps practices and tools CI/CD pipelines. Knowledge of Monitoring and Observability tooling. In addition, any experience of these would be useful: Experience with data mesh concepts (e.g., domain-driven ownership, and data product thinking). Expertise in GCP services, including BigQuery
London, England, United Kingdom Hybrid / WFH Options
BBC Group and Public Services
as-Code with AWS CDK, CloudFormation to provision and manage cloud environments. Build and maintain CI/CD pipelines using GitHub Actions, AWS CodePipeline, CodeBuild, Jenkins. Integrate monitoring and observability tools such as AWS CloudWatch, Prometheus, Grafana for infrastructure and model health tracking. Ensure software quality through Test-Driven Development (TDD), unit testing frameworks (e.g., pytest, unittest), and automated integration
Salford, England, United Kingdom Hybrid / WFH Options
BBC Group and Public Services
as-Code with AWS CDK, CloudFormation to provision and manage cloud environments. Build and maintain CI/CD pipelines using GitHub Actions, AWS CodePipeline, CodeBuild, Jenkins. Integrate monitoring and observability tools such as AWS CloudWatch, Prometheus, Grafana for infrastructure and model health tracking. Ensure software quality through Test-Driven Development (TDD), unit testing frameworks (e.g., pytest, unittest), and automated integration
Cardiff, Wales, United Kingdom Hybrid / WFH Options
BBC Group and Public Services
as-Code with AWS CDK, CloudFormation to provision and manage cloud environments. Build and maintain CI/CD pipelines using GitHub Actions, AWS CodePipeline, CodeBuild, Jenkins. Integrate monitoring and observability tools such as AWS CloudWatch, Prometheus, Grafana for infrastructure and model health tracking. Ensure software quality through Test-Driven Development (TDD), unit testing frameworks (e.g., pytest, unittest), and automated integration
Glasgow, Scotland, United Kingdom Hybrid / WFH Options
BBC Group and Public Services
as-Code with AWS CDK, CloudFormation to provision and manage cloud environments. Build and maintain CI/CD pipelines using GitHub Actions, AWS CodePipeline, CodeBuild, Jenkins. Integrate monitoring and observability tools such as AWS CloudWatch, Prometheus, Grafana for infrastructure and model health tracking. Ensure software quality through Test-Driven Development (TDD), unit testing frameworks (e.g., pytest, unittest), and automated integration