for mentoring and technical coaching of developers Excellent verbal and written communication skills The following skills and experience are nice to have: Experience with observability and monitoring tools such as Datadog Familiarity with polyglot persistence and experience with a NoSql DB especially DocumentDB Experience with Kafka, HiveMQ, ElasticSearch Infrastructure as more »
Warwick, Warwickshire, West Midlands, United Kingdom
Tata Technologies Europe Ltd
for mentoring and technical coaching of developers Excellent verbal and written communication skills The following skills and experience are nice to have: Experience with observability and monitoring tools such as Datadog Familiarity with polyglot persistence and experience with a NoSql DB especially DocumentDB Experience with Kafka, HiveMQ, ElasticSearch Infrastructure as more »
services and technologies, particularly AWS, to optimize the performance and security of our cloud infrastructure. Monitor system health, performance, and availability using monitoring and observability tools, proactively identifying and resolving issues. Collaborate with cross-functional teams to troubleshoot and resolve complex infrastructure issues, minimizing downtime and improving system reliability. Mentor more »
infrastructure needs, ideally in a self-service environment. WHAT YOU’LL DO: Build and provide platform features for compute, authentication, service discovery and calls, observability and redundancy Lead architectural approaches with the right tradeoffs between scale, cost and maintenance Coach junior engineers about design, development, testing and deployment best practices more »
in GitHub as a Source Code Management (SCM) platform, with hands-on experience in version control and collaborative coding. Experience with Kubernetes Experience with observability tools such as Datadog, Grafana, and Elastic for monitoring, analytics, and proactive issue identification. Experience in incident management, showcasing the ability to respond promptly and more »
normal—and that’s where you come in! We are seeking a skilled Site Reliability Engineer (SRE) with experience in AWS, Serverless, Monitoring, and Observability to join our team. Responsibilities: Design, build and maintain scalable, and reliable cloud infrastructure in AWS Monitor and manage the performance, reliability, and security of more »
/scripting languages such as Python, Java, or Go, utilized for automating deployment, configuration, management, and monitoring processes. Strong understanding and practical experience with observability/monitoring tooling like New Relic, Dynatrace, or Splunk. Ability to define and create monitors/alerts at both infrastructure and application layers. Demonstrated automation more »
Solid understanding of the processes involved in release, integration, and deployment, along with an awareness of the promotion pathways within these processes. Expertise in observability principles and practices, encompassing monitoring, logging, tracing, and alerting systems to ensure transparency and actionable insights into system performance and health. Tools Dynatrace, Datadog Familiarity more »
mesh Strong knowledge of CI/CD tools such as Jenkins. Experience with configuration management tools (Ansible, Puppet or Chef) Expertise in monitoring and observability technologies like Prometheus, Appdynamics, Splunk, Jaeger, Kiali, Open Telemetry. Expertise in cloud and DevOps, familiar to network (VPC) and firewall on cloud, iAM, cloud costing more »
essential Experience diagnosing and troubleshooting connectivity issues Well-versed in security by design and security by default. Familiarity with SOC compliance Strong command of observability and monitoring in a multi-tenant cloud Experience working in high-performing Agile teams Proficient development in a major programming language such as C#, Python more »
Birmingham, West Midlands, United Kingdom Hybrid / WFH Options
Interact Consulting Limited
on experience with Terraform for Infrastructure as Code (IaC). Proficiency with Docker, Kubernetes, and AKS. Familiarity with Azure cloud-native services. Knowledge of observability and site-reliability engineering principles. Proficiency in SQL and experience working with relational databases. more »
responsibilities include: Monitoring the performance and reliability of the company's global online platforms Troubleshooting issues via proactive and reactive monitoring and alerts using observability tooling and logging service requests Enhancing the tech stack/configurations of existing services to improve site performance and reduce issues Exploring and delivering on more »
the lead on projects to improve our DevOps: CI/CD pipeline (vulnerability scanning, static analysis, tests), blue/green deploys, auto load balancing, observability & instrumentation, infrastructure as code (eg Terraform) etc. Take the lead on projects to refactor our codebase, separating domain-specific logic, application logic and UI code more »
solutions including for compute, storage, messaging and networking Good understanding of Docker and experience with CI/CD tooling Good understanding of security and observability best practices and tooling Effective communication and collaboration skills What else about you would excite us? Experience building and maintaining high-traffic server-side web more »
Farnborough, Hampshire, South East, United Kingdom Hybrid / WFH Options
Interact Consulting Limited
IaC hands on Terraform is a must. Excellent understanding of networking principles (IP addressing, virtual networks, network security and networking models). Understanding of observability and site-reliability principles (SLO's, SLI's) and working with engineering teams to improve the applications and platform. Good understanding of SQL and working more »
using Python and TypeScript/JavaScript. Strong API design skills and a familiarity with building web applications. A proponent of great testing, first-class observability and automating everything. Familiarity with security principles and practices, including authentication, authorization, and encryption. Experience with cloud technologies (e.g. Docker, Kubernetes, Helm & Terraform) and platforms more »
Greater Manchester, England, United Kingdom Hybrid / WFH Options
Clarus WMS
skills, and the capacity to work well in a team environment are often considered essential. Understanding of DevOps principles and practices, including monitoring, logging, observability and infrastructure management. Why Join Clarus Software? Here are some of the reasons you will love working here: A team of amazing people. We are more »
cloud platform engineering, Transforming to a multi-cloud operating model, multi-cloud data platform, multi-cloud container platform, multi-cloud AI platform, SRE, Business observability , etc. Ability to translate strategic business priorities into supporting technology programs Exposure to some of these industry sectors (Banking, Financial Services, Retail sectors, Energy & Resources more »
primarily in an AWS Cloud environment using approaches/tooling similar to ours (see technical skills), delivering scalable, performant, reliable solutions. Data reliability/observability, data governance and information security credentials What technical skills are required Ability to implement and use modern data platform concepts, frameworks and tooling, covering all more »
required: Strong Cloud experience with AWS and AWS Services Containerisation/Orchestration with Kubernetes Strong understanding of IaC with Terraform Wealth of Monitoring and Observability experience Knowledge of Security/DevSecOps practices This position can offer £90-110K, plus benefits, and operates a hybrid working model (with 3 office more »
a Lead DevOps Engineer Design and develop cutting edge computing infrastructure in a modern cloud environment - Azure and Kubernetes (AKS). Design and develop observability and monitoring solutions to optimize the product and end user experience. Leverage cutting edge technologies to ensure CI/CD in Azure DevOps Mentor junior more »
able to build automation tools (we use Python, but any language is fine) Extensive experience with AWS, K8s, Terraform Experience designing, building and maintaining observability tooling and CI/CD pipelines Understanding of Security, Networking and Databases 👍 Bonus points for: Having worked as a software engineer before shifting your focus more »
servers, networks, and cloud infrastructure. Address faults and incidents affecting uptime. Understand architecture and service interdependencies. Manage security vulnerabilities and implement mitigations. Ensure comprehensive observability and monitoring. Implement and test data backups and restores. Collaborate to triage and resolve issues. Provision resources for new projects and products. Attend project meetings more »
Helm-Charts, YAML) and familiarity with tools like Visual Studio Code and Git are necessary. Good knowledge of Infrastructure as Code with Azure and observability tools such as Prometheus or Loki will make you a perfect candidate. The ability to digest new information and apply critical thinking, possibly demonstrated by more »