City of London, England, United Kingdom Hybrid / WFH Options
Cyber Security Jobsite
and API design Docker, and container runtime platforms such as Kubernetes, EKS, ECS etc Strong understand of operational concepts on AWS, particularly monitoring and observability, FinOps Utilising CI/CD tools, such as Bamboo, Jenkins, TeamCity, Bitbucket, in order to streamline delivery of new features and fixes Continual testing of more »
pipelines with Jenkins, GitLab CI/CD or similar - Containerize applications using Docker and orchestrate with Kubernetes - Monitor systems with Prometheus, Grafana and apply observability best practices - Automate deployment processes and improve DevOps workflows - Ensure high availability, fault tolerance and disaster recovery of cloud infrastructure - Collaborate with developers and operational more »
all stages of the software delivery lifecycle Passion for mentoring and technical coaching of developers Excellent verbal and written communication skills Preferred: Experience with observability and monitoring tools such as Datadog Familiarity with polyglot persistence and experience with a NoSql DB especially DocumentDB Experience with Kafka, HiveMQ, ElasticSearch Infrastructure as more »
for mentoring and technical coaching of developers Excellent verbal and written communication skills The following skills and experience are nice to have: Experience with observability and monitoring tools such as Datadog Familiarity with polyglot persistence and experience with a NoSql DB especially DocumentDB Experience with Kafka, HiveMQ, ElasticSearch Infrastructure as more »
infrastructure needs, ideally in a self-service environment. WHAT YOU’LL DO: Build and provide platform features for compute, authentication, service discovery and calls, observability and redundancy Lead architectural approaches with the right tradeoffs between scale, cost and maintenance Coach junior engineers about design, development, testing and deployment best practices more »
a security-first focus, baked-in not bolted-on.Desirable Criteria· Previous experience in Python or Golang.· Previous experience of Ansible and Terraform.· Understanding of observability and its implementation.· Understanding and ability to provide quality, automated testing as you develop.· Ability to design solutions independently.· Networking knowledge.· Understanding of data science more »
in GitHub as a Source Code Management (SCM) platform, with hands-on experience in version control and collaborative coding. Experience with Kubernetes Experience with observability tools such as Datadog, Grafana, and Elastic for monitoring, analytics, and proactive issue identification. Experience in incident management, showcasing the ability to respond promptly and more »
normal—and that’s where you come in! We are seeking a skilled Site Reliability Engineer (SRE) with experience in AWS, Serverless, Monitoring, and Observability to join our team. Responsibilities: Design, build and maintain scalable, and reliable cloud infrastructure in AWS Monitor and manage the performance, reliability, and security of more »
/scripting languages such as Python, Java, or Go, utilized for automating deployment, configuration, management, and monitoring processes. Strong understanding and practical experience with observability/monitoring tooling like New Relic, Dynatrace, or Splunk. Ability to define and create monitors/alerts at both infrastructure and application layers. Demonstrated automation more »
and applications in a cloud-first environment. · First-hand deep expertise in engineering ways of working such as CI/CD, release lifecycle, data observability, data testing, continuous model validation with tangible track record of instituting change. · Software development experience in Python, or Scala. Familiarity with all, and expert in more »
Solid understanding of the processes involved in release, integration, and deployment, along with an awareness of the promotion pathways within these processes. Expertise in observability principles and practices, encompassing monitoring, logging, tracing, and alerting systems to ensure transparency and actionable insights into system performance and health. Tools Dynatrace, Datadog Familiarity more »
Solid understanding of the processes involved in release, integration, and deployment, along with an awareness of the promotion pathways within these processes. Expertise in observability principles and practices, encompassing monitoring, logging, tracing, and alerting systems to ensure transparency and actionable insights into system performance and health. Tools Dynatrace, Datadog Familiarity more »
deployment, and development of cloud computing platforms such as Oracle Cloud Infrastructure (OCI)/AWS/Azure is a must. Experience with monitoring and observability using tools like Grafana. Strong understanding of Containers using Docker and Kubernetes. Ability to convert conceptual architectures into functional infrastructures end to end. Ideally some more »
other teams, with documentation and examples. In addition, you should have experience working with Kubernetes in production environments at scale, and be familiar with observability tools such as Prometheus and Grafana. Strong Linux Server Administration and Configuration Management skills, as well as some networking experience, are also required. The ideal more »
normal—and that’s where you come in!We are seeking a skilled Site Reliability Engineer (SRE) with experience in AWS, Serverless, Monitoring, and Observability to join our team.Responsibilities:Design, build and maintain scalable, and reliable cloud infrastructure in AWSMonitor and manage the performance, reliability, and security of our systemsImplement more »
improve skills.Previous ownership of mission-critical shared infrastructure.Nice to HaveExperience with CI/CD systems, in particular Spinnaker.Experience in implementing SRE principles.Good knowledge of observability stacks and tooling (e.g. Grafana, ELK, Prometheus, Tracing).Good discipline and skill in producing written documentation and diagrams.Good knowledge of cloud networking and security.Experience working more »
essential Experience diagnosing and troubleshooting connectivity issues Well-versed in security by design and security by default. Familiarity with SOC compliance Strong command of observability and monitoring in a multi-tenant cloud Experience working in high-performing Agile teams Proficient development in a major programming language such as C#, Python more »
Birmingham, West Midlands, United Kingdom Hybrid / WFH Options
Interact Consulting Limited
on experience with Terraform for Infrastructure as Code (IaC). Proficiency with Docker, Kubernetes, and AKS. Familiarity with Azure cloud-native services. Knowledge of observability and site-reliability engineering principles. Proficiency in SQL and experience working with relational databases. more »
responsibilities include: Monitoring the performance and reliability of the company's global online platforms Troubleshooting issues via proactive and reactive monitoring and alerts using observability tooling and logging service requests Enhancing the tech stack/configurations of existing services to improve site performance and reduce issues Exploring and delivering on more »
the lead on projects to improve our DevOps: CI/CD pipeline (vulnerability scanning, static analysis, tests), blue/green deploys, auto load balancing, observability & instrumentation, infrastructure as code (eg Terraform) etc. Take the lead on projects to refactor our codebase, separating domain-specific logic, application logic and UI code more »
solutions including for compute, storage, messaging and networking Good understanding of Docker and experience with CI/CD tooling Good understanding of security and observability best practices and tooling Effective communication and collaboration skills What else about you would excite us? Experience building and maintaining high-traffic server-side web more »
to SRE/Platform Engineering, or are still working as a Java Developer extensive experience with AWS, Kubernetes, Terraform, CI/CD tools strong observability experience, ideally with more modern approaches like Prometheus, Grafana, Open Telemetry comfortable with databases exposure to Kafka would be ideal more »
Northern Ireland, United Kingdom Hybrid / WFH Options
SoftwareOne
CI/CD pipeline development IaC, specifically Terraform or a wrapper Kubernetes, or a Cloud-managed container orchestrator, e.g. AKS, EKS Experience operating an observability stack preferred Bonus points for being security-minded About You: You have experience operating a mature, high volume stack in the cloud You have opinions more »
CI/CD pipeline development IaC, specifically Terraform or a wrapper Kubernetes, or a Cloud-managed container orchestrator, e.g. AKS, EKS Experience operating an observability stack preferred Bonus points for being security-minded About You: You have experience operating a mature, high volume stack in the cloud You have opinions more »