scripting skills in Bash, PowerShell, or Python . Strong understanding of security practices in infrastructure design and operations. Experience with monitoring tools such as Prometheus, Grafana, Nagios, or similar . Preferred Skills: Relevant certifications ( AWS Certified Solutions Architect, Microsoft Azure Architect, RHCE, etc.). Experience with CI/CD pipelines More ❯
Experience with modern software development practices: version control, agile, CI/CD. Knowledge of observability tools in distributed systems (e.g., Elasticsearch, Logstash, Kibana, Datadog, Prometheus, Grafana). Experience with cloud providers like AWS and GCP. Willingness to learn new technologies and grow professionally. Ability to work effectively in fast-paced More ❯
City Of London, England, United Kingdom Hybrid / WFH Options
Fruition Group
delivery. Lead deployment strategies and ensure smooth feature rollouts with minimal downtime. Define and manage monitoring, logging, and telemetry using tools like AWS Cloudwatch, Prometheus, and Datadog. Lead incident response and production troubleshooting with a proactive and preventative mindset. Drive automation initiatives with tools like GitlabCI, Terraform/OpenTofu, Ansible More ❯
london (city of london), south east england, United Kingdom Hybrid / WFH Options
Fruition Group
delivery. Lead deployment strategies and ensure smooth feature rollouts with minimal downtime. Define and manage monitoring, logging, and telemetry using tools like AWS Cloudwatch, Prometheus, and Datadog. Lead incident response and production troubleshooting with a proactive and preventative mindset. Drive automation initiatives with tools like GitlabCI, Terraform/OpenTofu, Ansible More ❯
GCP, Azure). Strong knowledge of CI/CD, containerization (Docker, Kubernetes), networking, distributed systems, and databases. Experience with monitoring and troubleshooting tools (DataDog, Prometheus, Grafana, ELK, Splunk, Humio). Excellent problem-solving, attention to detail, and communication skills. Desirable Experience with Azure, autonomous vehicles, or ML/AI projects. More ❯
CI/CD processes, containerization (Docker, Kubernetes), and a deep understanding of networking, distributed systems, and databases. Expert with monitoring and troubleshooting utilities (DataDog, Prometheus, Grafana, ELK stack, Splunk, Humio, etc.). Exceptional problem-solving skills and a detail-oriented mindset, coupled with outstanding communication abilities. Desirable Experience with Azure More ❯
DevOps culture and cloud platforms such as Azure, AWS, or GCP. Knowledge of Kubernetes is desirable. Experience monitoring application performance with tools like Grafana, Prometheus, DataDog, or Sentry. Strong advocate for performant applications and simplicity in problem-solving. Excellent communication skills for collaborating across audiences. Experience with high-traffic service More ❯
cloud-based hosting platforms like AWS, Azure, or GCP and/or experience with hardware-based environments. Familiarity with monitoring systems using tools like Prometheus and writing health checks. Proficiency with one programming language, such as Java, Go, Python, JavaScript, or similar languages. #J-18808-Ljbffr More ❯
Git). Excellent problem-solving skills and attention to detail. Strong communication and teamwork abilities. Preferred Qualifications: Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK Stack). Familiarity with Agile methodologies and DevOps practices. Benefits: Enhanced leave - 38 days inclusive of 8 UK Public Holidays Private Health Care More ❯
Git). Excellent problem-solving skills and attention to detail. Strong communication and teamwork abilities. Preferred Qualifications: Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK Stack). Familiarity with Agile methodologies and DevOps practices. Benefits: Enhanced leave - 38 days inclusive of 8 UK Public Holidays Private Health Care More ❯
e.g., Git) Excellent problem-solving skills and attention to detail Strong communication and teamwork abilities Preferred Qualifications: Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK Stack) Familiarity with Agile methodologies and DevOps practices Benefits: Enhanced leave - 38 days inclusive of 8 UK Public Holidays Private Health Care including More ❯
or Google Cloud (BigQuery). DevOps and Automation: CI/CD: Jenkins, GitLab CI/CD, or CircleCI. IaC: Terraform or AWS CloudFormation. Monitoring: Prometheus, Grafana, or Datadog. #J-18808-Ljbffr More ❯
cloud-based hosting platforms like AWS, Azure, or GCP and/or experience with hardware-based environments. Familiarity with monitoring systems using tools like Prometheus and writing health checks. Proficiency with one programming language, such as Java, Go, Python, JavaScript, or similar languages. Life at Palantir We want every Palantirian More ❯
and/or internals. Experience working with cloud solutions (GCP or AWS). Deep understanding and demonstrable experience with modern monitoring tools such as Prometheus, Datadog, Grafana, Telegraf Experience with infrastructure as code tools. Experience with complex Terraform deployments is a plus. Solid background with configuration management tools. Experience with More ❯
cloud-based hosting platforms like AWS, Azure, or GCP and/or experience with hardware-based environments. Familiarity with monitoring systems using tools like Prometheus and writing health checks. Proficiency with one programming language, such as Java, Go, Python, JavaScript, or similar languages. Life at Palantir We want every Palantirian More ❯
Familiar with databases (SQL or NoSQL). Experience with client/server software architectures & networking, or microservice architectures. Experience with observability tools like Grafana, Prometheus, Open Telemetry and others. Experience with streaming architectures and tools (e.g. Kafka) About Us J.P. Morgan is a global leader in financial services, providing strategic More ❯
containerised environment. Experience with GitOps methodology and argoCD. Experience with CI/CD solutions (Github Actions, Circle CI, Jenkins etc.) Hands-on experience with Prometheus, Grafana, or other comparable monitoring tools Nice to haves Experience with Developer Experience (DevX) Tools and Practices: Proven track record in enhancing developer productivity through More ❯
with Kubernetes is desirable. You have a high degree of experience in observing the performance and health of applications via tools such as Grafana, Prometheus, Data Dog, Sentry, etc. You have a strong desire and are an advocate for performant applications. You have a flair for simplicity when problem solving. More ❯
e.g. NMS applications, controllers, orchestrators, supervisory systems, etc.). Experience and understanding of Kafka messaging bus. Experience in using monitoring tools like Nagios, Grafana, Prometheus and Kibana is desired. Deployment environment: Kubernetes, Docker, microservices. Experience on Talos Kubernetes is an advantage. Deployment experience in cloud-based environment AWS/Azure More ❯
Infrastructure provisioning Process automation Respond to change requests Skills & Experience Oracle DB Docker (with Docker Swarm) Elastic Stack Typescript/React/Node Go Prometheus/Grafana ESRI Maps Ansible Windows & Linux Jenkins Automation skills: Automation is a key skill domain for this role. Specific automation skills are: Continuous Integration More ❯
hardware and software products. Experience implementing automated testing frameworks in a hardware-in-the-loop (HITL) environment. Familiarity with monitoring and logging tools (e.g., Prometheus, ELK stack). Experience with Nix/NixOS. Technical expertise and demonstrated performance in one or more of the following areas: networking, cloud technologies, application More ❯