and support 6-8 years of experience in writing automation scripts, building application dashboards for proactive monitoring, setting up Alerts for early determination of the issues in Splunk, Grafana, Datadog etc 6-8 years of experience practicing SDLC (Software Development Lifecycle) practice, process improvements Hands on enterprise systems administration, monitoring, and deployment activities Experience with Windows 2016, 2019, 2022 hosted More ❯
Phoenix, Arizona, United States Hybrid / WFH Options
Charles Schwab
and support 6-8 years of experience in writing automation scripts, building application dashboards for proactive monitoring, setting up Alerts for early determination of the issues in Splunk, Grafana, Datadog etc 6-8 years of experience practicing SDLC (Software Development Lifecycle) practice, process improvements Hands on enterprise systems administration, monitoring, and deployment activities Experience with Windows 2016, 2019, 2022 hosted More ❯
Manchester, Lancashire, England, United Kingdom Hybrid / WFH Options
Clarke Recruitment Solutions
needed) Working with Docker and container orchestration (ECS/EKS, Helm) Streamlining and optimising CI/CD pipelines (GitHub Actions/GitLab CI) Setting up and managing observability with Datadog, CloudWatch, Prometheus/Grafana Automating deployments and improving recovery, redundancy, and capacity planning Supporting Linux environments (Ubuntu/CentOS) Getting involved in incident response and helping us prevent problems before … and automation tools Hands-on with containers and orchestration (Docker, ECS/EKS, Helm) Experience with CI/CD pipelines (GitHub Actions or GitLab CI) Familiarity with monitoring tools (Datadog, CloudWatch, Prometheus, Grafana) Confident scripting in Python and Bash Strong communication skills and collaborative mindset Nice to have (not essential): Experience with Azure or GCP Knowledge of networking (VPC Peering … of your game A collaborative, supportive team environment where your input matters Tech stack you’ll work with AWS | Terraform | Ansible | Docker | ECS/EKS | GitHub Actions | GitLab CI | Datadog | CloudWatch | Prometheus | Grafana | Linux | Python | Bash If you’re passionate about automation, thrive on solving complex problems, and want your work to make a genuine difference when it matters most More ❯
with DNS, NTP, SAML, OAuth 2, Active Directory Understanding of systems monitoring, alerting and analytics (Prometheus, New Relic, AppDynamics, Cacti, Graphite, ELK, Nagios, Ganglia, Splunk, Log Insight, Realize Operations, Datadog, etc.) Background in Windows deployment, automation, and related technologies Experience with database (e.g. MySQL, PostgreSQL, Oracle, MSSQL) and messaging middleware technologies (e.g. RabbitMQ) Participated in or led agile development teams More ❯
Reigate, Surrey, England, United Kingdom Hybrid / WFH Options
Client Server Ltd
in Azure (will also consider AWS or GCP experience) You have a deep understanding of cloud infrastructure and services including best practices around monitoring, scaling and security tools e.g. DataDog You have strong scripting skills with PowerShell (or Python) You have a good knowledge of basic networking, TCP/IP You have a good understanding of IaC, they use Pulumi More ❯
Reigate, Surrey, England, United Kingdom Hybrid / WFH Options
Client Server Ltd
in Azure (will also consider AWS or GCP experience) You have a deep understanding of cloud infrastructure and services including best practices around monitoring, scaling and security tools e.g. DataDog You have strong scripting skills with PowerShell (or Python) You have a good knowledge of basic networking, TCP/IP You have a good understanding of IaC, they use Pulumi More ❯
ClaimCenter and other systems, including PAS, document management systems, and external data providers. Platform Monitoring : Determine requirements for specific alerts, set up alerts for various events and thresholds, utilise Datadog logs and dashboards for error analysis, and track DXC downtime while communicating updates to users. Platform Updates : Conduct a 3-way merge of updated code, validate new versions, and implement More ❯
level production incidents The Person: 5+ years in SRE, DevOps, or infrastructure engineering Strong experience with AWS, EKS/Kubernetes, and Terraform Familiar with Kafka and observability tools like Datadog or Grafana Able to troubleshoot issues across infrastructure and application layers Reference number: BBBH259300 To apply for this role or for to be considered for further roles, please click "Apply More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Rise Technical Recruitment
level production incidents The Person: *5+ years in SRE, DevOps, or infrastructure engineering *Strong experience with AWS, EKS/Kubernetes, and Terraform *Familiar with Kafka and observability tools like Datadog or Grafana *Able to troubleshoot issues across infrastructure and application layers Reference number: BBBH(phone number removed) To apply for this role or for to be considered for further roles More ❯
Employment Type: Permanent
Salary: £80000 - £90000/annum 38 Days Holiday, Healthcare, Pension
London, South East, England, United Kingdom Hybrid / WFH Options
Rise Technical Recruitment Limited
level production incidents The Person: *5+ years in SRE, DevOps, or infrastructure engineering*Strong experience with AWS, EKS/Kubernetes, and Terraform*Familiar with Kafka and observability tools like Datadog or Grafana*Able to troubleshoot issues across infrastructure and application layers Reference number: BBBH259300 To apply for this role or for to be considered for further roles, please click "Apply More ❯
North West London, London, United Kingdom Hybrid / WFH Options
ByteHire
of infrastructure setup and management Exposure to designing or building distributed systems, preferably in a cloud environment Company Tech Stack PHP, Laravel, ReactJS, TypeScript, Inertia, WordPress MySQL, Redis, ElasticSearch, DataDog, AWS, Terraform, Docker Benefits Hybrid working 1-2 days per week in the London office. Collaborate directly with the founding team and take ownership of product features. Be part of More ❯
Southlake, Texas, United States Hybrid / WFH Options
Charles Schwab
SLO strategy for at least 5 teams, ensuring alignment with business and client objectives. Three or more years hands-on experience with monitoring and observability tools (e.g., Prometheus, Grafana, Datadog, Splunk), with a proven track record of setting up dashboards and alerts. Developed at least 5 scripts or tools that reduced repetitive operational toil. Led or participated in at least More ❯
Omaha, Nebraska, United States Hybrid / WFH Options
Charles Schwab
SLO strategy for at least 5 teams, ensuring alignment with business and client objectives. Three or more years hands-on experience with monitoring and observability tools (e.g., Prometheus, Grafana, Datadog, Splunk), with a proven track record of setting up dashboards and alerts. Developed at least 5 scripts or tools that reduced repetitive operational toil. Led or participated in at least More ❯
Papillion, Nebraska, United States Hybrid / WFH Options
Charles Schwab
SLO strategy for at least 5 teams, ensuring alignment with business and client objectives. Three or more years hands-on experience with monitoring and observability tools (e.g., Prometheus, Grafana, Datadog, Splunk), with a proven track record of setting up dashboards and alerts. Developed at least 5 scripts or tools that reduced repetitive operational toil. Led or participated in at least More ❯
Richfield, Ohio, United States Hybrid / WFH Options
Charles Schwab
SLO strategy for at least 5 teams, ensuring alignment with business and client objectives. Three or more years hands-on experience with monitoring and observability tools (e.g., Prometheus, Grafana, Datadog, Splunk), with a proven track record of setting up dashboards and alerts. Developed at least 5 scripts or tools that reduced repetitive operational toil. Led or participated in at least More ❯
Phoenix, Arizona, United States Hybrid / WFH Options
Charles Schwab
SLO strategy for at least 5 teams, ensuring alignment with business and client objectives. Three or more years hands-on experience with monitoring and observability tools (e.g., Prometheus, Grafana, Datadog, Splunk), with a proven track record of setting up dashboards and alerts. Developed at least 5 scripts or tools that reduced repetitive operational toil. Led or participated in at least More ❯
SLO strategy for at least 5 teams, ensuring alignment with business and client objectives. Three or more years hands-on experience with monitoring and observability tools (e.g., Prometheus, Grafana, Datadog, Splunk), with a proven track record of setting up dashboards and alerts. Developed at least 5 scripts or tools that reduced repetitive operational toil. Led or participated in at least More ❯
Chicago, Illinois, United States Hybrid / WFH Options
Charles Schwab
SLO strategy for at least 5 teams, ensuring alignment with business and client objectives. Three or more years hands-on experience with monitoring and observability tools (e.g., Prometheus, Grafana, Datadog, Splunk), with a proven track record of setting up dashboards and alerts. Developed at least 5 scripts or tools that reduced repetitive operational toil. Led or participated in at least More ❯
Cicero, Illinois, United States Hybrid / WFH Options
Charles Schwab
SLO strategy for at least 5 teams, ensuring alignment with business and client objectives. Three or more years hands-on experience with monitoring and observability tools (e.g., Prometheus, Grafana, Datadog, Splunk), with a proven track record of setting up dashboards and alerts. Developed at least 5 scripts or tools that reduced repetitive operational toil. Led or participated in at least More ❯
Riverside, Illinois, United States Hybrid / WFH Options
Charles Schwab
SLO strategy for at least 5 teams, ensuring alignment with business and client objectives. Three or more years hands-on experience with monitoring and observability tools (e.g., Prometheus, Grafana, Datadog, Splunk), with a proven track record of setting up dashboards and alerts. Developed at least 5 scripts or tools that reduced repetitive operational toil. Led or participated in at least More ❯
Lone Tree, Colorado, United States Hybrid / WFH Options
Charles Schwab
SLO strategy for at least 5 teams, ensuring alignment with business and client objectives. Three or more years hands-on experience with monitoring and observability tools (e.g., Prometheus, Grafana, Datadog, Splunk), with a proven track record of setting up dashboards and alerts. Developed at least 5 scripts or tools that reduced repetitive operational toil. Led or participated in at least More ❯
Oak Park, Illinois, United States Hybrid / WFH Options
Charles Schwab
SLO strategy for at least 5 teams, ensuring alignment with business and client objectives. Three or more years hands-on experience with monitoring and observability tools (e.g., Prometheus, Grafana, Datadog, Splunk), with a proven track record of setting up dashboards and alerts. Developed at least 5 scripts or tools that reduced repetitive operational toil. Led or participated in at least More ❯
Elmwood Park, Illinois, United States Hybrid / WFH Options
Charles Schwab
SLO strategy for at least 5 teams, ensuring alignment with business and client objectives. Three or more years hands-on experience with monitoring and observability tools (e.g., Prometheus, Grafana, Datadog, Splunk), with a proven track record of setting up dashboards and alerts. Developed at least 5 scripts or tools that reduced repetitive operational toil. Led or participated in at least More ❯
Ann Arbor, Michigan, United States Hybrid / WFH Options
Charles Schwab
SLO strategy for at least 5 teams, ensuring alignment with business and client objectives. Three or more years hands-on experience with monitoring and observability tools (e.g., Prometheus, Grafana, Datadog, Splunk), with a proven track record of setting up dashboards and alerts. Developed at least 5 scripts or tools that reduced repetitive operational toil. Led or participated in at least More ❯