manage AWS Connect contact centre solutions to enhance customer communication capabilities Transform routine tasks through automation, dramatically improving system efficiency Leverage cutting-edge monitoring tools including Grafana, CloudWatch, and DataDog to ensure peak system performance Troubleshoot complex infrastructure challenges across development, testing, and production environments Implement fortress-like security practices across all infrastructure and services Collaborate with brilliant development teams More ❯
Southlake, Texas, United States Hybrid / WFH Options
Charles Schwab
with Monitoring Tools, Linux administration; Proficient in Kafka administration, including installing software, modifying configuration files, and agent management. Highly efficient multi-tasker with great organizational skills. Splunk, Grafana, and Datadog experience a plus. Duties will include: On-boarding new Kafka producer and consumer use cases. Engineering and supporting the enterprise telemetry pipeline. Testing and deploying software upgrades. Managing and supporting More ❯
Washington, Washington DC, United States Hybrid / WFH Options
Promise Co
troubleshooting. Hands-on experience with cloud platforms (AWS, Azure, or GCP). Expertise in Infrastructure-as-Code (IaC) using Terraform or similar tools. Proficiency in monitoring tools (e.g., Prometheus, Datadog) and health check implementation. Experience with containerization (Docker, Podman, Kubernetes). Scripting experience (Python, Bash, or equivalent) to automate infrastructure management. Knowledge of networking and security best practices for cloud More ❯
Brookfield, Wisconsin, United States Hybrid / WFH Options
Milliman
platforms. Work with additional data technologies such as Redis, Kafka, Amazon SQS, etc. Familiar with Terraform and infrastructure as code concepts Familiarity with setting up dashboards and alerting in Datadog and AWS Troubleshooting and Support: Provide expert-level support for database-related issues Perform root cause analysis and implement long-term solutions for recurring problems Collaborate with development teams to More ❯
Manchester, North West, United Kingdom Hybrid / WFH Options
In Technology Group Limited
Job Title: Cloud Engineer Location: Manchester (hybrid- 1-2 days onsite) Industry: Financial Services/Fintech Salary: £50,000-60,000 per annum Overview: We are seeking a skilled and motivated Cloud Engineer to join our dynamic IT team at More ❯
raised our Series D. This brings our total funding to over $330M from top-tier investors, including Accel, Nvidia, Kleiner Perkins, Google and top founders and operators including Stripe, Datadog, Miro, Webflow, and Facebook. About the role DevOps is a culture not a team, but right now, we don't have a better name for the group of people working … driving down costs. Application development. If you're currently an application engineer working in Python or NodeJS with a strong operational slant, that can work well for us. Observability (Datadog), with a strong focus on enabling and empowering Engineering teams to understand their product in Production. SaaS Networking. Geolocation-based performance, the path to multi-region, frontend performance optimisation. We More ❯
Greater Leeds Area, United Kingdom Hybrid / WFH Options
Anson McCade
pipelines (e.g., GitHub Actions, CircleCI). 💡 Bonus Skills That Impress Delivery into cloud platforms (AWS, Azure, GCP). Familiarity with relational and NoSQL databases. Experience with observability tools (e.g., DataDog, Prometheus). Test automation know-how. Exposure to open-source tools and community practices. Ready to build what matters? Apply now to shape the future of digital engineering in an More ❯
/7 on-call support for critical events Strong track record with AWS ecosystem Terraform, Docker, and modern CI/CD pipelines Proven experience working with observability tools like DataDog, Prometheus, Grafana, and log stacks such as ELK or CloudWatch Solid scripting skills in Python, Go or NodeJS Understanding of network protocols Strong problem-solving mindset and a sense of More ❯
Leeds, England, United Kingdom Hybrid / WFH Options
Anson McCade
leadership experience. Desirable • Previous work in secure public sector or central government programmes. • Familiarity with cloud platforms (AWS, Azure, or GCP). • Experience with logging and monitoring stacks (e.g. DataDog, Prometheus, ELK). • Database expertise (SQL, NoSQL, data modelling). • Knowledge of test automation frameworks and shift-left testing approaches. Benefits Package • Paid overtime – contractually included (up to 200 hours More ❯
the past) Prompt Engineering Weaviate (we use it for RAG in LLM powered tasks and for hybrid searches) Kubernetes (we run Weaviate and other specific services on Kubernetes) CircleCI DataDog Auth0 (we use it, but we would rather not have to) Role Based Access Control JsonSchema Pusher (we use it to push notifications to our Frontend Web App) LocalStack (we More ❯
Charlotte, North Carolina, United States Hybrid / WFH Options
Total Quality Logistics
pipelines for database management Strong desire to be a self-learner and/or a "pioneer" Experience with containerization (Docker, Kubernetes) preferred Familiarity with observability tools (SolarWinds, OpenTelemetry, and/or Datadog) desired Deep understanding of at least one database technology (SQL Server, PostgreSQL, MongoDB, Elasticsearch, and/or Azure) preferred Experience with multi-region and globally distributed databases preferred About Us Total More ❯
Cincinnati, Ohio, United States Hybrid / WFH Options
Total Quality Logistics
pipelines for database management Strong desire to be a self-learner and/or a "pioneer" Experience with containerization (Docker, Kubernetes) preferred Familiarity with observability tools (SolarWinds, OpenTelemetry, and/or Datadog) desired Deep understanding of at least one database technology (SQL Server, PostgreSQL, MongoDB, Elasticsearch, and/or Azure) preferred Experience with multi-region and globally distributed databases preferred Where you'll More ❯
Tampa, Florida, United States Hybrid / WFH Options
Total Quality Logistics
pipelines for database management Strong desire to be a self-learner and/or a "pioneer" Experience with containerization (Docker, Kubernetes) preferred Familiarity with observability tools (SolarWinds, OpenTelemetry, and/or Datadog) desired Deep understanding of at least one database technology (SQL Server, PostgreSQL, MongoDB, Elasticsearch, and/or Azure) preferred Experience with multi-region and globally distributed databases preferred Where you'll More ❯
Chicago, Illinois, United States Hybrid / WFH Options
Charles Schwab
maintenance, and incident management, this could be the role for you. What you have Required Qualifications: 3+ years hands-on experience with monitoring and observability tools (e.g., Prometheus, Grafana, Datadog, Splunk), with a proven track record of setting up dashboards and alerts Developed scripts or tools that reduced repetitive operational toil 7+ years of experience with a systems programming language More ❯
United, Pennsylvania, United States Hybrid / WFH Options
Bright Horizons
experience in automation tools and programming/scripting languages (e.g., PowerShell, Python, Bash) to deliver improvements at a small and large scale. Strong understanding of Observability tools (e.g., Dynatrace, Datadog, New Relic etc.) and best practices, to implement effective monitoring of SLI/SLO/SLAs. Strong experience and understanding of software engineering, Infrastructure as Code (Ansible or Terraform) and More ❯
Cambourne, Cambridgeshire, United Kingdom Hybrid / WFH Options
Remotestar
Job description RemoteStar is looking to hire a Senior Site Reliability Engineering Manager on behalf of our client based in the UK with a fully remote work policy. About Client: The client is building the B2B marketplace for diamonds. It's More ❯
Portsmouth, Hampshire, United Kingdom Hybrid / WFH Options
Checkatrade
Senior Platform Engineer Experience in Cloud Native technologies? Come join us! Are you looking for a new role? We have an exciting opportunity at Checkatrade for a Senior Platform Engineer to join our mission of making home improvements easy by More ❯
Tampa, Florida, United States Hybrid / WFH Options
Aquarius Professional Staffing
Experience with scalable, highly available system design focusing on performance and resiliency Experience with Infrastructure as Code (Terraform), containerization (Docker, Kubernetes), CI/CD (Jenkins, CircleCI), operational tools (ServiceNow, DataDog) Experience with DevOps practices and coaching others Microsoft Azure and cloud experience is a plus Willingness to put in extra effort to meet deadlines for high-profile projects Seniority level More ❯
San Diego, California, United States Hybrid / WFH Options
Sony Interactive Entertainment
Go, and Node.js to collaborate, automate and improve quality of software deliverables; building and deploying infrastructure as code in CloudFormation and Terraform; and utilizing application monitoring tools, including DataDog, CloudWatch, SumoLogic, Kubernetes, ECS, and EKS to improve observability and resiliency of software applications, reducing MTTD and MTTR. Telecommuting and/or working from home may be permissible pursuant to More ❯
Leeds, England, United Kingdom Hybrid / WFH Options
Anson McCade
Nice to Have (But Not Essential) Cloud experience: AWS, Azure or GCP Solid grasp of databases and data modelling Familiarity with open-source tools and monitoring platforms (e.g., Prometheus, DataDog ) Experience with test automation frameworks and performance tools Ready to Shape the Future of Secure Tech? Apply now with your CV and start your journey with one of the most More ❯
Raleigh, North Carolina, United States Hybrid / WFH Options
Hyperdrive Recruiting
by example, infusing modern SRE best practices and fostering a culture of reliability and performance. Provide technical guidance and mentorship to team members. Ideal Background Hands-on experience with Datadog, OpenTelemetry, Sentry, Sumo Logic or similar monitoring and observability platforms. Proficiency in a modern programming language, with a proven ability to write clean, maintainable, and efficient code; Ruby, Rails, and More ❯
infrastructure using ECS, Lambda, and CloudWatch CI/CD pipelines with GitHub Actions, Terraform, and IaC Testing via Cypress (E2E), Jest/Vitest (unit) Observability with tools like Sentry, Datadog, or New Relic You won’t be starting from scratch; the platform is already live and scaling, but there is real scope to improve code quality, DevOps maturity, and feature More ❯
Solid understanding of network configurations, security groups, and cross-account connectivity in AWS. Familiarity with DevOps practices, version control (Git), and automation frameworks. Familiarity with monitoring tools such as CloudWatch, Datadog, or similar. Excellent problem-solving, communication, and collaboration skills. Relevant certifications, such as AWS Certified Solutions Architect or AWS Certified Database Specialty, are a plus. Advantageous What we value most More ❯
code reviews to develop clean, secure, testable and maintainable code Gain exposure to DevOps tools and processes, including containerisation (e.g., Docker), CI/CD pipelines, and observability platforms (e.g., Datadog, Grafana) Troubleshoot and resolve technical issues in collaboration with your team Stay curious and proactive, learning new technologies and contributing ideas to innovate our platform What We're Looking For More ❯
Seattle, Washington, United States Hybrid / WFH Options
DAT Freight Solutions
/Infrastructure teams. Total of 10+ years industry experience 3+ years of software engineering experience (JavaScript, Python, Go, Java/Kotlin, C++, etc) Extensive experience with modern observability tools (Datadog preferred). Extensive experience with cloud platforms (preferably AWS). Demonstrated success in leading large technical initiatives, including design, project management and gaining executive buy-in. Proven experience modernizing legacy More ❯