experience with AWS cloud infrastructure • Deep understanding of IaC tools: Terraform, Packer, CloudFormation • Proven leadership in multidisciplinary delivery teams • Skills in Databases: MongoDB/Atlas, Messaging: Kafka, Observability: Prometheus, Grafana, Splunk • Experience of working in a DevOps environment - favouring and implementing Continuous Integration & Deployment over manual processes. • Experience of designing, implementing, securing and supporting Unix/Linux based platforms (ideally More ❯
. Strong understanding of distributed systems, microservices architecture, and RESTful API design. Hands-on experience with Kubernetes and container orchestration. Familiarity with monitoring, alerting, and logging tools (e.g., Prometheus, Grafana, ELK stack, or Datadog). Experience with Elastic will be highly helpful with this position. Hands-on experience with incident response, including designing and improving incident management processes. Expertise in More ❯
Static Analysis Tools - SonarQube Scripting - Shell, Power Shell Knowledge of Virtualisation - VmWare, VirtualBox Enterprise use of Git and/mercurial or Bitbucket Monitoring tools such as Prometheus, Sensu and Grafana Experience with cloud platforms (AWS, Azure, or GCP). Experience with containerization (Docker) and orchestration (Kubernetes). Knowledge of Infrastructure as Code tools (Terraform, Ansible, or similar). Unit and More ❯
be advantageous: Software development in web technologies or OOP (e.g., Python, Java, etc.) Database tech: Oracle SQL, PostgreSQL, MongoDB Proficient with Linux/Windows command line (Bash, PowerShell) Monitoring: Grafana, Prometheus, ELK, Splunk Agile working and tooling (e.g., Jira, Confluence) Diagnosing and resolving complex system issues ITIL knowledge or exposure to IT service operations Containerisation: Docker, Kubernetes, OpenShift Awareness of More ❯
engineering teams, QA, DevOps, and product managers to deliver integrated solutions Mentor engineers in architectural thinking and AI-assisted development Ensure architectural alignment across systems with observability using Prometheus, Grafana, ELK Stack Required Skills & Qualifications: Master's degree in Computer Science, Software Engineering, or related field 8+ years of software engineering experience, with 3+ years in architectural or staff engineering More ❯
Columbia, Maryland, United States Hybrid / WFH Options
Codescratch LLC
development tool suites. Preferred Skills and Experience: Experience with Docker and Kubernetes Experience with Hadoop Experience with Spark Experience with Accumulo Experience monitoring application performance with metrics (Prometheus, InfluxDB, Grafana) and logs with ELK Stack (ElsticSearch, Logstash, Kibana) Experience with asynchronous messaging systems (RabbitMQ, Apache Kafka, etc.) Location: Columbia Annex, MD (60%+ telework) Salary Range: $115,000 - $200,000.00 More ❯
standard software development tool suites. Preferred Skills and Experience: Experience with Docker and Kubernetes Experience with Virtual Machines Experience with Networking Experience monitoring application performance with metrics (Prometheus, InfluxDB, Grafana) and logs with ELK Stack (ElasticSearch, Logstash, Kibana) Have, or obtain Security+ certification or equivalent DoD 8570 IAT II certification Location Fort Eisenhower, GA (Appx 50% hybrid telework) Salary Range More ❯
Sonatype Nexus Knowledge and working experience of containerising application components including writing DockerFiles and deploying to Kubernetes Deep understanding of pipelines as code Observability concepts and tooling; Opensearch, Cribl, Grafana, Prometheus, CloudWatch Experience of working with agile teams Job Band & Level: Manager/7 #Li-Hybrid #LI-MID Not The Perfect Fit? Concerned that you may not meet the criteria More ❯
pipelines using modern DevOps tools and methodologies. Responsibilities Working with infrastructure as code (IaC) tools (e.g., Terraform, CloudFormation). Setting up and maintaining monitoring and alerting tools (e.g., Prometheus, Grafana, Nagios). Managing log management and aggregation tools (e.g., ELK Stack, Splunk). Understand and implement security best practices in accordance with DevOps principles. Implement Continuous Integration and Continuous Delivery More ❯
estatescontributing to their operational success through proactive insight and incident prevention. What you'll do Design, implement, and manage observability solutions using industry-leading tools such as Dynatrace (primary), Grafana, and Splunk Collect and analyse telemetry data (metrics, logs, traces, events) to diagnose and resolve system and application performance issues Integrate monitoring platforms with ITSM tools (e.g. ServiceNow) and CI … present monitoring solutions and technical designs Proactively identify and highlight risks that could impact solution success What you'll need Strong experience deploying and managing observability platforms including Dynatrace, Grafana, and/or Splunk Deep understanding of telemetry signal analysis and performance monitoring Experience integrating observability tools with ITSM platforms and DevOps toolchains Ability to troubleshoot complex infrastructure and application More ❯
contributing to their operational success through proactive insight and incident prevention. What you'll do Design, implement, and manage observability solutions using industry-leading tools such as Dynatrace (primary), Grafana, and Splunk Collect and analyse telemetry data (metrics, logs, traces, events) to diagnose and resolve system and application performance issues Integrate monitoring platforms with ITSM tools (e.g. ServiceNow) and CI … present monitoring solutions and technical designs Proactively identify and highlight risks that could impact solution success What you'll need Strong experience deploying and managing observability platforms including Dynatrace, Grafana, and/or Splunk Deep understanding of telemetry signal analysis and performance monitoring Experience integrating observability tools with ITSM platforms and DevOps toolchains Ability to troubleshoot complex infrastructure and application More ❯
contributing to their operational success through proactive insight and incident prevention. What you'll do Design, implement, and manage observability solutions using industry-leading tools such as Dynatrace (primary), Grafana, and Splunk Collect and analyse telemetry data (metrics, logs, traces, events) to diagnose and resolve system and application performance issues Integrate monitoring platforms with ITSM tools (e.g. ServiceNow) and CI … present monitoring solutions and technical designs Proactively identify and highlight risks that could impact solution success What you'll need Strong experience deploying and managing observability platforms including Dynatrace, Grafana, and/or Splunk Deep understanding of telemetry signal analysis and performance monitoring Experience integrating observability tools with ITSM platforms and DevOps toolchains Ability to troubleshoot complex infrastructure and application More ❯
APIs Experience with Git Source Control System Position Desired Skills Experience with Messaging Frameworks such as Kafka, ActiveMQ, and RabbitMQ Experience with tools used for metrics visualization such as Grafana and Kibana Experience with containerization technologies such as Docker Experience with the Atlassian Tool Suite (JIRA, Confluence More ❯
Newcastle Upon Tyne, Tyne and Wear, England, United Kingdom
True North Group
prior history as developer or the ability to code Infrastructure As Code - Ansible, Terraform or equivalent Containerisation using Docker, and orchestration via Kubernetes or Docker Swarm Monitoring with ELK, Grafana or equivalent CI/CD with TeamCity Microsoft SQL Server Windows and Linux administration experience The ability to communicate clearly and professionally with our clients as well as team members More ❯
prior history as developer or the ability to code Infrastructure As Code - Ansible, Terraform or equivalent Containerisation using Docker, and orchestration via Kubernetes or Docker Swarm Monitoring with ELK, Grafana or equivalent CI/CD with TeamCity Microsoft SQL Server Windows and Linux administration experience The ability to communicate clearly and professionally with our clients as well as team members More ❯
prior history as developer or the ability to code Infrastructure As Code - Ansible, Terraform or equivalent Containerisation using Docker, and orchestration via Kubernetes or Docker Swarm Monitoring with ELK, Grafana or equivalent CI/CD with TeamCity Microsoft SQL Server Windows and Linux administration experience The ability to communicate clearly and professionally with our clients as well as team members More ❯
prior history as developer or the ability to code Infrastructure As Code - Ansible, Terraform or equivalent Containerisation using Docker, and orchestration via Kubernetes or Docker Swarm Monitoring with ELK, Grafana or equivalent CI/CD with TeamCity Microsoft SQL Server Windows and Linux administration experience The ability to communicate clearly and professionally with our clients as well as team members More ❯
Sonatype Nexus Knowledge and working experience of containerising application components including writing DockerFiles and deploying to Kubernetes Deep understanding of pipelines as code Observability concepts and tooling; Opensearch, Cribl, Grafana, Prometheus, CloudWatch #J-18808-Ljbffr More ❯
in Linux/Unix environments and CLI tools like psql. • Experience with logical and physical replication, partitioning, and backup strategies. • Familiarity with monitoring tools (e.g., pg_stat_statements, Prometheus, Grafana). • Knowledge of scripting languages for automation and tooling. Preferred Qualifications: • Experience with containerized environments (Docker, Kubernetes). • Familiarity with cloud platforms (AWS RDS, GCP Cloud SQL, or Azure Database More ❯
and disaster recovery initiatives. Working knowledge of cloud-native storage solutions such as Longhorn. Strong Linux administration skills, particularly with RHEL environments. Experience implementing comprehensive observability solutions using Prometheus, Grafana, Loki, and related tools. Ability to establish and enforce security policies through tools like Open Policy Agent. Knowledge of identity management solutions such as Keycloak. Experience managing artifact repositories including More ❯
languages, such as C#, Python, Perl, Java, C++ CICD tools such as Azure DevOps, GitHub Actions, Gitlab, Jenkins, TeamCity Scripting languages such as PowerShell, bash Observability/Monitoring: Prometheus, Grafana, Splunk Containerisation tools such as Docker, K8S, OpenShift, EC, containers Hosting technologies such as IIS, nginx, Apache, App Service, LightSail Analytical and creative approach to problem solving We encourage you More ❯