language (PowerShell, Python, Bash) Familiarity with Linux systems Strong understanding of cloud security principles and implementation of security controls in Azure. Experience with infrastructure monitoring tools such as Prometheus, Grafana, or Azure Monitor Strong analytical mindset with exceptional troubleshooting and problem-solving abilities Ability to plan and organize one's own work. Accurately report issues and progress Excellent communication skills More ❯
or Azure, including infrastructure provisioning, automation, and monitoring. Experience with implementing, managing, and using observability tools, data visualization, and application monitoring platforms such as Dynatrace, AWS CloudWatch, Azure Monitor, Grafana, Prometheus, or Datadog. Familiarity with error budgets and their role in balancing reliability and innovation. Direct experience building, launching, configuring, and maintaining AWS and/or Microsoft Azure cloud resources. More ❯
pipelines to ensure code quality and reliability; Experience of work with Docker for containerisation and application packaging; Experience of implementing and managing monitoring solutions, with experience in Prometheus and Grafana for observability and alerting. Experience of implementing and managing robust security practices, including Encryption (TLS) and Secret Management in the Cloud; Experience of leveraging GitLab API for advanced automation, integration More ❯
DevOps to optimize build times, parallelize tests, and reduce pipeline flakiness. Result Analysis & Root Cause • Analyze test outputs, system logs, and metrics (e.g., via ELK Stack or Prometheus/Grafana) to pinpoint failures and performance regressions. • Lead root-cause investigations for infrastructure incidents, producing clear post-mortem reports and remediation recommendations. Defect Management • Log, triage, and track defects in Jira More ❯
container orchestration platforms such as Kubernetes or Amazon ECS to streamline application deployment, scaling, and management. Monitoring and Logging: Implement monitoring and logging solutions using tools such as Prometheus, Grafana, ELK Stack, or Datadog to monitor system performance, detect issues, and troubleshoot problems proactively. Security and Compliance: Implement security best practices and compliance standards within DevOps processes and infrastructure, ensuring More ❯
or Python - ideally with prior development experience. Infrastructure as Code expertise (e.g., Ansible, Terraform). Containerisation with Docker and orchestration via Kubernetes or Docker Swarm. Monitoring experience with ELK , Grafana , or similar tools. CI/CD experience with TeamCity. Microsoft SQL Server. Windows and Linux administration skills. Excellent communication skills, with the ability to work closely with technical and non More ❯
Azure (Functions, Service Bus, etc.). Exposure to CI/CD automation, infrastructure as code, and automated testing frameworks. Familiarity with monitoring tools and practices for observability (e.g., Prometheus, Grafana, ELK stack). Damia Group Limited acts as an employment agency for permanent recruitment and employment business for the supply of temporary workers. By applying for this job you accept More ❯
Kubernetes concepts. Some experience with Linux systems and basic scripting (Bash, Python, or similar). Interest in CI/CD tools and processes. Eagerness to learn observability tools (Prometheus, Grafana, Datadog, etc.). Problem-solving mindset and willingness to troubleshoot with guidance. Strong written and verbal communication skills, with the ability to clearly articulate technical concepts to both technical and More ❯
Columbia, Maryland, United States Hybrid / WFH Options
Codescratch LLC
development tool suites. Preferred Skills and Experience: Experience with Docker and Kubernetes Experience with Hadoop Experience with Spark Experience with Accumulo Experience monitoring application performance with metrics (Prometheus, InfluxDB, Grafana) and logs with ELK Stack (ElsticSearch, Logstash, Kibana) Experience with asynchronous messaging systems (RabbitMQ, Apache Kafka, etc.) Location: Columbia Annex, MD (60%+ telework) Salary Range: $115,000 - $200,000.00 More ❯
standard software development tool suites. Preferred Skills and Experience: Experience with Docker and Kubernetes Experience with Virtual Machines Experience with Networking Experience monitoring application performance with metrics (Prometheus, InfluxDB, Grafana) and logs with ELK Stack (ElasticSearch, Logstash, Kibana) Have, or obtain Security+ certification or equivalent DoD 8570 IAT II certification Location Fort Eisenhower, GA (Appx 50% hybrid telework) Salary Range More ❯
principles and automation tools such as SaltStack, Puppet, and Ansible In-depth experience with trouble-shooting large Linux Clusters Demonstrated experience using system monitoring tools such as Prometheus/Grafana Experience with containerization technologies such as Docker Demonstrated experience administrating/monitoring Kubernetes clusters Experience with the Atlassian Tool Suite (JIRA, Confluence) Experience using Git for version control Position Desired More ❯
and using REST and/or RPC APIs Desired Skills Experience with Messaging Frameworks such as Kafka, ActiveMQ, and RabbitMQ Experience with tools used for metrics visualization such as Grafana and Kibana Experience with Git Source Control System Experience with the Atlassian Tool Suite (JIRA, Confluence More ❯
in Linux/Unix environments and CLI tools like psql. • Experience with logical and physical replication, partitioning, and backup strategies. • Familiarity with monitoring tools (e.g., pg_stat_statements, Prometheus, Grafana). • Knowledge of scripting languages for automation and tooling. Preferred Qualifications: • Experience with containerized environments (Docker, Kubernetes). • Familiarity with cloud platforms (AWS RDS, GCP Cloud SQL, or Azure Database More ❯
Orlando, Florida, United States Hybrid / WFH Options
INSPYR Solutions
to manage competing demands. Experience integrating with enterprise applications Knowledge of Tools Desired: Atlassian Suite (Jira, Confluence), GitHub, GitLab, Jenkins Knowledge of additional tools including ServiceNow, Elastic APM, Prometheus, Grafana, SolarWinds, SQL is a plus Experience with Linux Operating System Experience with Windows IIS Servers a plus Extensive knowledge of software and systems development and Agile methodologies/processes. Agile More ❯
. Strong understanding of distributed systems, microservices architecture, and RESTful API design. Hands-on experience with Kubernetes and container orchestration. Familiarity with monitoring, alerting, and logging tools (e.g., Prometheus, Grafana, ELK stack, or Datadog). Experience with Elastic will be highly helpful with this position. Hands-on experience with incident response, including designing and improving incident management processes. Expertise in More ❯
Reigate, Surrey, England, United Kingdom Hybrid / WFH Options
esure Group
performance and resilience. Qualifications What we’d love you to bring: Experience of AWS (particularly EC2, EKS, Lambda, S3, IAM, etc) Monitoring/alerting tools (for example we use Grafana, Prometheus, Loki, CloudWatch and Dynatrace) Knowledge of monitoring best practices for a variety of different platforms and technologies Docker and Kubernetes Git/Gitlab Jenkins/CI/CD/ More ❯
a pipeline Bachelor's degree in Computer Science, Engineering, or equivalent practical experience Desired Skills Exposure to bare metal provisioning tools (Ironic, MaaS) Hands on use of observability platforms (Grafana, Prometheus, Splunk) Familiarity with public cloud services (AWS, GCP, Azure) Basic understanding of data center networking and security frameworks (NIST, STIGs) OpenStack certification (e.g., Certified OpenStack Administrator) Our Commitment to More ❯
/Golang Provisioning software/frameworks (Elasticsearch/Spark/Hadoop/Airflow/PostgreSQL) Infrastructure Management - CasC, IasC (Ansible, Terraform, Packer) Log and metric aggregation with Fluentd, Prometheus, Grafana, Alertmanager Public Cloud, primarily GCP and Azure, but also AWS What do I need to have? Take pride in designing, building and delivering high quality well engineered solutions to complex More ❯