Hawthorne, California, United States Hybrid / WFH Options
GCR Professional Services
with hardware-in-the-loop (HIL) testing environments. Improve monitoring, logging, and debugging capabilities for embedded applications. Manage containerization and virtualization of embedded development environments using tools like Kubernetes, Grafana and OpenTelemetry Research and implement best practices for security, performance, and scalability. Automate software releases and version control strategies for embedded firmware. Skills and/or Experience Needed: MS or More ❯
or Python with the ability to code confidently Infrastructure as Code (Ansible, Terraform, or equivalent) Containerisation using Docker, with orchestration via Kubernetes or Docker Swarm Monitoring expertise with ELK, Grafana, or equivalent tools CI/CD experience with TeamCity Knowledge of Microsoft SQL Server Experience in both Windows and Linux administration Excellent communication skills for both client and team interactions More ❯
and disaster recovery initiatives. Working knowledge of cloud-native storage solutions such as Longhorn. Strong Linux administration skills, particularly with RHEL environments. Experience implementing comprehensive observability solutions using Prometheus, Grafana, Loki, and related tools. Ability to establish and enforce security policies through tools like Open Policy Agent. Knowledge of identity management solutions such as Keycloak. Experience managing artifact repositories including More ❯
/Deployment Pipelines (GitLab CI) • Linux (RHEL/CentOS, Ubuntu) • Containerisation (Docker) and Container Orchestration Systems (Elastic Kubernetes Service) • Continuous inspection (Sonarqube, Wiz) • Logging (CloudWatch, CloudTrail, Splunk) • Monitoring (Prometheus, Grafana) #LI-DT1 Together, as owners, let’s turn meaningful insights into action. Life at CGI is rooted in ownership, teamwork, respect and belonging. Here, you’ll reach your full potential More ❯
language (PowerShell, Python, Bash) Familiarity with Linux systems Strong understanding of cloud security principles and implementation of security controls in Azure. Experience with infrastructure monitoring tools such as Prometheus, Grafana, or Azure Monitor Strong analytical mindset with exceptional troubleshooting and problem-solving abilities Ability to plan and organize one's own work. Accurately report issues and progress Excellent communication skills More ❯
or Azure, including infrastructure provisioning, automation, and monitoring. Experience with implementing, managing, and using observability tools, data visualization, and application monitoring platforms such as Dynatrace, AWS CloudWatch, Azure Monitor, Grafana, Prometheus, or Datadog. Familiarity with error budgets and their role in balancing reliability and innovation. Direct experience building, launching, configuring, and maintaining AWS and/or Microsoft Azure cloud resources. More ❯
pipelines to ensure code quality and reliability; Experience of work with Docker for containerisation and application packaging; Experience of implementing and managing monitoring solutions, with experience in Prometheus and Grafana for observability and alerting. Experience of implementing and managing robust security practices, including Encryption (TLS) and Secret Management in the Cloud; Experience of leveraging GitLab API for advanced automation, integration More ❯
DevOps to optimize build times, parallelize tests, and reduce pipeline flakiness. Result Analysis & Root Cause • Analyze test outputs, system logs, and metrics (e.g., via ELK Stack or Prometheus/Grafana) to pinpoint failures and performance regressions. • Lead root-cause investigations for infrastructure incidents, producing clear post-mortem reports and remediation recommendations. Defect Management • Log, triage, and track defects in Jira More ❯
a proven ability to provide technical leadership and mentor colleagues. Experience with containers and orchestration tools (e.g., Docker, Kubernetes). Extensive knowledge of monitoring and observability tools (e.g., Prometheus, Grafana). A strong background working with large-scale digital transformation projects in the public sector. More ❯
container orchestration platforms such as Kubernetes or Amazon ECS to streamline application deployment, scaling, and management. Monitoring and Logging: Implement monitoring and logging solutions using tools such as Prometheus, Grafana, ELK Stack, or Datadog to monitor system performance, detect issues, and troubleshoot problems proactively. Security and Compliance: Implement security best practices and compliance standards within DevOps processes and infrastructure, ensuring More ❯
or Python - ideally with prior development experience. Infrastructure as Code expertise (e.g., Ansible, Terraform). Containerisation with Docker and orchestration via Kubernetes or Docker Swarm. Monitoring experience with ELK , Grafana , or similar tools. CI/CD experience with TeamCity. Microsoft SQL Server. Windows and Linux administration skills. Excellent communication skills, with the ability to work closely with technical and non More ❯
Azure (Functions, Service Bus, etc.). Exposure to CI/CD automation, infrastructure as code, and automated testing frameworks. Familiarity with monitoring tools and practices for observability (e.g., Prometheus, Grafana, ELK stack). Damia Group Limited acts as an employment agency for permanent recruitment and employment business for the supply of temporary workers. By applying for this job you accept More ❯
Kubernetes concepts. Some experience with Linux systems and basic scripting (Bash, Python, or similar). Interest in CI/CD tools and processes. Eagerness to learn observability tools (Prometheus, Grafana, Datadog, etc.). Problem-solving mindset and willingness to troubleshoot with guidance. Strong written and verbal communication skills, with the ability to clearly articulate technical concepts to both technical and More ❯
Columbia, Maryland, United States Hybrid / WFH Options
Codescratch LLC
development tool suites. Preferred Skills and Experience: Experience with Docker and Kubernetes Experience with Hadoop Experience with Spark Experience with Accumulo Experience monitoring application performance with metrics (Prometheus, InfluxDB, Grafana) and logs with ELK Stack (ElsticSearch, Logstash, Kibana) Experience with asynchronous messaging systems (RabbitMQ, Apache Kafka, etc.) Location: Columbia Annex, MD (60%+ telework) Salary Range: $115,000 - $200,000.00 More ❯
standard software development tool suites. Preferred Skills and Experience: Experience with Docker and Kubernetes Experience with Virtual Machines Experience with Networking Experience monitoring application performance with metrics (Prometheus, InfluxDB, Grafana) and logs with ELK Stack (ElasticSearch, Logstash, Kibana) Have, or obtain Security+ certification or equivalent DoD 8570 IAT II certification Location Fort Eisenhower, GA (Appx 50% hybrid telework) Salary Range More ❯
principles and automation tools such as SaltStack, Puppet, and Ansible In-depth experience with trouble-shooting large Linux Clusters Demonstrated experience using system monitoring tools such as Prometheus/Grafana Experience with containerization technologies such as Docker Demonstrated experience administrating/monitoring Kubernetes clusters Experience with the Atlassian Tool Suite (JIRA, Confluence) Experience using Git for version control Position Desired More ❯
and using REST and/or RPC APIs Desired Skills Experience with Messaging Frameworks such as Kafka, ActiveMQ, and RabbitMQ Experience with tools used for metrics visualization such as Grafana and Kibana Experience with Git Source Control System Experience with the Atlassian Tool Suite (JIRA, Confluence More ❯
Strong Git experience (GitLab, GitHub, branching strategies, peer reviews). Familiarity with serverless (AWS Lambda, API Gateway) or cloud-native architectures . Knowledge of logging, monitoring, and observability (ELK, Grafana, CloudWatch). Understanding of GDS Technology Code of Practice and DDaT capability framework. More ❯