intervention for building and deploying application, services. Monitoring & Logging: Working with SRE team to Setup monitoring and alerting systems using tools like Prometheus, Grafana, Datadog or others to ensure high availability and performance of application and infrastructure Collaboration : Collaborate with architect, operations and developers to ensure seamless integration between development More ❯
GCP) and on-prem infrastructure (VMware, Linux). Automate repetitive tasks to improve operational efficiency. Set up monitoring and alerting systems using Prometheus, Grafana, Datadog, etc. Collaborate with architects, operations, and developers for seamless integration. Implement security protocols, access controls, encryption, vulnerability scanning. Provide troubleshooting support for deployment and DevOps More ❯
as Python, Java Spring Boot, or .Net. Strong knowledge of software applications and emerging technical disciplines. Experience with observability tools like Grafana, Dynatrace, Prometheus, Datadog, Splunk. Experience with CI/CD tools such as Jenkins, GitLab, Terraform. Experience with containerization and orchestration tools like Docker, Kubernetes, ECS. Expertise in SRE More ❯
.Net. Deep knowledge of software applications and technical processes, with emerging expertise in specific technical disciplines. Experience with observability tools like Grafana, Dynatrace, Prometheus, Datadog, Splunk, including monitoring, SLO alerting, and telemetry collection. Proficiency with CI/CD tools such as Jenkins, GitLab, Terraform. Experience with containerization and orchestration tools More ❯
Proficiency and experience in observability such as white and black box monitoring, SLO alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, etc. Proficiency in continuous integration and continuous delivery tools (e.g., Jenkins, GitLab, Terraform, etc.) Experience with container and container orchestration (e.g., ECS, Kubernetes, Docker More ❯
facilitates innovation Experience with continuous integration and continuous delivery tools like Jenkins, GitLab, Terraform Experience in at least one observability tool such as Dynatrace, Datadog, New Relic, CloudWatch, AppDynamics, Splunk., Qualification Experience a plus in common SRE toolchains: Grafana, Prometheus, Elasticsearch, Kibana, Jaeger. About Us J.P. Morgan is a global More ❯
facilitates innovation Experience with continuous integration and continuous delivery tools like Jenkins, GitLab, Terraform Experience in at least one observability tool such as Dynatrace, Datadog, New Relic, CloudWatch, AppDynamics, Splunk., Preferred Qualification Experience a plus in common SRE toolchains: Grafana, Prometheus, Elasticsearch, Kibana, Jaeger. About the Team J.P. Morgan is More ❯
and infrastructure management. AWS certifications are advantageous. · Strong experience with Infrastructure as Code tools (Terraform, CloudFormation) · Familiarity with observability and monitoring tools (CloudWatch, Splunk, Datadog). · Experience managing CI/CD workflows, especially with GitHub Actions. · Strong knowledge of artifact repository management systems like JFrog. · Proficient in Linux administration and More ❯
Docker, Kubernetes, AKS, and Helm. Programming skills in Python, Java, PowerShell, or Go, with understanding of REST APIs. Experience with observability tools such as DataDog, Prometheus, Splunk, Elasticsearch, Grafana, Azure Monitor. Experience with CI/CD tools like Git, Terraform, Jenkins. Azure cloud expertise in mission-critical environments. Additional qualifications More ❯
.Net Deep knowledge of software applications and technical processes, with emerging expertise in specific disciplines Proficiency in observability tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk Experience with CI/CD tools like Jenkins, GitLab, Terraform Experience with containerization and orchestration tools such as ECS, Kubernetes, Docker Expertise in SRE More ❯
Proficiency and experience in observability such as white and black box monitoring, SLO alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, etc. Proficiency in continuous integration and continuous delivery tools (e.g., Jenkins, GitLab, Terraform, etc.) Experience with container and container orchestration (e.g., ECS, Kubernetes, Docker More ❯
pipeline scalability and governance while maintaining developer velocity. Observability & Troubleshooting Lead the implementation and usage of modern observability stacks (e.g., OpenTelemetry, Prometheus, Grafana, Splunk, Datadog). Establish SLOs, SLIs, and error budgets with product and engineering teams. Drive root cause identification using distributed tracing, advanced log analysis, and anomaly detection. More ❯
pipeline scalability and governance while maintaining developer velocity. Observability & Troubleshooting Lead the implementation and usage of modern observability stacks (e.g., OpenTelemetry, Prometheus, Grafana, Splunk, Datadog). Establish SLOs, SLIs, and error budgets with product and engineering teams. Drive root cause identification using distributed tracing, advanced log analysis, and anomaly detection. More ❯
Experience in observability such as white and black box monitoring, service level objective alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, and others. Experience with continuous integration and continuous delivery tools like Jenkins, GitLab, or Terraform. Familiarity with container and container orchestration such as ECS More ❯
and automation (e.g., Bash, Python, PowerShell). Familiarity with CI/CD pipelines and deployment automation. Experience with environment monitoring tools (e.g., Prometheus, Nagios, Datadog). Knowledge of security best practices and compliance standards in IT environments. Excellent problem-solving, troubleshooting, and analytical skills. Strong communication skills, with the ability More ❯
and automation (e.g., Bash, Python, PowerShell). Familiarity with CI/CD pipelines and deployment automation. Experience with environment monitoring tools (e.g., Prometheus, Nagios, Datadog). Knowledge of security best practices and compliance standards in IT environments. Excellent problem-solving, troubleshooting, and analytical skills. Strong communication skills, with the ability More ❯
and automation (e.g., Bash, Python, PowerShell). Familiarity with CI/CD pipelines and deployment automation. Experience with environment monitoring tools (e.g., Prometheus, Nagios, Datadog). Knowledge of security best practices and compliance standards in IT environments. Excellent problem-solving, troubleshooting, and analytical skills. Strong communication skills, with the ability More ❯
Familiar with observability such as white and black box monitoring, service level objective alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, New Relic, CloudWatch, AppDynamics. Familiarity with containers or a common Server OS such as Linux and Windows Emerging knowledge of software, applications and technical More ❯
software applications and technical processes, with emerging expertise in one or more technical disciplines. Proficiency in observability tools such as Grafana, Geneos, Dynatrace, Prometheus, Datadog, Splunk, including monitoring, SLO alerting, and telemetry collection. Experience with CI/CD tools like Jenkins, GitLab, Terraform. Experience with containerization and orchestration tools such More ❯
Java. Experience maintaining a cloud-based infrastructure. Familiarity with site reliability principles, concepts, and practices. Knowledge of observability tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, New Relic, CloudWatch, or AppDynamics. Familiarity with containers or common server operating systems like Linux and Windows. Emerging knowledge of software, applications, and technical More ❯
to cloud platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), infrastructure-as-code software (e.g., Terraform), and observability platforms (e.g., Datadog or Grafana). Curiosity : A hunger to learn and grow your skills. Problem solving: Strong analytical problem-solving skills and attention to detail. You have More ❯
to cloud platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), infrastructure-as-code software (e.g., Terraform), and observability platforms (e.g., Datadog or Grafana). Curiosity : A hunger to learn and grow your skills. Problem solving: Strong analytical problem-solving skills and attention to detail. You have More ❯
and experience in observability such as white and black box monitoring, SLO alerting, and telemetry collection using tools such as Grafana, Geneos, Dynatrace, Prometheus, Datadog, Splunk, etc. Proficiency in continuous integration and continuous delivery tools (e.g., Jenkins, GitLab, Terraform, etc.) Experience with container and container orchestration (e.g., ECS, Kubernetes, Docker More ❯
or Java. Experience maintaining cloud-based infrastructure. Familiarity with site reliability concepts, principles, and practices. Knowledge of observability tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, New Relic, CloudWatch, and AppDynamics, including monitoring, alerting, and telemetry collection. Familiarity with containers and common server operating systems like Linux and Windows. Emerging More ❯
the following would be a plus: Experience with Observability across multiple domains (APM, Infrastructure, Synthetics, Logs, etc ) within cloud and on-premise environments using Datadog, Azure Monitor and Application Insights. NewRelic and Grafana Experience working in B2B SaaS companies Experience with cloud containers, specifically Kubernetes Responsibilities & Duties Develop: Architecture, strategy More ❯