pipeline scalability and governance while maintaining developer velocity. Observability & Troubleshooting Lead the implementation and usage of modern observability stacks (e.g., OpenTelemetry, Prometheus, Grafana, Splunk, Datadog). Establish SLOs, SLIs, and error budgets with product and engineering teams. Drive root cause identification using distributed tracing, advanced log analysis, and anomaly detection. More ❯
Belfast, Northern Ireland, United Kingdom Hybrid / WFH Options
Ocho
For: Solid frontend skills with React & TypeScript. Experience working within a cloud-native AWS environment. Strong understanding of GraphQL and API development. Experience with Datadog or other monitoring tools is a plus. Someone who loves problem-solving, enjoys team collaboration, and takes pride in building great products. Interested ? Reach out More ❯
in a high-level language (e.g., Python, JavaScript, TypeScript, Java) and familiarity with modern development practices Understanding of Cloud Observability, Monitoring, and Tracing tools (Datadog, CloudWatch, Jaeger, ELK) and how best to leverage to support effective MTTR and mitigate high CFR #LI-Hybrid Our UK benefits: Stock Options Annual Performance More ❯
Lisburn, Northern Ireland, United Kingdom Hybrid / WFH Options
Camlin Group
IaC tools like Terraform, CloudFormation, or Ansible. Strong knowledge of cloud-native technologies (Kubernetes, Docker, Serverless, etc.). Experience with monitoring tools (e.g., Prometheus, Datadog, New Relic). Familiarity with networking concepts, security protocols, and identity management. General Experience building SaaS applications. Strong problem-solving and analytical skills. Excellent communication More ❯
and automation (e.g., Bash, Python, PowerShell). Familiarity with CI/CD pipelines and deployment automation. Experience with environment monitoring tools (e.g., Prometheus, Nagios, Datadog). Knowledge of security best practices and compliance standards in IT environments. Excellent problem-solving, troubleshooting, and analytical skills. Strong communication skills, with the ability More ❯
to cloud platforms (e.g., AWS, GCP or Azure), an understanding of containerisation (e.g., Docker), infrastructure-as-code software (e.g., Terraform), and observability platforms (e.g., Datadog or Grafana). Curiosity : A hunger to learn and grow your skills. Problem solving: Strong analytical problem-solving skills and attention to detail. You have More ❯
Systems, or related fields is desirable but not essential. Nice to have but not essential: Service monitoring and graphing tools (Prometheus + Grafana, Nagios, Datadog) Elastic Stack Repository solutions (JFrog Artifactory, JFrog Bintray) OpenVPN SQL Databases (MongoDB, PostgreSQL, MySQL) Our Values: We work together We believe in people We won More ❯
Systems, or related fields is desirable but not essential. Nice to have but not essential: Service monitoring and graphing tools (Prometheus + Grafana, Nagios, Datadog) Elastic Stack Repository solutions (JFrog Artifactory, JFrog Bintray) OpenVPN SQL Databases (MongoDB, PostgreSQL, MySQL) Our Values: We work together We believe in people We won More ❯
in at least one language (.Net, C#, node, java, go, c/C++, ruby). Experience with logging and monitoring tools such as ELK, DataDog or NewRelic, LogEntries, SumoLogic, etc. BENEFITS: Competitive Base Salary Employee Performance Related Bonus Clear Career Progression Flexible Working Hours Matched Employee Pension Generous Holiday Package More ❯
in at least one language (.Net, C#, node, java, go, c/c++, ruby). Experience with logging and monitoring tools such as ELK, DataDog or NewRelic, LogEntries, SumoLogic, etc. BENEFITS: Competitive Base Salary Employee Performance Related Bonus Clear Career Progression Flexible Working Hours Matched Employee Pension Generous Holiday Package More ❯
Lisburn, Northern Ireland, United Kingdom Hybrid / WFH Options
Camlin Group
deployment tools (e.g., Gitlab CI, Argo workflow, Argo CD). Experience in managing distributed systems like Kafka. Experience with monitoring/logging solutions (e.g., DataDog, ELK, Prometheus). Good understanding of concepts related to computer architecture, data structures and programming practices. Solid understanding of networking, databases, and security principles. Strong More ❯
of which should have focus on Observability. Excellent knowledge and hands-on experience with monitoring, logging, and tracing tools such as Prometheus, VictoriaMetrics, Grafana, Datadog, New Relic, OpenTelemetry, ELK Stack, or similar. Experience with high volume data storage (Structured and unstructured). A strong technical background, with current capabilities and More ❯
models and related infrastructure Monitoring and Observability: Build and maintain comprehensive monitoring and alerting systems for our ML infrastructure and models, leveraging tools like DataDog to ensure system health and performance Collaboration and Mentorship: Collaborate effectively with data scientists, engineers, and other stakeholders. Provide guidance and support to junior team … CD pipelines Containerization and Orchestration: Knowledge of containerization and orchestration technologies (e.g., Docker, Kubernetes) Monitoring and Logging: Experience with monitoring and logging tools like DataDog, Prometheus, or Grafana Data Engineering Skills: Knowledge of event streaming platforms (e.g., Apache Kafka) and SQL database management Strong Communication and Collaboration: Excellent communication skills More ❯
Northern Ireland, United Kingdom Hybrid / WFH Options
Jobgether
GCP), and databases such as MySQL, DynamoDB, or Cassandra. Proficiency with infrastructure-as-code tools like Terraform or Pulumi, and observability tools such as Datadog or CloudWatch. Experience in implementing AI-powered tools for workflow optimization and operational improvements. Proven success in setting up scalable, SLO-driven monitoring strategies in More ❯
you to sleep through the night. Support the production systems on prem (legacy) and GCP which includes build, deployment (CI/CD), and monitoring (Datadog, PagerDuty). Deliver high quality code by following coding standards, performing code reviews and ensuring test coverage at all levels. Collaborate with engineers and other More ❯
Belfast, Northern Ireland, United Kingdom Hybrid / WFH Options
JR United Kingdom
focusing on aspects like optimisation, scalability, and efficiency. You’ll work alongside teams that have joined from world-class tech businesses like NVIDIA, Amazon, Datadog, Vercel, Meta, GitHub, and Uber. Key Responsibilities Partner with customers to identify and address their ML deployment needs. Implement and optimise ML solutions using Python More ❯
Belfast, Northern Ireland, United Kingdom Hybrid / WFH Options
JR United Kingdom
usage analysis Experience with database security, access control, compliance requirements, and disaster recovery. Experience with cloud database services and monitoring tools including Grafana and Datadog Proficiency in Python for database automation and tooling development Knowledge of CI/CD pipelines, infrastructure-as-code, and modern development practices Experience with unit More ❯
similar Must have experience with Kubernetes, CI/CD, and Terraform Database experience with MySQL and NoSQL Monitoring tools like Prometheous, Grafana, NewRelic, Zabbix, DataDog, or Similar Relevant Certifications like GCP Cloud Architect or similar Beneficial, but not essential: Cassandra, and Google Workspace What's in it for you More ❯
Lisburn, Northern Ireland, United Kingdom Hybrid / WFH Options
Camlin Group
Threat Analyst with the ability to create new operational guidelines, processes, and procedures. Experience with e.g. Fortinet Suite, AWS Security Hub, AWS Inspector, FortiAnalyzer, DataDog or similar tools. Able to script in Python to deal with OS, files, and security operations requirements. Good knowledge of Linux and Windows OS vulnerabilities. More ❯
Belfast, Northern Ireland, United Kingdom Hybrid / WFH Options
Ocho
Full-Stack Engineer – Python, AWS, React, TypeScript, GraphQL , Datadog- Applying for this role is straight forward Scroll down and click on Apply to be considered for this position. Remote First | Competitive salary + excellent benefits (up to 10% pension CC ) | Immediate interviews available I’m working with a market-leading … clear career progression. Tech Stack You’ll Be Working With: Backend: Python Frontend: React, TypeScript Infrastructure: AWS (Lambda, ECS, S3, RDS) Data & Monitoring: GraphQL, Datadog DevOps & CI/CD: Docker, Terraform, GitHub Actions What You’ll Be Doing: Building and maintaining scalable full-stack applications in a fast-paced, high … experience. Solid frontend skills with React & TypeScript. Experience working within a cloud-native AWS environment. Strong understanding of GraphQL and API development. Experience with Datadog or other monitoring tools is a plus. Someone who loves problem-solving, enjoys team collaboration, and takes pride in building great products. Interested ? Reach out More ❯