and Kubernetes Solid understanding of microservices and container-based architecture Experience with build and config tools like Jenkins, Maven, or Gradle Familiarity with observability tools such as Prometheus and Grafana Working knowledge of software design principles and event-driven architecture Hands-on experience with APIs, MQTT (IoT), and test-driven development Background in Agile methodologies such as XP or Scrum More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
INTEC SELECT LIMITED
Conduct architecture reviews, technical audits, and drive adoption of best practices Partner with infrastructure teams to ensure system reliability and operational efficiency Integrate monitoring and logging solutions (e.g., Prometheus, Grafana, ELK) Define strategies for disaster recovery, scaling, and infrastructure resilience Improve observability by enhancing visibility into performance and error metrics Skills and Experience Required 10+ years of backend development experience More ❯
or all of the following: configuration management, orchestration, CI/CD, infrastructure monitoring and telemetry Experience using Agile (e.g. Kanban or Scrum) Familiarity with telemetry tools such as Splunk, Grafana Experience with Web frameworks (BENTO, REACT, Angular, DJANGO) Bloomberg is an equal opportunity employer and we value diversity at our company. We do not discriminate on the basis of age More ❯
multi-account AWS setups. Extensive experience with AWS Organisations Expert-level knowledge of AWS Networking, TLS, and security best practices. Experience with container orchestration (Kubernetes, EKS) and observability tools (Grafana, ELK). A passion for innovation, problem-solving, and delivering high-impact solutions. Working with Control Tower and Landing Zones Why Work For Us? Competitive base salary up to More ❯
or all of the following: configuration management, orchestration, CI/CD, infrastructure monitoring and telemetry Experience using Agile (e.g. Kanban or Scrum) Familiarity with telemetry tools such as Splunk, Grafana Experience with Web frameworks (BENTO, REACT, Angular, DJANGO) Bloomberg is an equal opportunity employer and we value diversity at our company. We do not discriminate on the basis of age More ❯
of technical experience in Cloud DevOps, SaaS, or observability, with 5+ years in leadership roles. Strong hands-on experience with AWS, GCP, Azure, K8S, Terraform and observability tools: Prometheus, Grafana, OpenTelemetry, ELK, Splunk, Datadog, and similar. Proficiency with metrics, logs, traces and APM. Leadership & Global Operations Proven success leading multi-regional or global technical teams with direct management of managers. More ❯
Kafka). Strong grasp of telemetry, observability, and performance monitoring in distributed systems. Track record of technical leadership and setting engineering standards. Nice to Have: Experience with OpenTelemetry , Prometheus, Grafana, or similar observability tooling. Exposure to hybrid-cloud or cloud migration strategies. Familiarity with performance optimisation in low-latency data pipelines. Contributions to DevOps-related communities, blogs, open source, or More ❯
Kafka). Strong grasp of telemetry, observability, and performance monitoring in distributed systems. Track record of technical leadership and setting engineering standards. Nice to Have: Experience with OpenTelemetry , Prometheus, Grafana, or similar observability tooling. Exposure to hybrid-cloud or cloud migration strategies. Familiarity with performance optimisation in low-latency data pipelines. Contributions to DevOps-related communities, blogs, open source, or More ❯
deep understanding of UNIX, Linux, networking (TCP/IP), and databases (both relational and NoSQL). Experience in android and iOS application debugging. Experience with observability tools such as Grafana and Prometheus, and skills in documenting procedures for knowledge management. Strong interpersonal and communication skills to thrive in fast-paced, dynamic environments. NOTE: As part of the operation staff members More ❯
Airflow, or on common problems such as model and API monitoring, data drift and validation, autoscaling, access permissions Have previously worked with monitoring tools such as New Relic or Grafana Understand the use of feature stores and related data technologies for operational machine learning products Are proficient with Python and have Spark knowledge. Have leadership experience either through previous management More ❯
to managing our infrastructure, using Terraform. - We follow a GitOps approach to managing our Kubernetes configuration, using ArgoCD and Helm. - We manage a high-availability metrics collection system using Grafana, Thanos & Prometheus. We're in the process of transitioning to OpenTelemetry and Honeycomb for our application telemetry (traces and metrics). - We manage a data pipeline using Pub/Sub More ❯
field. Preferred Qualifications Proven expertise and experience with database technologies including NoSQL databases like MongoDB, RDBMS such as Postgres and MySQL Exposure to Docker, Kubernetes, AWS, Helm, Terraform, Vault, Grafana, ELK Stack, New Relic Relevant experience in the maintenance of data APIs and data lake architectures, including experience with Apache Iceberg, Trino/Presto, Clickhouse, Snowflake, BigQuery. Master's degree More ❯
provided by GCP/AWS, such as S3, FSX, EKS, SQS, SNS, Kinesis, AmazonMQ, DynamoDB, GKE, CloudStorage, PubSub, Filestore, Knowledge of modern observability technologies such as ELK, Splunk, Prometheus, Grafana, Micrometer "What-if" thinking, while designing or reviewing solutions, to foresee or catch potential problems as early in the development process, as only possible Nice to have: Good knowledge of More ❯
primary language for our backend codebase AWS & GCP - we're cloud-native Kubernetes (EKS) Microservice based architecture RESTful APIs PostgreSQL, JDBI, Flyway TeamCity for CI/CD Terraform and Grafana The Team: The Core Banking group is seeking passionate engineers ready to tackle complex challenges and contribute to foundational systems, powering modern banking, that process millions of transactions daily, ensuring More ❯
applications Familiarity with infrastructure-as-code tools such as Terraform Understanding of security best practices in web infrastructure and application delivery Exposure to observability tooling and techniques (e.g., Prometheus, Grafana, structured logging) Confident in debugging and resolving issues in complex distributed or web-based Systems A product mindset and collaborative approach to improving how teams build and run software Bonus More ❯
IPoE 13. Proven ability to work independently & collaboratively in a fast-paced technical environment. 14. Demonstratable knowledge of the telecommunications industry & technologies. 15. Experience of working with Prometheus and GrafanaMore ❯
City of London, London, United Kingdom Hybrid / WFH Options
Rise Technical Recruitment Limited
Comfortable managing deployments using CI/CD pipelines (Github Actions, Jenkins, etc.) *Solid understanding of cloud infrastructure including AWS, Kubernetes, and contect delivery *Exposure to observability tooling (Datadog, Sentry, Grafana) and performance tuning best practice Reference Number: BBBH259301 To apply for this role or for to be considered for further roles, please click 'Apply Now' or contact Tommy Williams at More ❯
London, South East, England, United Kingdom Hybrid / WFH Options
Rise Technical Recruitment Limited
Comfortable managing deployments using CI/CD pipelines (Github Actions, Jenkins, etc.)*Solid understanding of cloud infrastructure including AWS, Kubernetes, and contect delivery*Exposure to observability tooling (Datadog, Sentry, Grafana) and performance tuning best practice Reference Number: BBBH259301 To apply for this role or for to be considered for further roles, please click "Apply Now" or contact Tommy Williams at More ❯
predictive analytics. Understanding of AI frameworks and libraries (e.g., TensorFlow, PyTorch, Scikit-learn) and their application in network automation and monitoring. Experience with telemetry and observability frameworks (e.g., Prometheus, Grafana) for real-time network monitoring and troubleshooting. Experience : Minimum of 7 years' of experience in network engineering, operations, and support. Proven ability to work hands-on and take strong technical More ❯
for you. Ideally you have several years experience using Go in production. You'll be comfortable with Docker, and familiar with modern observability tools such as Prometheus, Alert Manager, Grafana and X-Ray/Tempo/Jaeger. We're looking for 3+ years tackling hard backend problems Seasoned database experience - we use MySQL, DynamoDB, Elasticsearch and Redis Experience with microservices More ❯
for you. Ideally you have several years experience using Go in production. You'll be comfortable with Docker, and familiar with modern observability tools such as Prometheus, Alert Manager, Grafana and X-Ray/Tempo/Jaeger. We're looking for 3+ years tackling hard backend problems Seasoned database experience - we use MySQL, DynamoDB, Elasticsearch and Redis Experience with microservices More ❯
scale then this role is for you. Minimum Requirements Several years experience using Go in production. Comfortable with Docker. Familiar with modern observability tools such as Prometheus, Alert Manager, Grafana and X-Ray/Tempo/Jaeger. Seasoned database experience - we use MySQL, DynamoDB, Elasticsearch and Redis. Experience with microservices and distributed systems. Used to developing complex services from requirements More ❯
meaningfully to the success of the team and company. Nice to Have: Experience with Nginx, proxies, and managing traffic between different services. Experience with Docker and Kubernetes. Familiarity with Grafana and other monitoring tools. Prior experience with Scala and Java is a plus. Experience with Workday and Microsoft Entra What we offer You will have the opportunity to be part More ❯
for the highly entrepreneurial domain of shipping. Technology Stack We build cloud solutions using C#, .NET Core, Typescript, React, MSSQL Server, Azure, Python, MongoDB, Redis, Docker, Kubernetes, Service Fabric, Grafana and more. We also offer the opportunity to work on state-of-the-art architectures and technology stacks, as well as a global, Artificial Intelligence-powered technology product vision. We More ❯
Willingness to tackle challenging problems and make meaningful contributions to the success of both the team and the organization. Nice to Have: Experience with Docker and Kubernetes. Familiarity with Grafana and other monitoring tools. Prior experience with Scala and Java is an advantage. What we offer You will have the chance to be involved in something impactful, large-scale, and More ❯