and associated tooling (Helm, Kustomize). Practical understanding of cloud security, networking protocols, and secure deployments. Experienced in observability stackslogging (e.g., ELK), monitoring (e.g., Prometheus), and tracing (e.g., Jaeger, OpenTelemetry). Cloud certifications (AWS, GCP, or similar) are advantageous. Excellent problem-solving skills, especially in high-availability environments (broadcasting or streaming experience is a plus). Enthusiastic, proactive team player More ❯
Have Experience in e-commerce , fintech, or other large-scale transactional systems. Exposure to event-driven architecture, streaming technologies, or data-intensive applications. Familiarity with observability tools (Prometheus, Grafana, OpenTelemetry). Whats on Offer Competitive day rate Opportunity to work on high-traffic, mission-critical systems at scale. Initial 6-month contract with strong potential for extension. How to Apply More ❯
real-world products Knowledge of LLM frameworks or retrieval-augmented generation (RAG) systems Exposure to event-driven architectures or real-time data streaming Familiarity with observability tools (Prometheus, Grafana, OpenTelemetry) Why Join Youll be part of a small, high-impact team tackling ambitious technical challenges at the intersection of AI and scalable infrastructure . We value curiosity, craftsmanship, and speed More ❯
real-world products Knowledge of LLM frameworks or retrieval-augmented generation (RAG) systems Exposure to event-driven architectures or real-time data streaming Familiarity with observability tools (Prometheus, Grafana, OpenTelemetry) Why Join Youll be part of a small, high-impact team tackling ambitious technical challenges at the intersection of AI and scalable infrastructure . We value curiosity, craftsmanship, and speed More ❯
Lambda, ECS/Fargate, S3, DynamoDB, CloudWatch, API Gateway) Data & Messaging: PostgreSQL, Redis, Kafka or SQS CI/CD & Infrastructure: Docker, Terraform, GitHub Actions, CloudFormation Monitoring & Observability: Prometheus, Grafana, OpenTelemetry Testing: Pytest, integration and load testing frameworks Key Skills & Expertise Proven experience designing and delivering production systems using Python on AWS . Strong understanding of distributed systems, API design, and More ❯
Lambda, ECS/Fargate, S3, DynamoDB, CloudWatch, API Gateway) Data & Messaging: PostgreSQL, Redis, Kafka or SQS CI/CD & Infrastructure: Docker, Terraform, GitHub Actions, CloudFormation Monitoring & Observability: Prometheus, Grafana, OpenTelemetry Testing: Pytest, integration and load testing frameworks Key Skills & Expertise Proven experience designing and delivering production systems using Python on AWS . Strong understanding of distributed systems, API design, and More ❯
native platforms (ArgoCD, Prometheus, Istio, etc.) Prior work with CI/CD tools (GitHub Actions, Azure DevOps) in Kubernetes-based deployment pipelines Experience with UI telemetry and observability (Grafana, OpenTelemetry, etc.) Familiarity with SAP landscape operations or enterprise automation products AI/ML UI/UX feature integration experience is a strong plus Selected applicant will be subject to a More ❯
Python) or JMeter, with data parameterization and correlation. Manage distributed load generation (containers, cloud workers) to simulate millions of concurrent users. Integrate performance metrics from CloudWatch, Prometheus, Grafana, and OpenTelemetry to analyze system bottlenecks. Develop SLA/SLO dashboards and integrate performance gates into CI/CD pipelines. Collaborate with DevOps and developers to tune JVM, Akka, thread pools, GC More ❯
Lead SRE/Observability Engineering Lead - (Outside IR35 Contract/Remote) Location: Bristol/London HQ Largely Remote (Occasional Travel) Day Rate: Outside IR35 650 to 750 p/d Duration: 3-6 Months Initial with intention to extend Payment More ❯
At Frasers Group were rethinking retail. Through digital innovation and unique store experiences, were serving our consumers with the worlds best sports, premium and luxury brands globally. As a leader in the industry, were elevating the retail experience for our More ❯
GCP logging. Leading the Proof of Concept (PoC) implementation to validate the end-to-end data flow. Leading workshops to finalize technical details for GCP Managed Prometheus and the OpenTelemetry collector configuration. Technology Skills: Certified Google Cloud expert Deep knowledge of GKE GCP Monitoring (Cloud Logging, Metrics) GCP Managed Prometheus More ❯