clients products Finding opportunities to exploit cloud native technologies with clients' products Being part of designing and delivering cloud-native applications that deliver on key architectural requirements (scalability, reliability, observability, secure etc) and DevOps best practices Providing technical guidance, mentoring, and support to the development teams and other architects Designing applications that can be supported and maintained Your key skills More ❯
including ontologies (OWL/RDF) and graph databases (e.g., Neo4j). Familiarity with the concepts behind the Model Context Protocol (MCP) or similar advanced agentic architectures. Experience with modern observability stacks, particularly OpenTelemetry. Experience designing multi-tenant enterprise software platforms. Knowledge of enterprise security patterns and identity management systems. More ❯
contracts, data flows, and integration patterns. Technical Stewardship Apply microservice and event-driven architectural patterns, leveraging technologies such as Kafka for asynchronous communication and decoupling. Champion operational excellence, supporting observability, testing, deployment, and incident response practices. Embed and advocate engineering best practices: scalable design, high availability, resilience, and continuous improvement. Skills & Experience Must-have Proven experience designing and delivering architectures More ❯
series storage, and high-frequency analytics. Lead the design and governance of data models that support complex trading strategies, asset optimization, and regulatory reporting. Ensure data quality, lineage, and observability across all layers of the data stack. Strategic Collaboration & Business Alignment Partner with trading desks, quantitative teams, and risk functions to translate business needs into data solutions that enhance decision More ❯
technologies (listed below) is advantageous: Kotlin Cloud Technologies (Kubernetes, Open Shift) Messaging Technologies (Kafka, Solace, TIBCO) Database/Data Store/Data Query Technologies (SQL Server, Trino, Mongo, S3) Observability Technologies (OpenTelemetry, Elastic Stack/ELK, Grafana) This job description provides a high-level review of the types of work performed. Other job-related duties may be assigned as required. More ❯
cloud infrastructure and automation skills using AWS, Terraform, Python, and Lambda functions. This role involves designing, implementing, and maintaining service monitoring solutions while leveraging cloud-native technologies for scalable observability platforms. Key Responsibilities Design KPIs, service definitions, dashboards, and glass tables Configure correlation searches, events, and predictive analytics Build dependency mapping and topology visualization Deploy and manage AWS infrastructure with More ❯
bradford, yorkshire and the humber, united kingdom
Netcompany UK Limited
cloud infrastructure and automation skills using AWS, Terraform, Python, and Lambda functions. This role involves designing, implementing, and maintaining service monitoring solutions while leveraging cloud-native technologies for scalable observability platforms. Key Responsibilities Design KPIs, service definitions, dashboards, and glass tables Configure correlation searches, events, and predictive analytics Build dependency mapping and topology visualization Deploy and manage AWS infrastructure with More ❯
cloud infrastructure and automation skills using AWS, Terraform, Python, and Lambda functions. This role involves designing, implementing, and maintaining service monitoring solutions while leveraging cloud-native technologies for scalable observability platforms. Key Responsibilities Design KPIs, service definitions, dashboards, and glass tables Configure correlation searches, events, and predictive analytics Build dependency mapping and topology visualization Deploy and manage AWS infrastructure with More ❯
migration and optimisation of applications using Azure cloud-native services Write clean, testable, and maintainable code following industry standards Implement and maintain CI/CD pipelines, test automation, and observability practices using Azure DevOps or JIRA Apply AI-powered development tools (e.g., GitHub Copilot) and automation techniques to improve productivity Enhance tools, technologies, and ways of working to foster innovation More ❯
running on Java 21. We're in the process of moving our backend services to Spring Boot. We've invested heavily in our DataDog integration to bring world class observability and monitoring to our systems. We've recently moved to Gitlab and are currently building out our next generation of automated deployment pipelines. We've incorporated some of the best More ❯
Sheffield, South Yorkshire, Yorkshire, United Kingdom
Essential Consulting
Background and Role We are looking to onboard a Senior Software Engineer (Platform Engineer) on a contract basis for a large global bank. The Senior Software Engineer will join a small, dedicated 4-person team within Infrastructure in the Chief More ❯
handsworth, yorkshire and the humber, united kingdom
Essential Consulting
Background and Role We are looking to onboard a Senior Software Engineer (Platform Engineer) on a contract basis for a large global bank. The Senior Software Engineer will join a small, dedicated 4-person team within Infrastructure in the Chief More ❯
AWS Public Cloud infrastructure and implementation of IaC using Terraform. The role will work closely with the SRE and Engineering teams to ensure that the Cloud environment has sufficient observability and is appropriately managed.Skills and experience required: Strong technical operational skills in supporting AWS Cloud Hosted environments, and at least 3 years in an Infrastructure support role Strong understanding of More ❯
AWS Public Cloud infrastructure and implementation of IaC using Terraform. The role will work closely with the SRE and Engineering teams to ensure that the Cloud environment has sufficient observability and is appropriately managed.Skills and experience required: Strong technical operational skills in supporting AWS Cloud Hosted environments, and at least 3 years in an Infrastructure support role Strong understanding of More ❯
environments. Relevant Skills: Experience working in Agile environments Strong understanding of Site Reliability Engineering (SRE) principles Familiarity with Azure DevOps for CI/CD and pipeline management Knowledge of observability tools: Prometheus, Grafana, Loki, Tempo Experience with Infrastructure as Code: Helm, Kustomize Hands-on experience with Tekton and ArgoCD Ability to support and troubleshoot OpenShift Operators (ServiceMesh, ODF, ACS, ACM More ❯
Salary : Up to £100,000 (DOE) + Equity Location : Fully remote (UK only) Stack : Python , TypeScript, GCP, Pub/Sub, SQL & NoSQL, IaC (Terraform), CI/CD (GitHub Actions), Observability tools, AI tooling Youll join a remote first, high-trust engineering team working with a modern, cloud-native stack - with real influence over technical decisions from day one. Youll take More ❯
Solid experience building and deploying services with Java and Spring Boot. Comfort working in a cloud-native environment - Kubernetes (EKS), containers, scaling etc. An interest in observability, using tools like Prometheus and Grafana to keep services healthy and understand usage patterns. Familiarity with AWS services and how to integrate them into modern applications. A keen focus on quality and security More ❯
architectures Knowledge with infrastructure as code IaC (Terraform, CloudFormation, AWS CDK) Understanding of CI/CD pipelines and DevOps practices Experience in serverless application development Proficient with monitoring and observability tools Why Join - Growth & Opportunity: Be part of a thriving company with a culture built on innovation and collaboration. - Flexibility: Enjoy the freedom of remote-first work with regular in More ❯
that automate their processes. Contribute to the development of our Virtual Agent development platform that scales with our product strategy. Ensure our AI services maintain high standards of reliability, observability, availability, and performance. Participate in our machine learning community to influence how we implement machine learning and computer vision technologies, shaping Unitary's future. Take ownership of customer outcomes with More ❯
that automate their processes. Contribute to the development of our AI agent development platform that scales with our product strategy. Ensure our AI services maintain high standards of reliability, observability, availability, and performance. Participate in our machine learning community to influence how we implement machine learning and computer vision technologies, shaping Unitary's future. Take ownership of customer outcomes with More ❯
components such as market data feeds, order gateways, execution algorithms, risk engines, UI dashboards, middle office reconciliation, and account infrastructure. We emphasize event-driven, deterministic system design, real-time observability, and strong security. Our tech stack includes Java (low-latency), Python, Web UI (React/Ag-Grid), Aeron, ClickHouse, Kubernetes, and modern CI/CD tooling, with a strong focus More ❯
Code principles Design an agile release engineering strategy that delivers value incrementally and continuously Support a highly-available live production system, respond to alerts, diagnose problems using logs and observability tooling, triage and resolve incidents What we offer We make sure our team is well looked after with generous salaries and a great benefits package which includes: Enhanced pension with More ❯
California, with additional locations across the globe. What you'll do: As a Site Reliability Engineer at Zefr, you'll apply your expertise in cloud infrastructure, CI/CD, Observability, and core SRE concepts, to deliver high-quality, reliable, and scalable solutions. A significant aspect of this role involves working closely with Zefr's Engineering and Data Science teams ensuring … EKS expected), Helm, Kustomize Service Mesh: Istio CI/CD & Automation: CI/CD Pipelines: GitHub Actions GitOps/Continuous Delivery: Argo CD Primary Scripting/Automation Language: Python Observability & Monitoring: Monitoring & Alerting: Prometheus, Datadog, Pagerduty Telemetry Standards: OpenTelemetry Application & Data Ecosystem (Supporting): Application Languages/Frameworks: Python, FastAPI, Flask, Node.js, React Data Streaming: Apache Kafka Data Processing/Transformation … CircleCI, Argo CD, Flux) Knowledge of IaC and configuration management tools (Terraform, OpenTofu, Crossplane, Pulumi, Ansible, CloudFormation) Strong problem-solving experience, focusing on automation Production experience with Monitoring and Observability tools (Prometheus, Grafana, Datadog, Thanos, New Relic, Open Telemetry) Understanding of Cloud Networking concepts (Mesh Networking, NAT, Load Balancers, SSL Certificates and TLS termination, API Gateways, proxies, etc) Strong written More ❯
for leading and executing the migration of data, dashboards, alerts, and configurations from Splunk systems to Elasticsearch. This role involves deep technical expertise in Splunk architecture, data ingestion, and observability tools, along with strong project management and stakeholder communication skills. Must have skills: -Splunk -ELK Stack -Kibana Nice to have skills: -stakeholder communication skills -strong project management Responsibilities: Minimum number More ❯
for leading and executing the migration of data, dashboards, alerts, and configurations from Splunk systems to Elasticsearch. This role involves deep technical expertise in Splunk architecture, data ingestion, and observability tools, along with strong project management and stakeholder communication skills.Must have skills: -Splunk -ELK Stack -KibanaNice to have skills: -stakeholder communication skills -strong project managementDetailed Job Description: -Ability to deploy More ❯