Strong documentation and decision-recording capabilities. Experience in on-prem and hybrid cloud solutions. Strong security awareness, including identity and access management. Expertise in observability and monitoring solutions. What They Offer : Learning Opportunities: Access to training, workshops, and professional development courses to enhance your skills and knowledge. Dedicated learning and More ❯
teams to execute effectively. DataOps and Optimization: Drive DataOps principles to improve workflows. Partner with platform teams to establish CI/CD pipelines and observability standards, enhancing operational efficiency and reliability. Data Governance and Quality: Embed governance, security, and quality practices. Define access control, data lineage, and compliance standards. Promote More ❯
enterprise solutions in infrastructure and applications. They deliver software-defined storage, private cloud solutions, container runtime environments, as well as databases, message queues, identity, observability, analytics, machine learning, and web publishing capabilities. Their customers appreciate that they can get security compliance and support for everything open source from a single More ❯
Enablement and Optimization: Drive the adoption of modern DataOps principles to streamline engineering workflows. Partner with platform teams to establish CI/CD pipelines, observability standards that improve operational efficiency, reliability, and speed across data pipelines. Data Governance and Quality Assurance: Embed governance, security, and data quality practices into engineering More ❯
backend and frontend codebases; blur the line between FE and BE to deliver holistic solutions Integrate third-party APIs, AI providers, and logging/observability tools as needed Is this you? Minimum 5+ years of experience in full stack software engineering Expert-level knowledge of Python, especially in building production More ❯
engineering. You are a natural communicator and build productive relationships with stakeholders and peers. Technologies you'll work with Java jMeter Grafana and other observability platforms Our Commitment to Diversity, Equity, Inclusion and Belonging Build your career in a place that thrives on diversity, equity, inclusion, and belonging. We believe More ❯
comprehensive approach to data control, compliance, and security; unconstrained by their infrastructure providers. Our platform mitigates data security risks while enhancing communication, automation, and observability across data flows, enabling teams to collaborate effortlessly across the organisation. We have hubs in London and New York, and we are looking for people More ❯
of the art system! Key job responsibilities You are interested in being part of a high-bar team with global responsibility for the Network Observability software used across all of Amazon's businesses. Solving problems at scale excites you. You do not implement specs from analysts and architects - you talk More ❯
scalability. Topics you may work with include driving the display, connectivity, power management, over-the-air updates, and various generalized APIs for diagnostics and observability of the hardware of our products. You will also be a part of a multidisciplinary team based in London consisting of middleware, cloud, and embedded More ❯
Maidenhead, Berkshire, United Kingdom Hybrid / WFH Options
Squared Up
selling B2B SaaS products at an enterprise level. Successful track record selling to Service Providers, ideally within the Microsoft Partner community. Experience in the observability/monitoring space is highly desirable. Must have a good understanding of how MSPs operate and the value they add to their customers. Have managed More ❯
company relies Leading the team strategy, including defining and tracking its success measures Driving the operational excellence of the team, including its on-call, observability and security postures Working closely with our telecommunication and networking partners and managing those relationships Building a team of diverse & experienced engineers, and guiding them More ❯
A willingness to take ownership from implementation to production, which implies experience in testing, containerisation, continuous integration and delivery, authentication/authorisation, telemetry/observability/monitoring. Experience developing within micro-frontend architectures. Some experience, professional or otherwise, developing applications for Kubernetes. CKAD is a plus, but by no means More ❯
digital world with an end-to-end full stack platform made for a hybrid, multi-cloud world. Leading enterprises use our unified security and observability platform to keep their digital systems secure and reliable. Our customers love our technology, but it's our caring employees that make Splunk stand out More ❯
customers to a desired outcome, without prescribing it Authoritative skills at cloud computing (network, security, serverless, Kubernetes etc) and automation Experience with implementation of Observability and Reliability using market technologies (e.g.: New Relic, Dynatrace) Good experience with Performance Engineering (load testing, derivations, tuning, core web vitals, page speed etc.) Expertise More ❯
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
Promote Project
Secondary Responsibilities Assist with support and bug triage. Assist with CI/CD pipeline as necessary. Assist with E2E tests as necessary. Improve application observability with logging and automated alerting. Explain technical concepts to non-technical stakeholders. Guide and mentor other engineers of all levels with their professional growth and More ❯
Technical expertise: Strong knowledge of Cybersecurity and associated standards, including PCI-DSS, TSA, Secure-by-Design. Knowledge of technology and operational landscape for SOC, Observability, Zero-Trust, Knowledge of at least one out of the two technology domains: Cloud Technology and Infrastructure (Private Cloud, Public Cloud, Telco Cloud) Networking (IP More ❯
Data Science, Platform, and Product teams to ensure AI and ML models are deployed, monitored, and maintained effectively. Promote best practices in model deployment, observability, CI/CD for ML, and responsible AI principles, helping engineers embed these into their workflows. Oversee the development and implementation of scalable AI and More ❯
SRE), you will play a key role in ensuring the reliability, scalability, and efficiency of our clients' platforms. Your focus will include building strong observability practices, aligning with the SRE mindset & principles, and driving continuous improvement. This will involve: Defining and implementing Service Level Indicators (SLIs) and Service Level Objectives … support, depending on client requirements. Key expectations from this role include: As a Consultant : Lead site reliability engineering initiatives with a strong emphasis on observability, ensuring high performance and reliability of applications & infrastructure. Provide strategic insights to shape the overall SRE strategy while collaborating on the design and implementation of … solutions. Establish effective monitoring, alerting and incident response strategies to maintain system availability and promote continuous improvement by collaborating with team members to deliver observability best practices and SRE methodologies. As part of your role you will also have the opportunity to contribute to the business and your own personal More ❯
implement reusable cloud-uptime components as code. Regularly review and optimise SRE practices, tools, and methodologies to enhance overall system reliability and team efficiency. Observability and Automation: Contribute to the design, implementation, and maintenance of observability and monitoring solutions to track the platform health, its cost-effectiveness, the reliability, and More ❯
and Kurbernetes Deployment and automation using tools such as Jenkins, GitHub, GitHub Actions, CodePipeline, and CodeDeploy Responsible for DevOps CI/CD pipelines Monitoring & Observability using monitoring solutions like AppDynamics and CloudWatch for proactive system health monitoring. Troubleshooting Java/JVM applications, heap dump analysis, handled memory leakage issues Key … DevOps CI/CD : Proficiency in CI/CD tools including Jenkins, GitHub, GitHub Actions, CodePipeline, and CodeDeploy for streamlined deployment and automation. Monitoring & Observability: Familiarity with monitoring solutions like AppDynamics and CloudWatch for proactive system health monitoring. Ticketing & Collaboration Tools: Experience with Jira and ServiceNow for incident management and More ❯
and Kurbernetes Deployment and automation using tools such as Jenkins, GitHub, GitHub Actions, CodePipeline, and CodeDeploy Responsible for DevOps CI/CD pipelines Monitoring & Observability using monitoring solutions like AppDynamics and CloudWatch for proactive system health monitoring. Troubleshooting Java/JVM applications, heap dump analysis, handled memory leakage issues Key … DevOps CI/CD : Proficiency in CI/CD tools including Jenkins, GitHub, GitHub Actions, CodePipeline, and CodeDeploy for streamlined deployment and automation. Monitoring & Observability: Familiarity with monitoring solutions like AppDynamics and CloudWatch for proactive system health monitoring. Ticketing & Collaboration Tools: Experience with Jira and ServiceNow for incident management and More ❯
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
Embarcaderomediagroup
of our engineering operations, bringing together SRE principles and modern platform engineering practices. This includes combining principles of SRE - such as service-level reliability, observability, incident response - with platform engineering practices like GitOps, Infrastructure as Code, DevSecOps automation, and self-service enablement, to help development teams ship faster, safer, and … more cost-efficiently. What you'll be doing: Designing and operating highly reliable, scalable, and secure Azure-based platforms Applying SRE principles like SLOs, observability, and incident management to drive service reliability Building Infrastructure as Code using Terraform (v1.7+) and GitOps workflows Enabling teams through platform tools, reusable Terraform modules … DB, etc.) Strong Infrastructure as Code skills with Terraform (v1.7+) Experience with CI/CD pipelines, GitOps, and automation tools (PowerShell, Bash) Familiarity with observability and incident tools like Datadog, ELK, and synthetic monitoring Solid understanding of networking (TCP/IP, Load Balancing, DNS, Routing) Good knowledge of DevSecOps practices More ❯
API strategy, and front-end development. Driving Automation & DevOps Best Practices - Champion CI/CD pipelines, cloud transformation, and platform engineering principles, ensuring robust observability, security, and cost efficiency. Engineering Governance & Innovation - Continuously improve engineering processes, team productivity, and technical debt reduction, driving a shift-left approach and best-in … Python, Spark, Git (GitLab), Docker, Kubernetes and Cloud development (GCP & Azure). Experience working with React.js Strong knowledge of DevOps, CI/CD pipelines, observability, and cloud security best practices. Ability to drive engineering strategy, process improvements, and high-velocity agile execution. Experience hiring, mentoring, and leading global teams across More ❯
Bristol, Avon, South West, United Kingdom Hybrid / WFH Options
Hargreaves Lansdown Asset Management Limited
Excited to grow your career? Our purpose is to empower people to save and invest with confidence. We are looking for great people to join us, so please come and invest in YOUR future at HL. We know that sometimes More ❯
bristol, south west england, united kingdom Hybrid / WFH Options
Hargreaves Lansdown Asset Management Limited
Excited to grow your career? Our purpose is to empower people to save and invest with confidence. We are looking for great people to join us, so please come and invest in YOUR future at HL. We know that sometimes More ❯