Middlesex, South East, United Kingdom Hybrid / WFH Options
SKY
Working closely with the Platform Enablement team and the SRE team to drive new ideas into the product roadmaps around the deployment self-service, observability, security, and reliability Team Management Define and track key performance indicators (KPIs) to measure the success and impact of Linux initiatives. Fostering a team culture more »
serverless function runtime version upgrades) Infrastructure drift monitoring & management Softwaremaintenance (e.g. language/framework/package version upgrades) Key rotation management Tooling maintenance (e.g. observability stack) Performance and Load testing Role Requirements Strong knowledge of Microsoft technologies. Active Directory, Entra, SharePoint, 365, Windows 10/11, Intune, Application packaging (Intune more »
using Python and TypeScript/JavaScript. Strong API design skills and a familiarity with building web applications. A proponent of great testing, first-class observability and automating everything. Familiarity with security principles and practices, including authentication, authorization, and encryption. Experience with cloud technologies (e.g. Docker, Kubernetes, Helm & Terraform) and platforms more »
City of London, London, United Kingdom Hybrid / WFH Options
SKY
Working closely with the Platform Enablement team and the SRE team to drive new ideas into the product roadmaps around the deployment self-service, observability, security, and reliability Team Management Define and track key performance indicators (KPIs) to measure the success and impact of Linux initiatives. Fostering a team culture more »
About the role ThePrincipal CloudArchitect isa seniorrole, reporting to theCloud Operations Manageras part of our DevOps team within our Product & Technology business unit. The DevOpsteamis responsible forthe high level Architecture,Designand Operationof the ZellisCloud Estate,built onMicrosoft Azure andbest-in more »
understand requirements and integrate new features Lead code reviews and provide mentorship to junior engineers Ensure platform performance, security, and stability through testing and observability Evaluate and implement new technologies to enhance the platform s capabilities Troubleshoot and resolve complex issues related to the platform Define and enforce coding standards more »
services and technologies, particularly AWS, to optimize the performance and security of our cloud infrastructure. Monitor system health, performance, and availability using monitoring and observability tools, proactively identifying and resolving issues. Collaborate with cross-functional teams to troubleshoot and resolve complex infrastructure issues, minimizing downtime and improving system reliability. Mentor more »
stack development web and mobile front-end technologies such as React, Java, APIs & microservices, PostgreSQL, data structures, workflow Knowledge of Site Reliability Engineering, automation, observability, incident management, resilience, disaster recovery, high availability, documentation IAM engineering experience, authentication, authorisation, single sign-on, multi-factor authentication, user lifecycle management, hands on CI more »
infrastructure needs, ideally in a self-service environment. WHAT YOU'LL DO: Build and provide platform features for compute, authentication, service discovery and calls, observability and redundancy Deploy and maintain Ripple's multi-region, multi-provider service platforms (with an emphasis on security and resiliency) Develop tools for automation, monitoring more »
GCP) and containerization technologies (e.g., Docker, Kubernetes).Experience with configuration management tools (e.g., Ansible, Terraform) and CI/CD pipelines. Knowledge of monitoring and observability tools (e.g., Prometheus, Grafana, ELK Stack).Scripting and automation skills (e.g., Python, Bash).Excellent problem-solving and troubleshooting skills. Strong communication and collaboration skills. #J more »
scripting (Python or Shell) and familiar with CI/CD practices. Experience with Linux operating systems in high availability environments. Hands-on experience with observability tools like Grafana, Prometheus, and Splunk. Desirable Experience with public cloud platforms (AWS/GCP/Azure). Knowledge of data centre technologies and network more »
non-containerised applications on cloud and on premise Develop a high-performance cluster environment in close collaboration with research and development team Improve scalability, observability, security and performance of infrastructure Manage and support trading infrastructure operations Skills and Experience Our technology stack includes: Terraform Ansible Java Python Javascript/Typescript more »
GCP & Azure) Solid understanding of high-quality coding, testing, and development practices Preferable skills: Infrastructure as code (Eg Terraform/Cloudformation/Pulumi) Monitoring & Observability What you can expect from us We won't just meet your expectations. We'll defy them. So you'll enjoy the comprehensive rewards package more »
with a cloud provider (AWS/Azure/GCE), or sysadmin/SRE experience in data centers Experience designing, building, and operating high-scale observability or infrastructure systems Working knowledge of networking fundamentals, experience with CNIs or cloud networking infrastructure preferred What We Require 4+ years of professional software development more »
essential Experience diagnosing and troubleshooting connectivity issues Well-versed in security by design and security by default. Familiarity with SOC compliance Strong command of observability and monitoring in a multi-tenant cloud Experience working in high-performing Agile teams Proficient development in a major programming language such as C#, Python more »
will Take technical responsibility for all stages in the software development life cycle Take ownership of your code in production, including creating and managing observability, alerts and FinOps dashboards. Work with our Enablement, InfoSec and Support teams to ensure your code is secure, well-documented and supportable. Work closely with more »
lead and architecture to build high quality products Design and implement new payment systems. Seek to deliver performant and reliable products. Consider operational costs, observability, and support capabilities. Take an active part in design and architectural discussions. Own your tasks and be proud of your deliveries and work. Help with more »
for all of Canonical s core services, networks, and infrastructure Develop skills in troubleshooting, capacity planning, and performance investigation, Setting up, maintaining and using observability tools such as Prometheus, Grafana, and Elasticsearch; design, implement and maintain monitoring and alerting for various systems and services Provide assistance and work with globally more »
GCP & Azure) Solid understanding of high-quality coding, testing, and development practices Preferable skills: Infrastructure as code (Eg Terraform/Cloudformation/Pulumi) Monitoring & Observability #J-18808-Ljbffr more »
stay on , and we deliver the best possible experience to our customers internally and externally. Increase reliability in our CI/CD pipelines. Increase observability over the platform. In months three to six: Build and maintain a happy, productive and high performing team. Work collaboratively to solve complicated problems with more »
ThousandEyes, which now forms the ThousandEyes Business Unit within Cisco s Network Services Business Group, and is a foundational component of Cisco s growing Observability business. About The Role WAN Insights is the cornerstone of Cisco s Predictive Networks vision. We wanted to go beyond reacting to network glitches, we more »
a Lead DevOps Engineer Design and develop cutting edge computing infrastructure in a modern cloud environment - Azure and Kubernetes (AKS). Design and develop observability and monitoring solutions to optimize the product and end user experience. Leverage cutting edge technologies to ensure CI/CD in Azure DevOps Mentor junior more »
of experience in industry. Strong working knowledge of Golang. Experience with Kubernetes and the ecosystem of Cloud Native tools. Experience with building infrastructure with observability as a first class concept. Bonus skills Contributions to open source projects Experience using machine learning tools in production. A broad understanding of data science more »
management, and the prowess of cloud-native solutions. In your pursuit of continuous improvement, you're not solely reliant on metrics; you dive into observability metrics and user feedback, steering our technical progress with insightful analysis. Staying ahead is not just a practice; it's inherent. You're not merely more »
As part of the SRE team you will be participating in design and implementing parts of our engineering platform that enables scaling, metrics and observability, ensures and improves reliability. Identify gaps in our engineering platform that improves availability, latency, performance, efficiency, change management, monitoring, emergency response Guide and mentor other … denormalisation, CQRS-ES, Federation, etc ) Experience building and working with and monitoring microservice architectures in large distributed cloud environments (ideally AWS). Experience with Observability tooling having proficiency using tools like Elasticsearch, Kibana, APM, Sentry, Grafana, Prometheus, Overops, or similar The ability to guide and mentor other members within the more »