South East London, England, United Kingdom Hybrid / WFH Options
Xpertise Recruitment
will involve ensuring that the design of new cloud services and features is optimal, considering various functional and non-functional attributes, including performance, availability, observability, security and cost. Whilst remaining hands-on, you’ll also maintain a strategic vision, ensuring that a fast-paced development cycle converges on your chosen more »
enhancing efficiency. Enforce adherence to digital principles, ensuring the integrity, security, and compliance of solutions while meeting both functional and non-functional requirements. Embed observability into solutions, monitoring production performance, resolving incidents, and addressing underlying risks and issues. Advocate for client requirements while maintaining discretion and confidentiality. Standardise best practices more »
understanding of web development technologies, including PHP, MySQL, HTML, CSS, and JavaScript. Mastery of PHP 8.2 and Laravel 9+, emphasising a DevOps mindset, including observability, monitoring, and alerts. Proficiency in working with APIs and integrating third-party services. Excellent problem-solving skills, with an ability to troubleshoot application issues and more »
understanding of web development technologies, including PHP, MySQL, HTML, CSS, and JavaScript. Mastery of PHP 8.2 and Laravel 9+, emphasising a DevOps mindset, including observability, monitoring, and alerts. Proficiency in working with APIs and integrating third-party services. Excellent problem-solving skills, with an ability to troubleshoot application issues and more »
Developer you will sit within the Platform Engineering, you will play a vital position in delivering the future of their Salesforce & AWS delivery and observability platforms. You will get the chance to collaborate across multiple teams, understanding their business needs, and ensuring that our DevOps platforms are always aligned with more »
West Drayton, England, United Kingdom Hybrid / WFH Options
Hays
and requirements: Working as part of a team or as the engagement lead with experience in multicloud networking, SASE, datacentre networking, network simplification, automation, observability and application networking and a strong understanding and alignment of security as it relates to the previous technology areas. Leveraging this experience to deliver: Enterprise more »
South East London, London, United Kingdom Hybrid / WFH Options
Stepstone UK
proficient in Next.js; Familiar with React Native. Highly skilled in performance-oriented programming & modern software engineering toolchain technologies Experienced in leveraging design systems with observability and monitoring tools like Datadog, Grafana and Cloudwatch. Have deep knowledge of best practice in micro-frontend, microservices, event-driven architectures and responsive web design. more »
Greater London, England, United Kingdom Hybrid / WFH Options
The Thinking Traveller
thousands of pounds per transaction for our clients and partners Continuously improve our testing infrastructure and operations tools Maintain existing systems and build in observability Release new features using feature flagging and A/B testing Collaborate effectively with stakeholders to understand their needs and clients’ needs to develop our more »
of web development technologies, including PHP, MySQL, HTML, CSS, and JavaScript. Practical experience with PHP 8.2 and Laravel 9+, emphasising a DevOps mindset, including observability, monitoring, and alerts. Proficiency in working with APIs and integrating third-party services. Excellent problem-solving skills, with an ability to troubleshoot application issues and more »
Complexio is Foundational AI. This works to automate business activities by ingesting whole company data – both structured and unstructured – and making sense of it. Using proprietary models and algorithms Complexio forms a deep understanding of how humans are interacting and more »
Dublin, City of Dublin, Republic of Ireland Hybrid / WFH Options
Reperio Human Capital
The Site Reliability Engineer will be responsible for building observability frameworks. This role naturally suits a candidate who is a true tech lover and is self taught in a number of areas. Fully Remote - Ireland The client requires the successful candidate to have availability for meetings in United States time more »
Java/Kotlin) Mobile Development understanding (Swift/Kotlin) Strong OOP/Data Structure/Design Patterns understanding Cloud computing knowledge Understanding of cloud observability concepts (logging, monitoring, alerting etc.) This role is hybrid, with the office based in London. The successful hire will report to the Chief Product Officer. more »
South East London, England, United Kingdom Hybrid / WFH Options
Futureheads Recruitment | B Corp™
Development experience (Java/Kotlin)Mobile Development understanding (Swift/Kotlin)Strong OOP/Data Structure/Design Patterns understandingCloud computing knowledgeUnderstanding of cloud observability concepts (logging, monitoring, alerting etc.)This role is hybrid, with the office based in London.The successful hire will report to the Chief Product Officer.Unfortunately we more »
designing, developing, and deploying applications based on microservices. Event-Driven Systems: Hands-on experience with Apache Kafka or similar distributed messaging systems. Monitoring and Observability: Familiarity with monitoring tools like Prometheus, Grafana, or Victoria Metrics. Database Technologies: Experience working with various databases, including: - TSDB: InfluxDB, TimescaleDB - GDB: Dgraph, Neo4j This more »
Senior Data Engineer About us HENI is a technology company pioneering art markets and information. We work with world-leading artists and estates across various sectors, including printmaking, book publishing, NFTs, digital content, and art research – all underpinned by cutting more »
London, England, United Kingdom Hybrid / WFH Options
DigitalOcean
organization, our employees, like our customers, are based around the world. We want people who care about empowering their peers to understand their services. Observability Platforms is focused on creating visibility into DigitalOcean's services and infrastructure. We design, build, and operate the internal logging, metrics, distributed tracing, error reporting … of their products and services in order to improve the experiences of our customers. What You'll Be Doing: Integrating and operating open source observability software such as Elasticsearch, Prometheus, VictoriaMetrics, Alertmanager, Grafana, and Kibana. Implementing features to both improve the operability of our services and help design and implement … best utilize our systems to gain confidence that their services are performing as expected. What We'll Expect From You: Experience with open source observability systems such as Elasticsearch and Prometheus, not just as an end user but as someone responsible for running and maintaining them. Familiarity with running software more »
Manchester, England, United Kingdom Hybrid / WFH Options
bet365
Who we are looking for An Observability Engineer to be part of a team responsible for the provisioning and development of a modern observability platform. You will be administering, maintaining and evolving our observability platform while providing best practice and monitoring solutions to support all our critical products and systems. … remediation which are critical for maintaining the health and uptime of our on premise and cloud applications. Our recent move from a monitoring to observability first mind-set is a critical part of our DevOps strategy and as such, Observability Engineers will be expected to provide a suitable platform for … key insights, help to reduce mean time to detect (MTTD) and mean time to repair (MTTR) and introduce practices and standards for ensuring that observability is not an afterthought for new systems. This role is eligible for inclusion in the Company’s hybrid working from home policy. Preferred skills and more »
shape how everything runs at THINKalpha and be a leading voice in how we work and build our infrastructure. Your Work Configure and maintain observability tooling with Datadog and PagerDuty (Slack channels) Contribute to our IaC codebase by creating and maintaining Terraform and Ansible modules, and participate in the review … tools. Experience with both on-premise/colocated servers as well as cloud infrastructure, and hybrid deployments spanning both types of environments. Experience with observability platforms (e.g., DataDog) and alarm systems (e.g., PagerDuty) >Nice to have< Coding background in at least one language (Node, JavaScript, Python, C++, etc) Understanding of more »
and at scale. Play a critical role in evolving our infrastructure to address complex technical challenges related to reliability, latency, bandwidth, and security. Improve observability, monitoring, and alerting throughout the platform. Coordinate work across different areas of the company to ensure efficient execution. Centralize common streams of work currently duplicated more »
Oxfordshire, South East, United Kingdom Hybrid / WFH Options
La Fosse Associates Ltd
Monitoring and Observability Engineer Salary - £50,000 - £55,000 - Fully remote role! Principal Accountabilities Design, implement, and manage monitoring solutions to ensure the availability, performance, and reliability of our systems. Collaborate with cross-functional teams to understand system requirements and implement effective monitoring strategies. Utilise expertise in Logic Monitor, OpenSearch … Proficient experience with other monitoring tools such as Dynatrace, New Relic, Splunk, Datadog, Nagios, Prometheus etc. Take ownership of the development of monitoring and observability practices Benefits include: 25 days holiday + statutory Competitive pension match Car allowance Family health care more »
London, England, United Kingdom Hybrid / WFH Options
La Fosse Associates Ltd
Monitoring and Observability Engineer Salary - £50,000 - £55,000 - Fully remote role! Principal Accountabilities Design, implement, and manage monitoring solutions to ensure the availability, performance, and reliability of our systems. Collaborate with cross-functional teams to understand system requirements and implement effective monitoring strategies. Utilise expertise in Logic Monitor, OpenSearch … Proficient experience with other monitoring tools such as Dynatrace, New Relic, Splunk, Datadog, Nagios, Prometheus etc. Take ownership of the development of monitoring and observability practices Benefits include: 25 days holiday + statutory Competitive pension match Car allowance Family health care JBRP1_UKTJ more »
Winchester, Hampshire, United Kingdom Hybrid / WFH Options
Context Recruitment
work closely with cross-functional teams to build and maintain a robust infrastructure that supports their dynamic needs. Key Responsibilities: Assume responsibility for the observability suite, encompassing tools for monitoring, logging, and alerting, to guarantee a thorough and integrated understanding of system functionality and health. Set up and oversee APM … Experience in a DevOps/Site Reliability Engineer ( SRE ) position, dedicated to ensuring the high availability, reliability, and scalability of live systems. Proficient in observability tools like Prometheus, ELK stack, Grafana, and Azure Monitor, capable of fully managing the suite for optimal system oversight. Skilled in operating APM tools such more »
West London, London, United Kingdom Hybrid / WFH Options
Daniel James Resourcing Ltd
two. Demonstrates a keen understanding of AWS and other cloud costs, attributing them to specific teams and services. Possesses extensive knowledge and experience in observability, including best practices, implementations, and familiarity with observability vendors. Champions diversity and inclusion, fostering a culture of innovation, teamwork, and self-improvement. Leads by example more »
Manchester Area, United Kingdom Hybrid / WFH Options
bet365
for those that operate them. You will work with a wide range of technologies developing solutions, consulting with development teams and working with contemporary observability and incident management tools to assist the Business. You will be required to make effective decisions to improve the health and maintain the availability and … knowledge of SRE principles, including the creation and management of effective SLI’s and SLO’s for reliability and customer satisfaction. Knowledge of contemporary observability tools, techniques and best practice including Splunk, New Relic, Grafana and Pager Duty. Excellent knowledge of programming languages including Python, Golang and JavaScript. Knowledge and more »
collaborate across product focussed Agile engineering teams to ensure the reliability, availability and performance of client facing services. Responsibilities will include managing and configuring observability platforms such as DataDog and PagerDuty to provide proactive monitoring of production (and other) environments, design and implementation of automation processes to drive efficiencies, leading … a similar SRE/Site Reliability Engineer position You have experience of running 24x7 services in the public cloud - Azure You have experience with observability tools such as DataDog and PagerDuty You have a good knowledge of Containerisation - Kubernetes, AKS You have strong scripting skills for automation, PowerShell or Python more »