Senior Data Engineer About us HENI is a technology company pioneering art markets and information. We work with world-leading artists and estates across various sectors, including printmaking, book publishing, NFTs, digital content, and art research – all underpinned by cutting more »
technical POC for the company. Work closely with key decision makers in the company. Requirements: Experience with cloud technologies and DevOps principles. Familiarity with Observability concepts and hands-on experience with tools like Prometheus, OpenSearch, etc. Proficiency in delivering presentations in fluent English; proficiency in additional languages is a plus. more »
engineering mindset is crucial, so we're much more interested if you can talk about your successes (and failures) in tackling problems like latency, observability, performance tuning etc. If you can also discuss a track record of building B2B SaaS products, that would be excellent. They are a startup and more »
team of versatile, skilled developers and product/design teams. ⭐ You'll set and maintain best practices for the team around development, security, testability, observability and performance, whilst also identifying and resolving issues as a wider team. ⭐ The development team pride themselves on a family-feel culture, prioritising excellence over more »
ownership of tasks within specific projects and/or influence their direction. Functions, Duties, Tasks: Oversee operational health across multiple domains: knowledge, DevOps, security, observability, cost, improvement, and portfolio processes. Lead the team in work planning and prioritization, refining a continuously prioritized backlog, and identifying improvement opportunities. Act as the more »
3rd party systems, coordinating with the relevant suppliers and internal teams. Make sure your system integration can be diagnosed, building in observability where practical, so that faults can be resolved and the integration remains maintainable. Determine that the outcome of the integration has met the organisation's needs. Provide Technology Operations teams more »
shape how everything runs at THINKalpha and be a leading voice in how we work and build our infrastructure. Your Work Configure and maintain observability tooling with Datadog and PagerDuty (Slack channels) Contribute to our IaC codebase by creating and maintaining Terraform and Ansible modules, and participate in the review … tools. Experience with both on-premise/colocated servers as well as cloud infrastructure, and hybrid deployments spanning both types of environments. Experience with observability platforms (e.g., Datadog) and alarm systems (e.g., PagerDuty) Nice to have: Coding background in at least one language (Node, JavaScript, Python, C++, etc) Understanding of more »
Leeds, Yorkshire, United Kingdom Hybrid / WFH Options
Seargin
to improve productivity. Experience of integrating with API services Sound knowledge of mobile applications - SOLID principles and design patterns Working experience in monitoring and observability, automated CI/CD pipelines Role - Hybrid Mobile iOS developer Location - Leeds Duration - 6-12 months Rate - £500 to £550 a day, Inside IR35 all more »
and at scale. Play a critical role in evolving our infrastructure to address complex technical challenges related to reliability, latency, bandwidth, and security. Improve observability, monitoring, and alerting throughout the platform. Coordinate work across different areas of the company to ensure efficient execution. Centralize common streams of work currently duplicated more »
a vital role in maximising the value of data. Our client is looking for a hands-on Data Governance Manager to improve the observability and usability of data within the organisation and to ensure compliance with group policies. You’ll be expected to drive these outcomes, designing and deploying more »
experience in a similar role Extensive leadership experience in technology services, projects, and products within complex global environments. Proficiency in data governance, engineering, and observability, with a focus on public cloud services like Azure and Google. Expertise in data tools and platforms, preferably those within the specific portfolio. Familiarity with more »
Dynatrace and has demonstrable experience (at least 3 years) as a Site Reliability Engineer. The Site Reliability Engineer (SRE) will take ownership of the observability suite, leveraging deep DevOps skills and experience to proactively enhance the performance and stability of APIs and applications. This role will play a crucial part … ensuring reliability and scalability including managing APM tools such as Dynatrace or New Relic. Main Responsibilities as Site Reliability Engineer: Take ownership of the observability suite, including monitoring, logging, and alerting tools, to ensure comprehensive and holistic visibility into system performance and health. Configure and manage APM tools such as … expertise in monitoring, logging, and alerting tools such as Prometheus, ELK stack, Grafana, Azure Monitor etc., with the ability to take ownership of the observability suite. Experience managing APM tools such as Dynatrace or New Relic, utilizing their capabilities to monitor application performance effectively. Deep understanding of DevOps principles and more »
Oxfordshire, South East, United Kingdom Hybrid / WFH Options
La Fosse Associates Ltd
Monitoring and Observability Engineer Salary - £50,000 - £55,000 - Fully remote role! Principal Accountabilities Design, implement, and manage monitoring solutions to ensure the availability, performance, and reliability of our systems. Collaborate with cross-functional teams to understand system requirements and implement effective monitoring strategies. Utilise expertise in LogicMonitor, OpenSearch … Proficient experience with other monitoring tools such as Dynatrace, New Relic, Splunk, Datadog, Nagios, Prometheus etc. Take ownership of the development of monitoring and observability practices Benefits include: 25 days holiday + statutory Competitive pension match Car allowance Family health care more »
organisation We are seeking a Senior SRE to join our client as their first SRE and play a pivotal role in constructing a comprehensive observability platform. If successful, you will be responsible for designing, deploying, and maintaining a system that grants visibility into their IT infrastructure and operations. Your Role … Architect and implement a comprehensive observability and traceability platform. Identify and address gaps in monitoring coverage, collaborating with cross-functional teams to implement solutions. Proactively identify and remediate system performance issues. Develop and implement strategies to enhance system reliability and scalability. Partner with stakeholders to define and configure alerting mechanisms. more »
London, England, United Kingdom Hybrid / WFH Options
ITV Jobs
defining and following software standards and processes, from peer code reviews to coding standards. · Follow DevOps and DevSecOps best practices to ensure successful delivery, observability, operation and security of software in production. · Work with test and operations teams to troubleshoot, resolve issues and write unit and automated functional tests. Process …/CD tooling. · Worked with source control systems such as GitHub. Other things that will make you stand out (key criteria) · Experience with a mainstream observability platform (such as Datadog). · Knowledge of the UK broadcast industry & broadcast/OTT advertising market. · Knowledge of the digital marketing and advertising industry. · Knowledge more »
and help us maintain our hosting platform. Creating and improving routes to live with automation including blue/green & canary strategies. Configure and improve observability controls. Proving scalability/resilience and security controls. Sustain and improve the process of knowledge sharing throughout the engineering teams. About us… Like the modern … experience configuring & running production workloads in Kubernetes CI/CD & IaC tools like Jenkins, Terraform, Sonar, Nexus, Git, Spinnaker, Harness Strong understanding & experience of Observability, SRE, DevSecOps & FinOps Good understanding of cloud networking & connectivity patterns Good understanding of key data tooling such as Kafka, Bigtable, Dataproc, BigQuery etc. It would more »
/SRE team. The Lead Site Reliability Engineer will lead the charge in selecting, configuring, and supporting Cloud Platform components and tooling. Proficiency in observability tech such as Grafana and Prometheus is essential. An ability to self-manage in both Agile and traditional delivery approaches is a key asset The … will be paramount for collaborating with stakeholders and mentoring team members. Key Skills Experience with GCP, AWS or Azure Leadership/management experience Terraform Observability tech such as Grafana/Prometheus Background in software engineering is an advantage If you are interested in the role please apply! We are an more »
Winchester, Hampshire, United Kingdom Hybrid / WFH Options
Context Recruitment
work closely with cross-functional teams to build and maintain a robust infrastructure that supports their dynamic needs. Key Responsibilities: Assume responsibility for the observability suite, encompassing tools for monitoring, logging, and alerting, to guarantee a thorough and integrated understanding of system functionality and health. Set up and oversee APM … Experience in a DevOps/Site Reliability Engineer ( SRE ) position, dedicated to ensuring the high availability, reliability, and scalability of live systems. Proficient in observability tools like Prometheus, ELK stack, Grafana, and Azure Monitor, capable of fully managing the suite for optimal system oversight. Skilled in operating APM tools such more »
slicing. Develop and maintain backend services using Node.js. Utilize Linux, AWS, serverless technologies, message queues, relational and NoSQL databases, containers, Infrastructure as Code, and observability tools in development processes. Advocate for and practice Test-Driven Development (TDD) and paired programming. Embrace lean and agile methodologies, ensuring the delivery of small … public-facing APIs and microservice architectures. Familiarity with AWS or GCP, Docker, Linux, and C# is desirable. Experience with CI/CD pipelines, scripting, observability, and data engineering is a plus. Understanding and advocacy of lean software development principles. Personal Attributes: Passionate about software development with a focus on quality. more »
all the pieces of the puzzle that fit together to make a secure, repeatable and scalable continuous delivery pipeline. Everything from secrets management to observability, you’ve got a go-to toolchain that you have proven to work all the way through to production. Pragmatic & Versatile Architecture Skills: You’re … looking for specific experience of administrating and provisioning Grafana Loki, and/or the LGTM stack or similar Infrastructure automation with Terraform or CloudFormation. Observability and the associated toolchain and techniques Containers and container orchestrators: Kubernetes, Istio Preferably experienced using OpenTelemetry and tools like Honeycomb History of client facing or more »
Cilium as a robust Container Network Interface (CNI) solution. You'll collaborate closely with DevOps teams, network architects, and security professionals to enhance network observability, security, and performance. Key Responsibilities Deployment and Configuration: Deploy Cilium as a DaemonSet into Kubernetes clusters. Configure Cilium to leverage eBPF for efficient packet processing … network policy enforcement, and observability. Ensure seamless integration with existing CNI solutions. Network Observability: Utilize Hubble (Cilium's observability companion) to monitor network activities in real time. Leverage eBPF to gain insights into network flows, security policies, and process behavior within Kubernetes workloads. Performance Tuning: Optimize Cilium's performance by … Cilium and eBPF in Kubernetes environments. Technical Skills: Proficiency in Kubernetes networking concepts. Strong understanding of eBPF technology and its applications. Familiarity with network observability tools (e.g., Hubble). Scripting skills (e.g., Python, Go) for automation. Certifications: Cilium Certified Engineer (CCE) or equivalent certification is a plus. Collaboration: Excellent communication more »
West London, London, United Kingdom Hybrid / WFH Options
Daniel James Resourcing Ltd
two. Demonstrates a keen understanding of AWS and other cloud costs, attributing them to specific teams and services. Possesses extensive knowledge and experience in observability, including best practices, implementations, and familiarity with observability vendors. Champions diversity and inclusion, fostering a culture of innovation, teamwork, and self-improvement. Leads by example more »
to join their London technology team. This is a new role for the team and they are looking to hire someone with very strong observability and monitoring skills. The technology stack is rapidly evolving across the company and all of the infrastructure and application stack is now being built using … Microservices architecture. The successful Production Engineer will be embedded within some of the core development teams, finding new ways to improve monitoring and observability workflows using tools like Prometheus and Grafana. This role will involve direct interaction with traders, portfolio managers and senior stakeholders across the business so strong leadership … and interpersonal skills are required for this position. Key Requirements: Expert scripting skills with PowerShell, Bash, Python etc. In-depth knowledge of monitoring & observability tools such as Prometheus, Grafana and OpenTelemetry Strong knowledge of CI/CD tooling Experience with metrics and tracing instrumentation, such as LGTM and PromQL Knowledge more »
Reigate, Surrey, South East, United Kingdom Hybrid / WFH Options
Client Server
collaborate across product focussed Agile engineering teams to ensure the reliability, availability and performance of client facing services. Responsibilities will include managing and configuring observability platforms such as Datadog and PagerDuty to provide proactive monitoring of production (and other) environments, design and implementation of automation processes to drive efficiencies, leading … a similar SRE/Site Reliability Engineer position You have experience of running 24x7 services in the public cloud - Azure You have experience with observability tools such as Datadog and PagerDuty You have a good knowledge of Containerisation - Kubernetes, AKS You have strong scripting skills for automation, PowerShell or Python more »
About the job: We are seeking a dynamic and experienced Observability Engineer with expertise in any cloud, Grafana/Prometheus/Datadog Role & Responsibilities * Develop and improve instrumentation for monitoring and logging the health and availability of services. * Proactively monitor systems, networks, and applications to provide input into improving the more »