Swindon, England, United Kingdom Hybrid / WFH Options
Edenred Finland Oy
services on the AWS stack such as EC2, Route53, S3, RDS, Dynamo DB, SNS, SQS, IAM, while focusing on high-availability, fault tolerance, and auto-scaling. Setup AWS infrastructure monitoring through CloudWatch and applicationperformancemonitoring through App Dynamics. Why PayTech? We are a subsidiary of the Edenred group and leaders in prepaid solutions including banking … Complimentary fruit and other ‘in office’ snacks & refreshments Volunteering programme Social events Pension Scheme – employer 6% with minimum employee contribution 3% Discretionary bonus scheme based on company and personal performance Holiday trading scheme (buy or sell up to 5 days) Medical & international travel cover (leisure and action sports) Season ticket loan Cycle to Work scheme Diversity: Edenred UK and More ❯
London, England, United Kingdom Hybrid / WFH Options
Docebo
and Escalation process; Monitor metrics and develop ways to improve observability; Diagnose & troubleshoot service incidents & outages, with the capability to do urgent code fixes when needed on the various application services; Day by day operations to maintain and evolve the Docebo multi-tenant infrastructure; Build and maintain effective working relationships with colleagues, technical staff, and customers to identify issues … incidents, problems, and system changes; Keep your eyes and ears open for ways we can further streamline our development process. Perform periodic on-call duty, maintaining the availability and performance of the Docebo Platform. Requirements: Experience with infrastructure management and operations; Good working knowledge of Linux and Docker, and familiar with LAMP stack; Experience with AWS Cloud services; Experience … with Bash or Python scripting; Experience with CI/CD tools such as Gitlab CI; Familiar with applicationperformancemonitoring tools like Datadog, New Relic; Familiar with Docker orchestrators such as Amazon ECS or Kubernetes; Familiar with Git; Ability to solve issues with clear methods while knowing when to take intuitive leaps. Nice to have Knowledge of More ❯
unique place to work. HOW WILL YOU MAKE A DIFFERENCE? We are looking for a Site Reliability Engineer (SRE) who will be responsible for ensuring the reliability, scalability, and performance of our cloud infrastructure and applications. This role focuses on maintaining and improving system observability, automating operations, and enhancing deployment practices to support business-critical services. Reporting directly to … the Lead Site Reliability Engineer, you will be expected to work independently while collaborating closely with engineering and operations teams. You will be responsible for implementing and maintaining monitoring and logging solutions while producing clear documentation to support the cloud environment. Continuous learning and improving performance based on set targets will be expected. VARIED DAY TO DAY RESPONSIBILITIES … Ensuring system reliability, performance, and scalability through monitoring and automation Building and maintaining observability solutions using Grafana, Prometheus, Loki, OpenTelemetry Proactively identifying and resolving performance bottlenecks and infrastructure issues Automating infrastructure provisioning, configuration management, and deployments Implementing effective logging, monitoring, and alerting strategies Managing incident response and post-mortem processes to improve system resilience Implementing high More ❯
business areas of the company effectively. The client needs people with solid experience and understanding of product and services development, from initial design, through iterative development cycles, tuning for performance and scalability, and pre- and post-sales support integrating with customer platforms. Skills Core product built on IBM Cloud, using Cloud Foundry, Postgres database services, Watson NLU and Visual … Recognition, Cloud Object Storage, Cloud Functions, Kafka streams, DevOps Toolchains, Akamai CDN Node.js – for REST service APIs and interactive UI (using Vue.js) Platform monitoring and alerting with Sysdig and LogDNA Integration with news products on AWS platform (using S3, Lambda functions, DynamoDB/Kinesis streams, Postgrad database service, CloudFormation) Other things they would like to see: Unit and integration … testing experience (TDD and BDD even better) Applicationperformancemonitoring and optimisation Integration of 3rd party components and services Experience with NoSql databases and caches (e.g. Redis) Familiarity with microservices architectures, containers and stream processing Any experience with other programming languages (e.g. Java, C++, Python) Package Description 25 days and bank holidays Pension plan Company cycle/ More ❯
and supported Update our software dependencies as required, including language and framework upgrades Identify when our test coverage is lacking and enhance as appropriate Maintain, update and build our application containers Deploy to our non-production and production environments through our deployment process Build and maintain a personal knowledge base of our infrastructure and associated technologies Monitor our infrastructure … framework Proficient at Javascript (native and jQuery) Proficient at HTML, CSS & JavaScript preprocessors (HAML/SASS/SCSS) Experience of maintaining and operating a production Ruby on Rails SaaS application Experience of working in a busy, multi-functional and collaborative development team Experience of working in an agile development environment Experience of planning and prioritising a variable workload and … delivering to deadlines Experience with using source code management, such as Git and GitHub, as part of a collaborative workflow Experience with ApplicationPerformanceMonitoring tools such as AppSignal and New Relic to identify and investigate issues Experience in implementing scalable REST APIs using Ruby on Rails Working in technology in a changing industry means an ability More ❯
by simplified data access and management at scale with AI. These solutions include Asset and Risk intelligence, Attack Analysis, Orchestration Automation and Response, User Behavior Analytics, SIEM Enterprise Security, ApplicationPerformanceMonitoring, Infrastructure Monitoring, Log Analysis, Incident Response, Network Monitoring, Business Risk Observability, AIOps, Digital Experience Monitoring and the ecosystem continues to expand and More ❯
by simplified data access and management at scale with AI. These solutions include Asset and Risk intelligence, Attack Analysis, Orchestration Automation and Response, User Behavior Analytics, SIEM Enterprise Security, ApplicationPerformanceMonitoring, Infrastructure Monitoring, Log Analysis, Incident Response, Network Monitoring, Business Risk Observability, AIOps, Digital Experience Monitoring and the ecosystem continues to expand and More ❯
ll have the unique opportunity to shape our SRE strategy, establish best practices, and set the standard for service reliability and performance. What You’ll Do Define strategies for ApplicationPerformanceMonitoring, Unit Cost, and Chaos Engineering. Continuously optimize production environments to enhance reliability and efficiency. Implement and apply MTTR, SLO, and SLI principles to ensure high … A collaborative mindset – you thrive on helping others and driving company-wide impact. Nice to Have Experience working in regulated industries (e.g., PCI compliance). Background in capacity planning, performance, and load testing. Sysadmin skills for troubleshooting disk, network, and infrastructure issues. Why Join Thredd? The chance to define and lead SRE best practices from the ground up. A More ❯
advertising and other commercial components across a range of digital products. Requirements We are looking for someone with solid experience and understanding of web development (Frontend UX and deeper application development) and ad tech. The tech stack includes: HTML5 and CSS3 Mocha, Karma, Chai, Sinon, Nightwatch, Jest Drone CI, Docker Google Ad Manager (formerly Doubleclick for Publishers, DfP) Escenic … Java-based CMS Other things we’d like to see Unit and integration testing experience (TDD and BDD even better) Web applicationperformancemonitoring and optimisation A good understanding of ad serving concepts and ecosystem Use of Ad Manager/DfP – via the publisher console and APIs, template and wrapper code Integration of 3rd party components and … services Capture and use of data to inform product development (Web Analytics, Real User Monitoring, A/B and Multivariate testing, Ad Manager reporting tools, etc) Exposure to near real-time; creation of reporting and data visualisation tools Experience with other programming languages (e.g. Java, C++, Python) Package Description 25 days + 8 bank holidays Company cycle/car More ❯
Leveraging cloud native tools and enabling the developers to instrument, analyse, and monitor the application. Permanent position, Hybrid in London. Responsibilities Responsibilities involve designing, deploying, and maintaining cloud-native monitoring services that are both elastic and resilient to failure across AWS. It is also fundamental to establish standards and best practices for the instrumentation of container-based services and … The maintenance of their pipeline is key to ensure that notifications are well-timed, accurate, and directed to the appropriate channels. Automation is a priority, as it allows the monitoring platforms to scale smoothly, promoting a self-service approach. Requirements • Strong Infrastructure as Code skills, ideally with Terraform and Kubernetes. • Strong knowledge of modern logging tool sets, including Logstash … or Fluentd. • Understanding of Prometheus and its ecosystem, including Alertmanager. • Good knowledge of ApplicationPerformanceMonitoring tools and crash reporting tools, such as Sentry. • Good knowledge of cloud provider managed services, and how they can be leveraged in our context. • Ability to write high quality code in Python, Go, or equivalent languages. #J-18808-Ljbffr More ❯
optimising efficient and scalable systems to support the backend infrastructure for our mobile game applications. This role involves designing and implementing server-side logic, managing databases, and ensuring game performance for large numbers of players. As a member of the backend team, you will work closely with other Server Programmers, Data Engineers, and Designers to assist in code design … for the efficient storage, access, and management of game data. You will also implement security measures to protect our games from potential threats. You will work to improve server performance, reduce latency, and handle large player loads, sometimes analysing player behaviour to identify bottlenecks. We are seeking an exceptional Server Programmer to join our incredibly talented team and contribute … based infrastructure to support large-scale mobile games. Design and implement RESTful APIs accessible by client applications. Assist in making technology decisions and integrating third-party services. Monitor code performance and optimise existing code solutions. Create and maintain technical documentation. Essential skills Approximately 3-5 years of experience in a similar role. A good understanding of web development and More ❯
ensuring that our service offerings are scalable, cost-effective, and aligned with industry best practices . What You'll Love Architect and enhance Asda's enterprise observability solutions, including ApplicationPerformanceMonitoring, Logging, Monitoring & Alerting, and Dashboarding . Define and optimize incident management processes , focusing on tooling, integrations, automation, and evolving ServiceNow solution design . Support … Data Architect locations Asda House time type Full time posted on Posted 9 Days Ago time left to apply End Date: June 28, 2025 (5 days left to apply) Application Solution Architect locations Asda House time type Full time posted on Posted 5 Days Ago time left to apply End Date: July 2, 2025 (9 days left to apply More ❯