systems, support and optimize CI/CD pipelines, and determine optimal solutions for the company’s products. You’ll collaborate closely with development, DevOps, and other teams to maintain high uptime, security, and user experience standards for millions of endpoints. Experience and Education: Bachelor's or higher degree in Computer Science, Information Systems, Information Technology, or a related technical … Observability tools (New Relic, DataDog, Splunk) Scripting (Ansible, Bash, Python, GO) CI/CD Primary Job Responsibilities: Design and support EC2/ECS/EKS/Fargate environments for highavailability and fault tolerance. Implement advanced AWS features (Route53, ALB/NLB, multi-region setups) to ensure global reliability. Maintain and optimize the existing CI/CD pipelines … real-time monitoring dashboards and alerting systems to track system health, capacity, and performance trends. Work closely with development teams to fine-tune infrastructure for cost efficiency while maintaining high performance. Ensure business continuity by designing and maintaining robust backup, failover, and disaster recovery solutions. #J-18808-Ljbffr More ❯
based on performance (Long-term project) Job Responsibilities Design, develop, and maintain robust and scalable backend systems using .NET Core and C#. Build and maintain microservices-based architectures, ensuring highavailability, performance, and scalability. Develop and integrate RESTful APIs and web services to support front-end and external systems. Implement event-driven solutions using messaging systems like Apache More ❯
orchestration and managing stateful services. * Proficiency in Linux systems and Shell Scripting. * Hands-on experience with Terraform, Helm, and CI/CD pipelines. * Proven experience with building or supporting highavailability, mission-critical systems. * Strong grasp of authentication and security concepts (OAuth, SAML, OpenID, SCIM, Kerberos). * Familiarity with monitoring tools, agent-based architectures, alerting, and dashboard creation. … to cutting-edge technologies in the cloud-native, data infrastructure space. * Join a collaborative team working across global offices. * Access to ongoing training, progression, and mentorship in a complex, high-impact environment. * Work in a centrally located office with excellent on-site amenities, including gym and restaurant. Interested in delivering high-impact infrastructure at scale? Apply now and More ❯
on, cross-functional, and central to our product and research success. Key Responsibilities DevOps & Infrastructure Design, implement, and maintain infrastructure on AWS and Google Cloud Platform (GCP) to support high-performance computing workloads and scalable services. Collaborate with R&D teams to provision and manage compute environments for model training and experimentation. Maintain/monitor systems, implement observability solutions … e.g., logging, metrics, tracing), and proactively resolve infrastructure issues. Manage CI/CD pipelines for rapid, reliable deployment of services and models. Ensure highavailability, disaster recovery, and robust security practices across environments. Data Engineering Build and maintain data processing pipelines for model training, experimentation, and analytics. Work closely with machine learning engineers and researchers to understand data … ingestion, transformation, and storage using tools such as Scrappy , Playwright , agentic workflows (e.g. crawl4a i) or equivalent. Optimize and benchmark AI training/inference/data workflows to ensure high performance, scalability, cost and an exceptional customer experience. Maintain data quality, lineage, and compliance across multiple environments. Key Requirements 5+ years of experience in DevOps , Site Reliability Engineering , or More ❯
our client's data and our own content is a key differentiator. Your responsibilities will include, but are not limited, to: Software development and integration Design, develop, and maintain high-quality software solutions using Microsoft Azure services and other appropriate tooling. • Implement and manage integrations between various systems and platforms including MS PowerPlatform and third party legal-tech solutions. … with the ability to troubleshoot and resolve complex technical issues. Communication Excellent verbal and written communication skills, capable of conveying technical concepts to non-technical stakeholders. Attention to detail High level of attention to detail to ensure code quality and accuracy. Agility and adaptability Ability to work effectively in a team environment and collaborate with cross-functional teams and … Tools: Experience with development tools and methodologies such as Agile, Scrum, and DevOps practices. Should have: • Knowledge and interest in data science and analytics. • Experience with sizing, performance and highavailability • Ability to absorb large amounts of information and create sense and simplicity. • Confidence and fluency in the language and tools of agile, lean and design • Expertise and More ❯
Proactively monitor and report on system capacity and performance. Provide 2nd and 3rd line technical support for Linux and IBM-Power platforms. Lead and contribute to infrastructure projects, delivering high-quality solutions aligned to business needs. Ensure availability of mid-range platforms, resolving service-affecting issues as necessary. Implement best practices across Linux platforms to meet availability … sites and participate in out-of-hours support as part of a rota (37.5 hour week). IBM Power, AIX, VIO, NIM, CMC/HMC administration. Designing and supporting highavailability architectures. Experience with public cloud environments (Azure and/or AWS). Job scheduling tools such as Redwood Cronacle/RunMyJobs. Understanding of project methodologies such as More ❯
Glasgow, Lanarkshire, United Kingdom Hybrid / WFH Options
Scope AT Limited
infrastructure. This is a hands-on engineering role, requiring a mix of database product expertise and development skills, particularly in Python. You will play a key part in building high-performance, highly available data solutions with a strong emphasis on automation, security, and operational efficiency. Key Responsibilities Design and deploy secure, compliant infrastructure integrated with organizational controls. Build and … Infrastructure as Code tools. Provide architecture input for highly available, production-grade systems. Document and optimize support processes for Postgres-based services. Develop monitoring and alerting systems to ensure highavailability and performance. Key Skills & Experience Strong Python development skills (essential) Hands-on experience with Postgres in production environments Expertise in Kubernetes and container orchestration Solid Linux system … authorization standards (SAML, SCIM, OAuth, OpenID, Kerberos) Strong understanding of DevOps practices and Agile methodologies Ability to develop and manage system monitoring tools and dashboards Experience with large-scale, high-velocity OLTP systems and NewSQL architecture Additional Information This role is offered on a PAYE basis , excluding holiday pay accrual. Candidates must have access to their own device for More ❯
Glasgow, Scotland, United Kingdom Hybrid / WFH Options
JR United Kingdom
includes: Designing and maintaining modern CI/CD pipelines (GitHub Actions experience is a big plus) Implementing Infrastructure as Code (Terraform) Supporting deployments of PHP/Laravel applications in high-concurrency environments Working with Docker, Kubernetes, ECS or EKS Automating development workflows and driving performance optimisations Building out monitoring solutions, cost management strategies, and SOC2-compliant processes ? Skills & Experience … a DevOps-focused role Strong cloud background (AWS, Azure, or GCP) Proficiency in Terraform, Docker, Python or Bash scripting Solid experience with infrastructure performance, security, and scaling Comfortable in high-availability, fast-paced environments Understanding of SOC2 compliance within DevOps workflows If you're passionate about automation, performance, and scalable systems and love solving problems with a proactive More ❯
Glasgow, Scotland, United Kingdom Hybrid / WFH Options
JR United Kingdom
commercial broadcast radio and digital audio company, leading the industry across nine markets, including the UK. Our innovative approach ensures we remain at the forefront of audio advertising, delivering high-quality content and pioneering solutions to millions of listeners. Our mission is to engage audiences with compelling audio experiences, driven by data and innovation. Our Team: SharpStream is part … our team of 10, we’re looking for an 11th member to join us as a Linux DevOps Engineer . You will play a critical role in ensuring the highavailability and performance of Sharp-Stream’s audio streaming platform by designing, maintaining, and optimising our Linux-based infrastructure. This includes modernising systems into a container environment, automating … processes to help maintain our 99.995% SLA, and managing high-visibility, production-critical systems. This role offers significant autonomy, with the freedom to innovate within a smaller, tightly knit team, and provides exposure to a wide variety of technologies and clients in the audio space Key Responsibilities: Design and implement scalable Linux systems using infrastructure as code (primarily Ansible More ❯
and maintain core systems and infrastructure to ensure optimal performance and reliability. Deploy and manage operating systems, applications, databases, and system management tools. Take a proactive approach to maintaining highavailability across systems and services. Analyze application performance, troubleshoot issues, and collaborate with developers to resolve potential bottlenecks. Develop and implement strategies for system security, data backup, and More ❯
and maintain core systems and infrastructure to ensure optimal performance and reliability. Deploy and manage operating systems, applications, databases, and system management tools. Take a proactive approach to maintaining highavailability across systems and services. Analyze application performance, troubleshoot issues, and collaborate with developers to resolve potential bottlenecks. Develop and implement strategies for system security, data backup, and More ❯
Glasgow City, Scotland, United Kingdom Hybrid / WFH Options
Montash
Splitters, Aggregators, and WireTaps. Ensure batch reliability and transactional integrity across integrations. Oversee CI/CD pipelines and infrastructure automation. Deploy and scale integrations with Docker and Kubernetes. Ensure highavailability and resilience across all messaging components. Apply secure practices using encryption, OAuth2, JWT, and TLS. Monitor integrations with audit trails, retries, DLQs, and reprocessing logic. Required Skills More ❯
administration, user support, and development. Main Duties and responsibilities 1. Perform daily operations and maintenance tasks for Windows and Linux systems supporting TRE in a secure enterprise environment, ensuring highavailability and performance of research services and infrastructure. 2. Manage TRE server infrastructure including liaising with Vendor Support (e.g., Dell, Broadcom, DataVita, Vyatta), Network Connectivity (University Information Services … 7. Managing ESXi (Dell PowerEdge) server resources, including modification of RAM, GPU, CPU, Storage Disks when necessary to ensure uninterrupted service to the users. 8. Monitoring performance, health, and availability of systems for TRE Workspace services ensuring minimal downtime and optimal user experience. 9. Support TRE Workspace services user requirements definition and tool deployment including RStudio, Jupyter Notebooks, Python More ❯
Champion diversity, equity, inclusion, and respect within the firm. Possess deep expertise in cloud implementations and security controls. Adopt newer technologies including AI and ML. Implement reliable platforms with highavailability (three nines or more). Maintain a problem-solving mindset. Required Qualifications, Capabilities, and Skills Formal training or certification in software engineering and extensive applied experience. Experience More ❯
more than just a support role - you'll be at the heart of designing, maintaining, and evolving our infrastructure across global datacentres and local sites. If you thrive in high-impact environments and enjoy solving complex technical challenges, we want to hear from you. What You'll Do Lead the design, support, and maintenance of critical infrastructure services. Ensure … highavailability, scalability, and compliance across global systems. Own the lifecycle of server environments (Rocky Linux, Windows Server, VMWare/Nutanix). Drive major infrastructure projects - from refreshes to site migrations. Act as a senior escalation point for technical issues and service requests. Collaborate with business units to modernize and secure IT architecture. Maintain documentation and inventory and More ❯
more than just a support role — you’ll be at the heart of designing, maintaining, and evolving our infrastructure across global datacentres and local sites. If you thrive in high-impact environments and enjoy solving complex technical challenges, we want to hear from you. What You’ll Do As a key member of our infrastructure team, you will lead … the design, support, and maintenance of critical infrastructure services, ensuring highavailability, scalability, and compliance across our global systems. You will take ownership of the full lifecycle of server environments, including Rocky Linux, Windows Server, and VMWare/Nutanix platforms. Your role will involve driving major infrastructure projects such as hardware refreshes and site migrations, while also serving More ❯
strong leadership skills to drive the team towards achieving business objectives. You will collaborate with cross-functional teams including architecture, product, and software engineering to ensure the delivery of high-quality data solutions aligned with company goals. Requirements: 5+ years of hands-on experience in data engineering, including expertise in Python, Scala, or Java. Deep understanding of Apache Kafka … while maintaining performance and reliability. Collaborate with stakeholders to gather requirements, propose designs, and align data strategies with business objectives. Ensure system reliability and scalability, with a focus on highavailability and robust data transfer mechanisms (e.g., "at least once" delivery). Stay up-to-date with emerging technologies and evaluate their potential application to improve the overall More ❯
for deployed analytics solutions, including troubleshooting and resolving issues related to data accuracy, system performance, and user access. Monitor system health, proactively identify, and resolve potential bottlenecks, and ensure highavailability and reliability of the analytics environment. Security and Governance: Implement and enforce data security controls, ensuring compliance with regulatory requirements and industry standards. Develop and maintain data … and privacy within the analytics ecosystem. Collaboration and Knowledge Sharing: Collaborate effectively with stakeholders, including business users, data analysts, and IT teams, to understand requirements, gather feedback, and deliver high-quality solutions. Share knowledge and provide guidance to team members on OCI OBl Analytics best practices and emerging trends. Continuous Improvement: Stay up-to-date with the latest advancements More ❯
has extensive backend development experience with Java and understands what its like to work in an ever changing, Agile environment. What you’ll do: Key technical member of a high performing Claim's Agile squad accountable for shaping our products and customer journeys An expert developer, working with developers to design and build customer facing systems using Java microservices … and mentor. Improving the engineering practices in the squad including code quality, testing and CI/CD Working with SRE engineers to build distributed systems with a focus on highavailability, low latency and scalability (you build it, you run it) Senior member of esure’s engineering community, engaging in tech forums and driving tech change wider than More ❯
on proactively managing the UMAX application environment - minimising disruptions, identifying recurring issues, recommending process improvements, and actively engaging with key users to optimize their experience. You will help ensure highavailability, fast response times, and seamless user interaction with the platform. UMAX is the brand name of our software solution for water and energy utilities. It automates their More ❯
Glasgow, Scotland, United Kingdom Hybrid / WFH Options
JR United Kingdom
to teamwork and improvements, all based on industry best practices. The person will also be on-call on rotational basis during which they will be required to work on high priority incidents and other planned activities. L3 support for any BAU task (incidents/service requests/changes) ensuring SLAs are met. Assisting in resolving high priority or … Candidates should have expert working knowledge of: IBMi BRMS Backup and Restore. 3rd Party System Management and Job Scheduler Package experience (Robot, Revsoft, Halcyon). LPAR configuration. Exposure to HighAvailability Software – Management of Mimix/ICluster Software. Administration and implementation experience from 7.3 to 7.5. CL coding capability. VIOS Server skills SEA and NPIV understanding. System Firmware More ❯
with Cisco, Checkpoint and Palo Alto products Proven expertise in ACLs, network segmentation, and perimeter defence Deep understanding of Layer 2/3 networking, routing protocols (BGP, OSPF), and highavailability Strong troubleshooting skills across routing and switching protocols Excellent communication and teamwork skills At Sword, our core values and culture are based on caring about our people More ❯