testing to identify and resolve bottlenecks. Develop and maintain platform solutions, automate infrastructure provisioning, configuration, and management tasks using Infrastructure as Code. Monitor, review, and tune databases to ensure highavailability and performance. Collaborate with product engineering teams to design and build observable software. Required Skills and Experience: Proven experience in an SRE/DevOps/Platform Engineering … Azure Solutions Architect or Certified Kubernetes Administrator are a plus. Experience in database management and performance tuning, especially MSSQL. Employee Benefits: Opportunity to be part of a well-established, high-performance SaaS company with over 30 years of history. Excellent pension scheme and life insurance. Generous holiday allowance. Supportive environment emphasizing learning and development. Work with a passionate, highMore ❯
local network environments, including Dell servers and Dell switch configurations. Strong technical expertise in Microsoft Active Directory (AD), Windows Server environments, and authentication solutions. Plan for scalability, redundancy, and highavailability to support future growth. IT Security & Compliance: Ensure compliance with security and regulatory standards, including PCI DSS, Cyber Essentials+, DORA, and ISO 27001. Implement and enforce security … best practices across infrastructure automation and cloud environments. Maintain accurate compliance documentation, including PCI DSS scope records and security policies. Secure high-value and high-risk data, such as cardholder (PCI) and personally identifiable information (PII). Cloud & DevOps Integration (these tools and skills will be taught): Implement and manage Infrastructure as Code (IaC) for cloud and on … IT audits and compliance frameworks (CIS, PCI DSS, Cyber Essentials, NIST, ISO 27001). In-depth understanding of network security and compliance in regulated environments. Proven ability to secure high-value data (PCI cardholder data, PII) and implement security best practices. Strong networking knowledge (LAN, WAN, DNS, DHCP, VPN, TCP/IP). Proficiency in firewall and load balancer More ❯
PO15, Whiteley, Hampshire, United Kingdom Hybrid / WFH Options
Stratospherec Ltd
local network environments, including Dell servers and Dell switch configurations. Strong technical expertise in Microsoft Active Directory (AD), Windows Server environments, and authentication solutions. Plan for scalability, redundancy, and highavailability to support future growth. Cloud & DevOps Integration (these tools and skills will be taught): Implement and manage Infrastructure as Code (IaC) for cloud and on-premises environments. … IT audits and compliance frameworks (CIS, PCI DSS, Cyber Essentials, NIST, ISO 27001). In-depth understanding of network security and compliance in regulated environments. Proven ability to secure high-value data (PCI cardholder data, PII) and implement security best practices. Strong networking knowledge (LAN, WAN, DNS, DHCP, VPN, TCP/IP). Proficiency in firewall and load balancer More ❯
with development teams to integrate their projects into the AWS environment and ensure the system's performance aligns with business objectives and security standards. Responsibilities: Design and implement scalable, highavailability cloud infrastructure on AWS. Migrate existing on-premises applications to AWS cloud. Manage full AWS Lifecycle, Provisioning, Automation, and Security. Configure and fine-tune cloud infrastructure systems. More ❯
Gloucester, Gloucestershire, United Kingdom Hybrid / WFH Options
Leonardo UK Ltd
based engineering team working within a highly secure environment; with responsibility for providing a bridge between software development, testing, infrastructure, operations and system administrators to facilitate efficient, continuous and high-quality software and environment delivery. The processes and tooling implemented by the Principal DevSecOps Engineer will enable source code and associated artefacts to be installed and configured on scalable … code and supporting artefacts using appropriate tools. As required, lead on software/application development activities. Plan and undertake installations on development, test, reference and operational environments. Deliver consistent high-quality software and environmental builds through automation. Creation and maintenance of automaton frameworks for software and/or environment provisioning and ongoing operation. Managing and controlling software configuration for … projects including the source repository. Implement and maintain of Continuous Integration (CI) and/or Build pipelines where appropriate. Understand change control and release management practices. Ensure highavailability and security of systems through robust monitoring, logging, and alerting. Assist in data engineering activities (data cleansing, integration, onward data analytics). Provide application support to existing deployed services. More ❯
Kafka and Kubernetes Platform Management: Design, deploy, and maintain scalable Kafka and Kubernetes clusters to support development and production environments Implement best practices for Kafka and Kubernetes operations , ensuring highavailability, performance, and security Monitor, troubleshoot, and optimize Kafka and Kubernetes infrastructure to meet development team needs Implementation: Implement cloud infrastructure components, including compute, storage, networking, and security … for performance, scalability, and cost-efficiency Implement DevOps practices for streamlined deployment and operations Troubleshooting and Support: Provide technical support for cloud infrastructure and services Troubleshoot and resolve performance, availability, and security issues Support production environments and participate in a 24x7 on-call rotation when required Requirements: Experience 7+ years of experience in designing, implementing, and managing cloud-based More ❯
Responsibilities Azure Cloud Infrastructure: Build, maintain and improve web infrastructure hosted on Microsoft Azure with a focus on performance, security, scalability and cost-effectiveness. Web Hosting & Load Balancing: Support high-availability hosting environments, including web servers, WAFs, load balancers and DNS. Automation & IaC: Use tools such as Terraform, ARM templates or Bicep to manage infrastructure as code. Security More ❯
Responsibilities Azure Cloud Infrastructure: Build, maintain and improve web infrastructure hosted on Microsoft Azure with a focus on performance, security, scalability and cost-effectiveness. Web Hosting & Load Balancing: Support high-availability hosting environments, including web servers, WAFs, load balancers and DNS. Automation & IaC: Use tools such as Terraform, ARM templates or Bicep to manage infrastructure as code. Security More ❯
Maintenance and Upgrades: Regularly update and patch web servers to ensure they are secure and performing optimally. Monitoring and Performance Tuning: Monitor server performance and troubleshoot issues to ensure highavailability and reliability. Ensure compliance with Service Level Agreements (SLAs) by proactively identifying and resolving performance bottlenecks and potential issues. Security Management: Implement and maintain security measures to … protect web servers from threats and vulnerabilities. Backup and Recovery: Manage backup and recovery processes to ensure data integrity and availability in case of server failure. Documentation: Maintain accurate documentation of configurations, procedures, and policies. Collaboration: Work closely with development and IT teams to support web application deployments and ensure smooth operation. User Support: Provide support to users and More ❯
Key Responsibilities: • Design and optimize PostgreSQL database schemas, indexes, and queries for performance and scalability. • Monitor and tune database performance using native tools and custom scripts. • Implement and maintain highavailability (HA) and replication strategies. • Collaborate with developers to refine complex queries and stored procedures. • Automate routine tasks using scripting languages (e.g., Bash, Python). • Document database architecture More ❯
solutions and implementations Experience implementing developer self-service/developer experience portals Strong experience of application modernisation and cloud migration programs Strong Linux and Windows server experience in a high-availability 24/7 operation Experience with the development and deployment of large-scale, complex technology platforms Deep understanding of GCP products across database, serverless, containerization and API … Advanced level expertise in Terraform Extensive experience in designing and implementing DevOps practices Experience with two or more CI/CD solutions Experience coaching and mentoring high-performing teams Pragmatic experience using agile to deliver incremental value Experience working in a global or multinational team setting Strong documentation, communication and collaboration skills Proven ability to drive innovation and continuous More ❯
managing cloud infrastructure (AWS, Azure, Google Cloud) At least 3 years experience in working with kubernetes environments Proficient in managing and scaling Kubernetes clusters, including monitoring, troubleshooting, and ensuring highavailability Experience with cloud-native technologies, CI/CD pipelines, and containerization tools (e.g., Docker) Familiarity with data integration and management from multiple sources in a distributed system More ❯
methodologies. Proficient automating network infrastructure configuration using Software Defined Networking Ability to have a holistic view of service development and deployment architecture Understanding of SLA requirements and role of highavailability architectures, performance optimization, and monitoring techniques Act in the capacity to support deployment and operations when necessary Set up and configure a continuous integration environment by leveraging More ❯
and implementation of our Site Reliability Engineering (SRE) program. The ideal candidate will ensure the reliability, scalability, performance, and security of Writer's critical systems, proactively guaranteeing that our high-ROI products reach customers seamlessly. Your responsibilities: Lead the design, implementation, and maintenance of Writer, Inc.'s cloud infrastructure to ensure highavailability and performance. Design and … reliability practices. Is this you? Proven expertise in Site Reliability Engineering with at least 7 years of hands-on experience. Deep understanding of system architecture and infrastructure design for highavailability and performance. Bachelor's degree in Computer Science, Engineering, or a related field. Strong proficiency in programming languages such as Python, Java, or Go for automation and More ❯
a Platform Engineer, you will bring your extensive experience in designing, building, and maintaining scalable and secure infrastructure solutions to deliver mission-critical results. Your contributions will ensure the availability, performance, and security of core platforms, meeting the high standards required in dynamic and sensitive environments. At GoldenTech, we believe in empowering our people to drive innovation and … pipelines to streamline deployments and enhance software reliability. Proactively monitor platform performance, identify bottlenecks, and optimize systems. Manage cloud-based and on-premise infrastructure with orchestration tools to ensure highavailability and disaster recovery readiness. Implement robust security practices to protect data, applications, and services. Troubleshoot and resolve complex platform issues to minimize downtime and improve reliability. Qualifications More ❯
Our client, a fast-growing technology organisation with global operations, is seeking a proactive Infrastructure & Systems Engineer to join their high-performing technical operations function. This person will take ownership of both on-premise and cloud-based infrastructure, playing a key role in modernising legacy systems while maintaining the performance and resilience of existing services. The organisation maintains a … demanding operational environment with 24/7 availability standards and a commitment to service excellence. The successful candidate will be instrumental in maintaining core IT infrastructure, contributing to long-term strategy, and supporting the stability of mission-critical systems. This role is well-suited to someone with strong technical problem-solving capabilities and the ability to balance hands-on … engineering with strategic thinking. Key Responsibilities Lead the development and support of enterprise-wide systems infrastructure, including servers, storage, networks, and virtualisation layers. Build and maintain secure, high-availability environments across cloud and physical platforms, ensuring performance, capacity, and security align with business demands. Deliver both project-based infrastructure implementations and BAU support, contributing to disaster recovery, capacity More ❯
robust microservices-based solution, enabling agility, scalability, and independent service deployments. Define and own the solution architecture for the product, ensuring scalability, configurability, and cloud-agnostic capabilities. Develop HLD (High-Level Design) and LLD (Low-Level Design) documents for the product. Create and maintain the deployment architecture , ensuring efficient and resilient deployment strategies. Design the integration architecture , including APIs … first design . Cloud Platforms: Deep knowledge of key cloud players AWS, Azure, and GCP , ensuring cloud-agnostic design principles. Scalability & Performance Optimization: Expertise in designing scalable, distributed, and high-availability systems. DevOps & CI/CD: Knowledge of Kubernetes, Docker, Terraform, Ansible , and other infrastructure automation tools. Security & Compliance: Understanding of cloud security, management, and regulatory compliance (GDPR … a related field. 6+ years of experience in software architecture and design. Proven experience as a Solution Architect in SaaS-based or cloud-agnostice products . Strong background in high-scale distributed systems, API design, and cloud platforms . Experience in leading architecture for a multi-tenant SaaS or large enterprise application. Certifications: AWS Certified Solutions Architect, Google Professional More ❯
party services. Oversee API development with product owner and ensure best practices in service-oriented architecture. Team Leadership & Collaboration: Work closely with engineering, DevOps, and support teams to deliver high-quality solutions. Facilitate agile ceremonies, including backlog grooming, sprint planning, and retrospectives. Act as the primary liaison between technical teams and business stakeholders. Operational Excellence & Continuous Improvement: Ensure highavailability and reliability of the platform and applications, implementing monitoring and automation as needed. Identify areas for improvement and drive initiatives for performance optimization. Maintain compliance with security, data protection, and industry standards. Vendor relationship management: Manage the relationship with vendor(s) and hold them contractually accountable for all services provided. Qualifications Required Qualifications: Education & Experience: Bachelor's More ❯
customer-impacting issues within the day-to-day role to management. *How You'll Do It* * Analyze, design, code, test, and deploy new user stories and product features with high quality (security, reliability, operations) to production. Understands the software development lifecycle and leverages critical thinking skills to properly evaluate features and functionality. * Guides early-career engineers by providing learning … design decisions and guides team to achieve key results for products assigned to them. * Remediates issues using engineering principles and creates proactive design solutions for potential failures to ensure high reliability of technical solutions. * Achieves team commitments (and influence others to do the same) through collaboration with other engineers, architects, product owners and data scientists. * Contributes to and leads … Driven Development (TDD) * Experience with Behavior Driven Development (BDD, Cucumber test framework) * Experience writing unit and service level tests to ensure adequate code coverage (JUnits) * Drools * Proven skills in highavailability and scalability design, as well as performance monitoring * Experience developing and implementing API service architecture * Experience in working with a relational database * Experience provisioning infrastructure with Terraform More ❯
customer-impacting issues within the day-to-day role to management. *How You'll Do It* * Analyze, design, code, test, and deploy new user stories and product features with high quality (security, reliability, operations) to production. Understands the software development lifecycle and leverages critical thinking skills to properly evaluate features and functionality. * Guides early-career engineers by providing learning … design decisions and guides team to achieve key results for products assigned to them. * Remediates issues using engineering principles and creates proactive design solutions for potential failures to ensure high reliability of technical solutions. * Achieves team commitments (and influence others to do the same) through collaboration with other engineers, architects, product owners and data scientists. * Contributes to and leads … Driven Development (TDD) * Experience with Behavior Driven Development (BDD, Cucumber test framework) * Experience writing unit and service level tests to ensure adequate code coverage (JUnits) * Drools * Proven skills in highavailability and scalability design, as well as performance monitoring * Experience developing and implementing API service architecture * Experience in working with a relational database * Experience provisioning infrastructure with Terraform More ❯
Bristol, Gloucestershire, United Kingdom Hybrid / WFH Options
CACI Limited
protection, regulatory compliance, and alignment with industry best practices (e.g., AWS IAM, encryption, VPC, security monitoring, and auditing). • Containerisation & Orchestration: Architect and oversee containerised environments using Kubernetes, ensuring highavailability, scalability, and fault tolerance for critical applications. • Event-Driven Systems: Lead a team to architect event-driven systems using Kafka, designing and managing messaging frameworks to handle … Certified Security - Specialty) preferred. • Understanding of architectural standards and frameworks e.g. TOGAF Due to the industries we work in, we require the successful candidate to be able to obtain high level security clearance. To qualify for this, you must be a British citizen and have lived permanently in the UK for the last 5 years. Why work for us More ❯
the infrastructure engineers: VMware Infrastructure Design and deployment of large-scale Vmware environments Configuration and management of ESXi, vCenter, vSAN, and NSX Implementation of RBAC for Vmware access control Highavailability, disaster recovery, and backup strategies Operating Systems Deployment, configuration, and management of Linux (various distributions) Windows Server setup, including Active Directory, DNS, and Group Policy Linux repositories More ❯
recovery projects. People: Management and growth of engineers - through 1:1s, performance reviews and objectives, it is important for all that we are able to deliver work of a high standard, in a sustainable manner, and engineers are able to learn, develop and grow their skills and career. Collaborate closely with other engineering teams, product managers, and business leaders … to align infrastructure capabilities with business needs and growth. What we're looking for Technical Experience An expert in modern infrastructure technology with experience in high-availability cloud platforms for SAAS companies. Experience with our specific tech stack is preferred. Understanding of regulatory frameworks like GDPR, ISO27k etc. An advocate for AI technologies and constantly stays up to More ❯
containerisation (Docker, Kubernetes) and cloud platforms (AWS, GCP or Azure) Skilled in cross-functional collaboration and stakeholder communication Strong analytical skills with a proactive, problem-solving mindset Experience in high-availability systems, cybersecurity frameworks (ISO, SOC), or Elixir development Background in fast-paced, start-up or scale-up environments Interest in stepping into or growing towards an Engineering More ❯
Oversee the full lifecycle of enterprise applications from ideation to deployment and ongoing support. Design and ensure seamless integration of applications across various platforms and systems, ensuring interoperability and high availability. Work with technical teams to design integration points, leveraging APIs, microservices, and cloud platforms for efficient communication between disparate systems. Lead the design and implementation of cloud-native … applications or hybrid solutions leveraging platforms like AWS, Azure, or Google Cloud. Ensure application solutions are optimized for cloud environments, implementing strategies for application scalability, security, and high availability. Guide teams in the adoption of cloud-based services and platforms, ensuring seamless migration of applications to cloud infrastructure. Document the architecture of enterprise applications, including technical specifications, process flows More ❯