Our DoD/Aerospace client is looking to fill a Site Reliability Engineer (SRE) - Advanced Software Engineer position that just opened in Scottsdale, AZ. In this role, as the Site Reliability Engineer (SRE) for the Space and Intelligence Systems line of business, you will be a member of a cross-functional team responsible for … maintaining survivability and reliability of mission-critical resources. SREs monitor high-priority systems and automate recovery mechanisms to ensure they remain operational for the warfighter. To be considered, you need a Bachelor's degree in Software Engineering or a related Science, Engineering or Mathematics field. The role also requires 5+ years of job-related experience, or a Master's degree plus … eligibility requirements for access to classified information. Due to the nature of work performed within our facilities, U.S. citizenship is required. As a Site Reliability Engineer (SRE) for our client's Space and Intelligence Systems line of business, you will be a member of a cross-functional team responsible for maintaining survivability and reliability of mission-critical …
Bristol, Avon, South West, United Kingdom Hybrid / WFH Options
Twinstream Limited
Site Reliability Engineer | £65,000–£95,000 DOE | Hybrid (Bristol-based, occasional site visits) Clearance: Must be eligible for DV Clearance Founded in 2019 by engineers solving complex cross-domain problems for government organisations, TwinStream delivers technical excellence and exceptional service to high-profile clients. Our teams work both on-site and remotely … supporting mission-critical systems where performance and reliability are paramount. The Site Reliability Engineer Role: We are seeking a Site Reliability Engineer (SRE) to ensure the availability, performance, and cost-effectiveness of our cloud and on-prem services. You will collaborate with software engineers and system administrators to improve observability, reduce downtime, and … proactively mitigate reliability risks across a growing portfolio of services. Key Responsibilities of the Site Reliability Engineer: Improve reliability and performance across multiple subsystems. Automate manual tasks and eliminate unnecessary alerts. Enhance monitoring capabilities to identify and resolve issues before they impact users. Support and optimise CI/CD pipelines and cloud infrastructure. Research and …
Junior Site Reliability Engineer … We are currently working with a leading Financial Services company that is looking for a Junior Site Reliability Engineer to join its ever-expanding platform/SRE team, based at its Shoreditch, London office, where you will be expected to work on-site 4 days a week. They are looking for you to have excellent cloud knowledge … ideally AWS, as well as experience with PowerShell/Python. As the Junior Site Reliability Engineer, you will be a self-starter with excellent stakeholder management experience who can show outcome-based work. You will ideally have 2 years of commercial experience from an IT Operations/Cloud infrastructure background. Please note this is …
Farnborough, Hampshire, England, United Kingdom Hybrid / WFH Options
Addition
Site Reliability Engineer (Defence) This is a chance to join a forward-thinking digital solutions business delivering secure technology for the Defence and Security sector. As a Site Reliability Engineer, you’ll be at the heart of building, scaling, and maintaining critical platforms that underpin mission-ready technology. Role Overview: Role: Site Reliability Engineer Location: Hybrid, 3 days per week in Farnborough Package: £60,000-£70,000 per annum Benefits Industry: Defence & Security What You’ll Be Doing: Designing and maintaining Kubernetes environments for scalable deployments. Building and optimising CI/CD pipelines to improve efficiency. Implementing monitoring systems to ensure reliability and performance. Driving automation initiatives to reduce manual … in security, maintainability, and scalability. Staying ahead of emerging technologies to keep the platform cutting-edge. Main Skills Needed: Applicants must be eligible for Security Clearance. Proven experience in Site Reliability or Platform Engineering (5+ years). Strong knowledge of Kubernetes and container orchestration. Expertise in CI/CD tools (Jenkins, GitLab, etc.). Experience with AWS is …
Site Reliability Engineer (International Travel) This is a chance to join a forward-thinking digital solutions business delivering secure technology for the Defence and Security sector. As a Site Reliability Engineer, you’ll be at the heart of building, scaling, and maintaining critical platforms that underpin mission-ready technology. Role Overview: Role: Site Reliability Engineer Location: 1-2 days per week in Farnborough, International Travel (project-based) Package: £60,000-£70,000 per annum Benefits Industry: Defence & Security What You’ll Be Doing: Designing and maintaining Kubernetes environments for scalable deployments. Building and optimising CI/CD pipelines to improve efficiency. Implementing monitoring systems to ensure reliability and performance. … the platform cutting-edge. Main Skills Needed: Applicants must be eligible for Security Clearance. Happy to travel internationally according to project requirements (all costs covered). Proven experience in Site Reliability or Platform Engineering (4+ years). Strong knowledge of Kubernetes and container orchestration. Expertise in CI/CD tools (Jenkins, GitLab, etc.). Experience with AWS is …
Check out this new opportunity! Site Reliability Engineer (SRE) Scottsdale, AZ Contract Position Pay Estimated: $62.36-$67.36/Hour This estimate represents the typical pay range for this position based on experience and other factors (geographical location, etc.). A Site Reliability Engineer (SRE) job in Scottsdale, Arizona is available with Moseley Technical Services Top Space …/Defense company. As part of the Space and Intelligence Systems line of business, you will be a member of a cross-functional team responsible for maintaining survivability and reliability of mission-critical resources, monitoring high-priority systems and automating recovery mechanisms to ensure they remain operational for the warfighter. Responsibilities: Ensure system uptime demonstrated by diagnosing and triaging …
Site Reliability Engineer (SRE) Active Secret Clearance Required Direct Hire Onsite at Langley AFB, Hampton, VA Various Shifts Available ALTA IT Services has a contract-to-hire opening for two Site Reliability Engineers (SREs) to support mission-critical DOD programs, onsite at Langley AFB, Hampton, VA. Various shifts are available. An active Secret clearance … is required. In this role, you will focus on ensuring the availability, reliability, and performance of a multi-tenant, microservices application suite. You will collaborate closely with cross-functional teams to troubleshoot issues, automate processes, and build scalable, resilient systems. You will learn the nuances of the entire suite of applications and their infrastructure, which will facilitate your missions … solutions to prevent recurrence. Collaborate with development and DevOps teams to optimize and maintain Kubernetes environments and CI/CD pipelines. Develop and refine automation scripts to enhance system reliability, including automated recovery and self-healing capabilities. Build and maintain observability frameworks, integrating metrics, logging, and tracing tools for proactive issue identification. Qualifications: Bachelor's degree in Computer Science …
Cheltenham, Gloucestershire, South West, United Kingdom
Oscar Associates (UK) Limited
Site Reliability Engineer | Cheltenham | £600 per day (Outside IR35) About the Role: We're seeking an experienced Site Reliability Engineer with live eDV clearance to join an on-site team in Cheltenham. This contract role involves supporting and maintaining a managed cross-domain service, applying SRE practices to ensure reliability, security, and performance. Contract Details: Employment: Contract (Outside IR35) Rate: £600 per day Length: 6 months (long-term extensions very likely). Location: Cheltenham … days on-site) Clearance: Live eDV required Start Date: ASAP Key Responsibilities: Build and deploy code using Java, Maven, NPM, Terraform, and Ansible across OpenShift, RHEL/CentOS, and Docker. Monitor and optimise system performance with Influx and Grafana. Provide 2nd/3rd line support, incident response, and root cause analysis. Carry out BAU maintenance including patching, database housekeeping …
Job Title: CL - Site Reliability Engineer (SRE) - Advanced Software Engineer Employment Type: Contract Labor Location: USA-AZ-Scottsdale Responsibilities: Requires a Bachelor's degree in Software Engineering, or a related Science, Engineering or Mathematics field. Also requires 5+ years of job-related experience, or a Master's degree plus 3 years of job-related experience. Agile … classified information. Due to the nature of work performed within our facilities, U.S. citizenship is required. Qualifications: ROLE AND POSITION OBJECTIVES: As a Site Reliability Engineer (SRE) for the Space and Intelligence Systems line of business, you will be a member of a cross-functional team responsible for maintaining survivability and reliability of mission-critical resources. SREs … monitoring large-scale systems and using automation to triage emerging issues Mixed job experience involving software engineering, systems administration, and network engineering Workplace Options: This position is fully on-site, or Hybrid/Flex with manager's approval. While on-site, you will be a part of the Scottsdale location facility …
Washington, Washington DC, United States Hybrid / WFH Options
ClearanceJobs
Remote - Site Reliability Engineer (SRE) ClearanceJobs is aiding their partner, headquartered in New York City and widely recognized as the industry leader in CPS protection, in their search for a skilled Site Reliability Engineer (SRE). The selected candidate will support and maintain our customers' FedRAMP-compliant deployment in AWS GovCloud for public sector … customers. The SRE will be responsible for ensuring high availability, security, and compliance of cloud-based environments while driving automation, monitoring, and incident response best practices. U.S. Citizenship (required for working in GovCloud environments) Terms: Full-time/Direct Hire Location: Remote (DMV area) Salary: $200k - $260k (will fluctuate pending experience) Qualifications: • 6-8+ years of experience in SRE, DevOps … and scripting (Python, Bash). • Experience with logging, monitoring, and observability tools in a cloud-native environment. • Strong troubleshooting, problem-solving, and automation mindset. Responsibilities/Impact as an SRE: • AWS GovCloud Operations: Manage and optimize cloud-based infrastructure in AWS GovCloud, ensuring FedRAMP compliance and high availability. • Reliability & Performance: Monitor and enhance system performance, scalability, and reliability …
Site Reliability Engineer - Barcelona Join a leading global travel technology company that's transforming the way businesses manage travel. You will be working with cutting-edge platforms that combine world-class travel inventory with powerful management tools, delivering freedom for travellers … and control for companies, saving time, money, and hassle for everyone. Our client is an award-winning scale-up company seeking a talented Site Reliability Engineer (SRE) to help ensure its systems are fast, reliable, and ready to scale. What is in it for you: Salary up to €85,000 plus equity in the company. Generous holiday allowance … company events. Mental health support tools. Significant opportunities for growth and progression. The role: Design, build, and maintain scalable, secure cloud infrastructure in AWS. Monitor and manage system performance, reliability, and security. Implement and refine monitoring tools to ensure system health and availability. Partner with development teams to create resilient, secure applications. Participate in our on-call rotation, responding …
Role Overview: We are seeking a highly skilled and motivated Site Reliability Engineer (SRE) to join our engineering team to support critical application deployments in a "follow-the-sun" environment. In this role, you will leverage your expertise in cloud provisioning, infrastructure as code, and container orchestration to ensure the reliability, scalability, and performance of our … and versioning. Containerization and Orchestration: Deploy, manage, and provide ongoing support for containerized applications using Kubernetes, including Amazon EKS (Elastic Kubernetes Service) and Azure Kubernetes Service (AKS), ensuring their reliability, availability, and performance. Monitoring and Alerting: Monitor application performance and system health through observability tools (e.g., Prometheus, Grafana, ELK stack), proactively identifying and resolving issues to ensure high availability … and solutions, including RESTful APIs, ensuring seamless integration across platforms. Post-Mortem Analysis: Conduct comprehensive post-mortem analyses following incidents, identifying root causes and recommending improvements to enhance system reliability and performance. Mentorship: Mentor and guide junior engineers, fostering a culture of knowledge sharing and continuous improvement within the engineering team. Skills and Experience: Bachelor's degree in computer …
Site Reliability Engineer Hybrid As a senior SRE you would be joining a growing HashiVault squad as part of the strategy to offer more services and a better user experience. What we're looking for: Hands-on experience with Google Cloud Platform (GCP) Hands-on experience with Kubernetes Experience in implementing technological solutions, managing applications and services … analytical and troubleshooting skills Skills that will help you in the role: Experience with HashiCorp Vault Experience with Jenkins for CI/CD GCP certifications such as Associate Cloud Engineer, Professional Cloud Architect, Professional Cloud DevOps Engineer, Professional Cloud Security Engineer HashiCorp certifications such as Terraform Associate, Terraform Authoring and Operations Professional, Vault Associate, or Vault Operations … Professional Experience migrating application workloads into GCP and/or GKE Site Reliability Engineer Due to the volume of applications received for positions, it will not be possible to respond to all applications and only applicants who are considered suitable for interview will be contacted. Proactive Appointments Limited operates as an employment agency and employment business and …
Role: Site Reliability Engineer Client: Defense-Aerospace Hourly Rate: up to $68/hr W2, non-benefited Length … Long-term Location: Scottsdale, AZ Clearance: Department of Defense TS/SCI security clearance is preferred at time of hire. Description: As a Site Reliability Engineer (SRE), you will be a member of a cross-functional team responsible for maintaining survivability and reliability of mission-critical resources. SREs monitor high-priority systems and automate recovery mechanisms …
Bath, Somerset, South West, United Kingdom Hybrid / WFH Options
Deerfoot Recruitment Solutions
Site Reliability Engineer Work From Home (WFH) + Quarterly Visits to Bath Full Time, Initial 12 Month Fixed Term Contract Salary DOE (£45k - £60k) + Benefits + Bonus Deerfoot Recruitment is working with an established FCA-authorised outsourced service provider in the financial services sector, seeking a talented Site Reliability Engineer to join their … cloud architecture Engage in infrastructure design, implementation, and operation to ensure highly available, scalable systems Work collaboratively across development and operations teams throughout the software lifecycle Champion system automation, reliability, and continuous improvement initiatives Monitor production systems with auto … healing and auto-scaling methodologies Support CI/CD pipelines and streamline infrastructure-as-code workflows Maintain strong security-first practices within infrastructure design and management About You Proven SRE generalist with broad cloud infrastructure experience and adaptability Experience deploying cloud infrastructure in a regulated financial services environment Skilled in Terraform and PowerShell automation tools Familiarity with Windows Server and …
Lisburn, County Antrim, United Kingdom Hybrid / WFH Options
Camlin
problems. As of today, the Camlin operation spans over 20 countries across the globe. Job Overview We are seeking a dedicated and experienced Site Reliability Engineer (SRE) to join our dynamic team. The SRE will be responsible for ensuring the reliability, performance, and availability of our critical systems and services. This role requires a blend of … software engineering and operations skills to build and run large-scale, distributed, fault-tolerant systems. Key Responsibilities System Reliability and Performance Design, implement, and maintain scalable and reliable infrastructure. Monitor system performance, detect issues, and ensure maximum uptime. Develop and implement strategies for disaster recovery and data backup. Automation and Tooling Automate repetitive tasks to improve efficiency and reduce … Conduct post-incident reviews to identify root causes and prevent recurrence. Develop and maintain incident response protocols and playbooks. Collaboration and Communication Work closely with development teams to integrate reliability into the software development lifecycle. Communicate effectively with stakeholders about system status and health. Provide guidance and mentorship to junior team members. Security and Compliance Ensure systems comply with …
Join us as a Senior Site Reliability Engineer - Oracle, where you'll spearhead the evolution of our digital landscape, driving innovation and excellence. This role will include: applying software engineering techniques, automation, and best practices in incident response, ensuring the reliability, availability, and scalability of the systems, platforms, and technology through them. To be successful as … a Senior Site Reliability Engineer - Oracle you should have experience with: Oracle Enterprise Manager (OEM), Oracle Internet Directory (OID), Oracle Database Performance Tuning - SME Deep understanding of LDAP protocols and directory services. SQL Optimization Strong skills in scripting languages (e.g., Python, Bash) to automate repetitive tasks and knowledge of configuration management tools (e.g., Ansible, Puppet, Chef). … skills This role will be based in our Knutsford campus. Purpose of the role To apply software engineering techniques, automation, and best practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. Accountabilities Availability, performance, and scalability of systems and services through proactive monitoring, maintenance, and capacity planning. Resolution, analysis …
Are you a seasoned Site Reliability Engineer looking for an exciting new challenge? Join this team and transition into maintaining and enhancing the reliability of one of the world's largest platforms. In this role, you will utilise your expertise in Golang coding to develop robust applications, ensuring the systems remain resilient, scalable, and efficient. If … presence and commitment to innovation, you will have the opportunity to work on projects that reach millions of users, making a real difference in the tech world. As a Site Reliability Engineer, you will be responsible for designing, developing, and maintaining systems and applications using Golang. You will monitor and optimise system performance with tools such as … Grafana, Prometheus, New Relic, and Splunk. Your role will involve identifying and resolving reliability issues, automating processes, and ensuring the seamless operation of the platform. If you have a passion for technology and a drive to ensure excellence, we would love to hear from you.
Overview Site Reliability Engineer - Global Network Services Transformation A leading financial technology organisation is embarking on an exciting journey to transform its Network Services Group, and they're now seeking a Site Reliability Engineer to join their growing team. This opportunity is perfect for someone who thrives at the intersection of software engineering and … infrastructure reliability. The successful candidate will design, develop, and maintain self-service automation tools that drive efficiency, reduce costs, and improve resilience across one of the world's most sophisticated network infrastructures. Working with colleagues across the US, UK, India, and Singapore, this engineer will play a pivotal role in advancing the company's automation-first approach …
Nottingham, Nottinghamshire, United Kingdom Hybrid / WFH Options
Commify Group
and be part of our success story! Role Summary In the role of Site Reliability Engineer at Commify, you will be an integral part of our SRE team. Your focus will be on ensuring that our products and platforms perform at their best, understanding how our software interacts with both physical and Cloud infrastructure to deliver exceptional … What essentials are we looking for? Proficiency with Microsoft Azure Strong expertise in Terraform, App Services, and Kubernetes Fluent in both written and spoken English A genuine passion for reliability in systems Experience in creating and modifying Terraform deployments Prior experience in an operations role, ideally as a Site Reliability Engineer Ability to work cross-functionally …
Washington, Washington DC, United States Hybrid / WFH Options
OMW Consulting
Job Title: Site Reliability Engineer (SRE) Location: Washington, DC - Hybrid Clearance: TS/SCI Salary: $160k-$200k Join a dynamic team dedicated to delivering best-in-class service quality and issue resolution for mission-critical deployments. In this role, you will be instrumental in shaping operational policies and implementations while working in both on-premise DoD environments … various OSI model layers to meet SLAs. Collaborate with developers to maintain secure and efficient workflows. What We're Looking For: Minimum of 4 years of experience as an SRE engineer, with a strong focus on automation and deployment. Active security clearance with experience in DoD IT environments. Proficiency in VMware, Kubernetes, Docker, Helm, Ansible, and Terraform. Strong understanding …
Bath, Avon, England, United Kingdom Hybrid / WFH Options
Deerfoot Recruitment Solutions Ltd
Site Reliability Engineer Work From Home (WFH) + Quarterly Visits to Bath Full Time, Initial 12 Month Fixed Term Contract Salary DOE (£45k - £60k) + Benefits + Bonus Deerfoot Recruitment is working with an established FCA-authorised outsourced service provider in the financial services sector, seeking a talented Site Reliability Engineer to join their IT Operations team. This … cloud architecture Engage in infrastructure design, implementation, and operation to ensure highly available, scalable systems Work collaboratively across development and operations teams throughout the software lifecycle Champion system automation, reliability, and continuous improvement initiatives Monitor production systems with auto … healing and auto-scaling methodologies Support CI/CD pipelines and streamline infrastructure-as-code workflows Maintain strong security-first practices within infrastructure design and management About You Proven SRE generalist with broad cloud infrastructure experience and adaptability Experience deploying cloud infrastructure in a regulated financial services environment Skilled in Terraform and PowerShell automation tools Familiarity with Windows Server and …