Terraform, GKE cluster, IaC, code reviews, CI/CD, Kubernetes, node pools, system reliability, Cloud reliability, security . Experience level: Mid-senior Experience required: 9 Years Education level: Bachelors degree Job function: Information Technology Industry: Information Technology and Services Pay rate : View hourly payrate Total position: 1 Relocation … cross-functional teams to optimize infrastructure performance and cost. Create and maintain comprehensive documentation for GKE cluster usage, architecture, and lifecycle management. Ensure system reliability, security, and scalability in cloud infrastructure design and deployment. Strong experience with Kubernetes More ❯
We believe in celebrating one another's achievements, open communication, and transparent leadership. About the Position The DevOps Engineer role reports to the SRE Lead, working as part of a small team providing and maintaining the development tools (CI/CD tool chain), providing DevOps excellence and best practice More ❯
Senior Cloud/SREEngineer Are you enthusiastic about designing and managing cloud platforms? Do you find satisfaction in ensuring the reliability and performance of complex systems? About Team: The LexisNexis Intellectual Property (IP) division ( ) provides international patent content and a suite of online and analytic tools that … contributes directly to project plans, schedules, and methodologies for implementing cross-functional software assets and infrastructure. Responsibilities include cloud platform design across multiple systems, SRE activities, mentoring less-experienced team members, and collaborating with users, customers, and stakeholders to translate their requirements into effective solutions. Additionally, it focuses on fostering … Kubernetes/EKS). Proficiency in scripting languages (e.g., Python, Bash, TypeScript, PowerShell). Knowledge of networking concepts and security best practices. Familiarity with SRE activities and best practices. Familiarity with DevOps practices and tools. Experience with monitoring and logging tools (e.g., DataDog, Coralogix, AWS CloudWatch, Azure Monitor). Excellent More ❯
This role plays a key part in the global follow-the-sun support model, working closely with the Global SRE Leader to support platforms worldwide. We are looking for SRE talent with experience in an On-Prem/Datacenter environment. The ideal candidate will bring strong technical leadership, experience in … high-impact team. You'll collaborate with Engineering, Infrastructure, and Operations teams to maintain high availability and resilient service delivery, while also mentoring a SRE team focused on continuous improvement and innovation. Key Responsibilities: Technical Leadership Develop deep expertise in the Titanium trading platform to lead and support critical business … ensuring priorities align with business goals and resource capacity. Operational Excellence Champion initiatives that enhance system availability, scalability, and performance. Collaborate with the Global SRE Leader to refine and enforce operational policies (e.g., Capacity Planning, Change Management, Disaster Recovery). Cross-Functional Collaboration Partner with Software Engineering, Infrastructure, Operations, Security More ❯
This role plays a key part in the global follow-the-sun support model, working closely with the Global SRE Leader to support platforms worldwide. We are looking for SRE talent with experience in an On-Prem/Datacenter environment. The ideal candidate will bring strong technical leadership, experience in … high-impact team. You'll collaborate with Engineering, Infrastructure, and Operations teams to maintain high availability and resilient service delivery, while also mentoring a SRE team focused on continuous improvement and innovation. Key Responsibilities: Technical Leadership Develop deep expertise in the Titanium trading platform to lead and support critical business … ensuring priorities align with business goals and resource capacity. Operational Excellence Champion initiatives that enhance system availability, scalability, and performance. Collaborate with the Global SRE Leader to refine and enforce operational policies (e.g., Capacity Planning, Change Management, Disaster Recovery). Cross-Functional Collaboration Partner with Software Engineering, Infrastructure, Operations, Security More ❯
Cheltenham, Gloucestershire, United Kingdom Hybrid / WFH Options
TwinStream
consolidate their collective expertise and experience into one business, providing technical excellence and exceptional service to their clients. We have teams working both on-site with clients and remotely from home. Location: Hybrid working in Cheltenham with possible 24/7 call out when on rota Security Clearance: Must … practices. Experience building and maintaining robust CI/CD pipelines. Proven experience deploying full-stack solutions to cloud infrastructure. Comprehensive experience in implementing Service Reliability processes. Understanding of agile software development principles and practices, with the ability to collaborate in a fast-paced, evolving environment. Knowledge of or understanding More ❯
and more who are enabling organizations worldwide to harness speed and agility with a cloud-first strategy. We're looking for an experienced Manager, SiteReliability Engineering (Federal) to lead our ZPA team. This role requires Secret Security Clearance that you must maintain throughout employment. An Information Assurance … Level 2 Certification is also required, but you can obtain this certification within your first few weeks of employment. Reporting to the Senior Manager, SRE, you will: Lead and mentor a team of SREs responsible for the operational health of Federal cloud products, including deployments, on-call operations, and incident … SLA requirements What We're Looking For (Minimum Qualifications) 7+ years of relevant experience, including 2+ years of managerial or team leadership experience in SRE, DevOps, or similar roles within regulated frameworks. Deep understanding of FedRAMP authorization levels and experience implementing vulnerability management programs Proficiency in Linux administration, infrastructure automation More ❯
the ability to "work from anywhere" for two weeks of the year Paid one month sabbatical after four years' employment Role Overview Luminance's SRE team combines strong problem solving, infrastructure tooling and wider DevOps practices to provide a service of Luminance's unique software applications. The team plays a … response and issue resolution, swiftly addressing and resolving service interruptions to maintain the highest level of customer satisfaction. With a focus on automation, scalability, reliability and security, the team enable Luminance to ensure a performant, seamless experience for its users. You will join a small, dynamic team of creative More ❯
Do Ready to lead game-changing tech initiatives in renewable energy? You'll architect, optimize, and scale cloud infrastructure, drive automation , and ensure system reliability in a dynamic, multi-technology environment. From running Kubernetes in production to solving large-scale cloud challenges , you'll design robust, scalable solutions and … mentor top engineers along the way. Optimize & Scale - Design and improve cloud infrastructure for reliability, security, and efficiency. Kubernetes & Cloud - Manage, troubleshoot, and scale K8s environments in production. Infrastructure as Code (IaC) - Automate deployments using Terraform, Helm, Pulumi, or similar tools. Troubleshooting - Diagnose and resolve production issues in cloud … deployment and operation of systems. Innovation & Mentorship - Research new tools, drive improvements, and mentor engineers in DevOps best practices. Your Superpowers ️ 10+ years in SRE, Platform Engineering, DevOps, or Cloud Infrastructure Expert in Kubernetes (scaling, troubleshooting, managing clusters in production) Strong in Infrastructure as Code (IaC) - Terraform, Helm, Pulumi, or More ❯
Offchain Labs has developed a suite of scaling solutions for Ethereum. This includes Arbitrum, an Optimistic Rollup, that instantly scales apps, reducing costs and increasing capacity, without sacrificing Ethereum's security. Porting contracts to Arbitrum requires no code changes or More ❯
At Offchain Labs, we are not just building products - we're leading a movement. We are committed to creating a decentralized, secure, and transparent future through blockchain technology. Our mission is to push the boundaries of blockchain innovation and foster More ❯
Company Description It all started in sunny San Diego, California in 2004 when a visionary engineer, Fred Luddy, saw the potential to transform how we work. Fast forward to today - ServiceNow stands as a global market leader, bringing innovative AI-enhanced technology to over 8,100 customers, including … infrastructure. Use your experience in software development, systems engineering and networking to proactively prevent repeatable issues. Drive initiatives with partner teams to improve the reliability and performance of the infrastructure through improved system design. Drive a culture of intolerance to manual activity which results in a highly automated environment More ❯
DRW is a diversified trading firm with over 3 decades of experience bringing sophisticated technology and exceptional people together to operate in markets around the world. We value autonomy and the ability to quickly pivot to capture opportunities, so we More ❯
development team Be on call and respond to incidents promptly Who We Are Looking For Have at least 5 years of experience as a SRE or DevOps Build infrastructure with Murphy's Law in mind Are keen on monitoring atmospheric pressure in the data center Trust people but automate all More ❯
What if your coding skills could do more than just build software, what if they could make entire systems faster, smarter, and unbreakable? Bringing your skills in software development, you'll work on internal tooling and automation, using Golang, Python More ❯
we look for? We are looking for a seasoned and highly technical cloud Linux Engineer to be part of the Enterprise Cloud Infrastructure SRE Team and be part of the multi-year cloud transformation program that we have embarked upon. 100% remote position for people located in Spain. Key … Responsibilities: Engineer a best-in-class cloud IAAS service offering. Engineer and integrate cloud IAAS solutions & services to industry best practice with scalability, fault tolerance, highly resilient, highly secure, observability, and simplicity in mind Run proof of concepts for new cloud IAAS or container services or cloud third … service or enhancement priorities Work closely with development teams to help them through the cloud journey, evaluate and identify new cloud service requirements, and engineer and implement globally Work with developers to achieve the most optimal device performance and concurrency, by applying appropriate tuning techniques. Identify, triage, and constantly More ❯
contributes directly to project plans, schedules, and methodologies for implementing cross-functional software assets and infrastructure. Responsibilities include cloud platform design across multiple systems, SRE activities, mentoring less-experienced team members, and collaborating with users, customers, and stakeholders to translate their requirements into effective solutions. Additionally, it focuses on fostering … Kubernetes/EKS). Proficiency in scripting languages (e.g., Python, Bash, TypeScript, PowerShell). Knowledge of networking concepts and security best practices. Familiarity with SRE activities and best practices. Familiarity with DevOps practices and tools. Experience with monitoring and logging tools (e.g., DataDog, Coralogix, AWS CloudWatch, Azure Monitor). Excellent More ❯
leading organizations, like Samsung and Toyota, trust MongoDB to build next-generation, AI-powered applications. We are looking for an experienced Lead for our SRE, InfraSec team, to guide the security of our cloud-based infrastructure. As a Lead SRE, you will be very hands-on technically while also directly … adheres to the highest security standards. They build essential security infrastructure and implement controls that reinforce the platform's security posture. This is an SRE team, which means you can expect a highly hands-on approach, tackling the technical challenges of implementing large scale solutions. This team is deeply involved … implement, and manage cloud-native security tools and platforms for endpoint security, identity management (IAM), and CSPM Qualifications: Experience: 7+ years of experience in SRE, infrastructure engineering or similar role, with a strong focus on security work, with ideally 2+ years in a leadership or senior engineering role Security Mindset More ❯
ever growing demand. It is also why we are once again hiring. We are a group of seasoned professionals that lean on collaboration, agile & SRE best practices. Team health and achieving a productive and sustainable work culture is important to us. We strive to improve the flow of work and … with CI POLY security clearance. Applicants must already possess a valid and active TS/SCI with CI POLY security clearance. - 5+ years of SRE/DevOps experience in automating deployments, scaling, and management of containerized applications within Kubernetes. - Enterprise Experience in public cloud platform (ideally on AWS or GCP More ❯
We have an exciting opportunity for an SRE/DevOps Engineer to join a leading software house. Reporting to the Head of Engineering, this role offers the chance to stay hands-on and participate in strategic decisions. We want someone with fresh ideas and supporting experience who enjoys collaborating … with a cross-functional team to develop real-world solutions and positive user experiences. Responsibilities: Ownership and implementation across multiple projects from our SRE strategy that focuses on maturity, scalability, resilience, security and automation. Manage and maintain environments to ensure high availability and security. Build and maintain infrastructure as code … identify and address performance bottlenecks, security vulnerabilities, and system failures. Automate manual tasks to improve operational efficiency and reduce technical debt. Collaborate with other SRE engineers to ensure resilience and scalability across the platform. Work with our teams to directly influence and drive the adoption of SRE best practices and More ❯
Job Title: Product Support - SRE/Engineering Job Location: El Segundo, CA 90245 Onsite Requirements: establishing product support strategies. experie3nce integrating system engineering design characteristics. integrating support elements, procurement and budgeting strategy into logistics activities. Job Description: Experience with establishing product support strategies that align with acquisition strategies and identifying … required infrastructures, processes, and procedures to maintain system capabilities. Experience with integrating system engineering quantitative design characteristics (reliability, maintainability, etc.) with the functional logistics/integrated product support elements to identify product support resources. Experience with integrating the 12 product support elements, procurement and budgeting strategy into logistics activities More ❯