of flagging deterioration of critical equipment and components therein. Capable of implementing Reliability-Centered maintenance processes and techniques. Capable and proficient in leading and implementing Failure Mode and Effect Analysis (FMEA) reports. Capable of conducting a detailed rootcauseanalysis (RCA) report. Develop reliability reports to assist in identifying areas for increased efficiency, resiliency, or to More ❯
of flagging deterioration of critical equipment and components therein. Capable of implementing Reliability-Centered maintenance processes and techniques. Capable and proficient in leading and implementing Failure Mode and Effect Analysis (FMEA) reports. Capable of conducting a detailed rootcauseanalysis (RCA) report. Develop reliability reports to assist in identifying areas for increased efficiency, resiliency, or to More ❯
of flagging deterioration of critical equipment and components therein. Capable of implementing Reliability-Centered maintenance processes and techniques. Capable and proficient in leading and implementing Failure Mode and Effect Analysis (FMEA) reports. Capable of conducting a detailed rootcauseanalysis (RCA) report. Develop reliability reports to assist in identifying areas for increased efficiency, resiliency, or to More ❯
of flagging deterioration of critical equipment and components therein. Capable of implementing Reliability-Centered maintenance processes and techniques. Capable and proficient in leading and implementing Failure Mode and Effect Analysis (FMEA) reports. Capable of conducting a detailed rootcauseanalysis (RCA) report. Develop reliability reports to assist in identifying areas for increased efficiency, resiliency, or to More ❯
of flagging deterioration of critical equipment and components therein. Capable of implementing Reliability-Centered maintenance processes and techniques. Capable and proficient in leading and implementing Failure Mode and Effect Analysis (FMEA) reports. Capable of conducting a detailed rootcauseanalysis (RCA) report. Develop reliability reports to assist in identifying areas for increased efficiency, resiliency, or to More ❯
of flagging deterioration of critical equipment and components therein. Capable of implementing Reliability-Centered maintenance processes and techniques. Capable and proficient in leading and implementing Failure Mode and Effect Analysis (FMEA) reports. Capable of conducting a detailed rootcauseanalysis (RCA) report. Develop reliability reports to assist in identifying areas for increased efficiency, resiliency, or to More ❯
Take basic infra ownership on GCP (or AWS/Azure): networking, autoscaling, CI/CD, IaC, observability, and cost tuning. Participate in on‐call for your area and drive root‐causeanalysis with crisp follow‐ups. 15% Collaborate Pair with back‐end & front‐end to wire extractors/detectors and agents into ticketing, voice, and analytics stacks … ADK preferred ). Proven track record shipping AI agents and building RAG pipelines. LLM + DS depth: Prompting/tooling, retrieval design, LLM evals; hands‐on with time‐series analysis (forecasting, change‐point, drift). Cloud & ops: Basic infra ownership on GCP (or AWS/Azure): networking, autoscaling, CI/CD, IaC, observability, and cost control. Communication: You explain More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Anecdote
Take basic infra ownership on GCP (or AWS/Azure): networking, autoscaling, CI/CD, IaC, observability, and cost tuning. Participate in on‐call for your area and drive root‐causeanalysis with crisp follow‐ups. 15% Collaborate Pair with back‐end & front‐end to wire extractors/detectors and agents into ticketing, voice, and analytics stacks … ADK preferred ). Proven track record shipping AI agents and building RAG pipelines. LLM + DS depth: Prompting/tooling, retrieval design, LLM evals; hands‐on with time‐series analysis (forecasting, change‐point, drift). Cloud & ops: Basic infra ownership on GCP (or AWS/Azure): networking, autoscaling, CI/CD, IaC, observability, and cost control. Communication: You explain More ❯
london, south east england, united kingdom Hybrid / WFH Options
Anecdote
Take basic infra ownership on GCP (or AWS/Azure): networking, autoscaling, CI/CD, IaC, observability, and cost tuning. Participate in on‐call for your area and drive root‐causeanalysis with crisp follow‐ups. 15% Collaborate Pair with back‐end & front‐end to wire extractors/detectors and agents into ticketing, voice, and analytics stacks … ADK preferred ). Proven track record shipping AI agents and building RAG pipelines. LLM + DS depth: Prompting/tooling, retrieval design, LLM evals; hands‐on with time‐series analysis (forecasting, change‐point, drift). Cloud & ops: Basic infra ownership on GCP (or AWS/Azure): networking, autoscaling, CI/CD, IaC, observability, and cost control. Communication: You explain More ❯
slough, south east england, united kingdom Hybrid / WFH Options
Anecdote
Take basic infra ownership on GCP (or AWS/Azure): networking, autoscaling, CI/CD, IaC, observability, and cost tuning. Participate in on‐call for your area and drive root‐causeanalysis with crisp follow‐ups. 15% Collaborate Pair with back‐end & front‐end to wire extractors/detectors and agents into ticketing, voice, and analytics stacks … ADK preferred ). Proven track record shipping AI agents and building RAG pipelines. LLM + DS depth: Prompting/tooling, retrieval design, LLM evals; hands‐on with time‐series analysis (forecasting, change‐point, drift). Cloud & ops: Basic infra ownership on GCP (or AWS/Azure): networking, autoscaling, CI/CD, IaC, observability, and cost control. Communication: You explain More ❯
london (city of london), south east england, united kingdom Hybrid / WFH Options
Anecdote
Take basic infra ownership on GCP (or AWS/Azure): networking, autoscaling, CI/CD, IaC, observability, and cost tuning. Participate in on‐call for your area and drive root‐causeanalysis with crisp follow‐ups. 15% Collaborate Pair with back‐end & front‐end to wire extractors/detectors and agents into ticketing, voice, and analytics stacks … ADK preferred ). Proven track record shipping AI agents and building RAG pipelines. LLM + DS depth: Prompting/tooling, retrieval design, LLM evals; hands‐on with time‐series analysis (forecasting, change‐point, drift). Cloud & ops: Basic infra ownership on GCP (or AWS/Azure): networking, autoscaling, CI/CD, IaC, observability, and cost control. Communication: You explain More ❯
Solution Design and Implementation: Design, implement, and maintain Teamcenter solutions that integrate with generative AI technologies, such as AI-powered design, simulation, or manufacturing process optimization. Requirements Gathering and Analysis: Collaborate with stakeholders to understand business needs and translate them into functional and technical requirements for Teamcenter and AI integrations. Technical Leadership: Provide technical guidance and support for Teamcenter … experience (ITK, C, C++, Java, scripting languages), database knowledge, and server setup/performance tuning. Problem-Solving and Analytical Skills: Strong ability to diagnose and resolve technical issues, conduct rootcauseanalysis, and develop solutions. Communication and Collaboration Skills: Excellent written, verbal, and interpersonal communication skills to effectively collaborate with stakeholders. Project Management: Experience with project planning More ❯
internal/external) and drive improvement of the management system. Be data driven to assess processes, take measurements, and interpret data to facilitate continuous improvement in manufacturing operations. Perform rootcauseanalysis (8D,5 Whys) of identified manufacturing issues and identify appropriate corrective and preventive action. Participate in cross-functional teams in the investigation and elimination of … capable of communicating with plant personnel at all levels. Excellent written and oral communication skills; composing written reports and other communications. Superior analytical skills, capable of using current computer analysis tools (Microsoft Office, Outlook plus others applicable programs). Ability to work independently with a strong work ethic, is innovative, detail oriented, possesses initiative. Knowledge of ETQ Reliance Quality More ❯
Los Angeles, California, United States Hybrid / WFH Options
INSPYR Solutions
You should have strong coding skills, a passion for automation, and a focus on reliability engineering to deliver robust and maintainable systems. You will work on network design, traffic analysis and engineering, maintaining CI/CD pipeline and creating tools to enhance observability and streamline troubleshooting for core infrastructure services. Your role will include: Designing, deploying, and operating the … strategies. QoS experience across multiple vendor hardware implementations. Troubleshooting and Incident Response: Skilled at troubleshooting live incidents, with a proactive approach to minimizing downtime and service impact. Familiarity with RootCauseAnalysis (RCA) processes to identify, document, and drive long-term solutions to recurring issues. Automation and Scripting: Proficiency in scripting and programming languages like Python and More ❯
Newcastle Upon Tyne, Tyne and Wear, England, United Kingdom
Jackson Hogg - Tech
service level agreements. What you’ll be doing: Experience with an ITSM platform to manage, prioritise, and track support requests. Diagnose and resolve complex hardware and software issues. Conduct rootcauseanalysis to minimise recurring issues and enhance service quality. Ensure system security and stability through regular patching and hardening measures. Support the team development by sharing More ❯
compliance, and security policies across cloud estates. - Execute backup, disaster recovery, and business continuity procedures. - Maintain and troubleshoot Azure and hybrid cloud environments. - Perform proactive monitoring, incident response, and rootcauseanalysis of mission-critical systems. - Configure, optimise, and secure servers, virtual machines, networking, and storage solutions. - Create and maintain scripts (e.g., PowerShell) to automate operational tasks. More ❯
compliance, and security policies across cloud estates. - Execute backup, disaster recovery, and business continuity procedures. - Maintain and troubleshoot Azure and hybrid cloud environments. - Perform proactive monitoring, incident response, and rootcauseanalysis of mission-critical systems. - Configure, optimise, and secure servers, virtual machines, networking, and storage solutions. - Create and maintain scripts (e.g., PowerShell) to automate operational tasks. More ❯
on cutting-edge cloud technologies, contribute to process improvements, and be instrumental in enhancing our service delivery. Key Responsibilities - Lead resolution of complex technical escalations from Tier 2, ensuring rootcauseanalysis and permanent fixes. - Act as a subject matter expert across Azure, M365, Exchange, Intune, and virtualised environments. - Design and implement automation scripts and workflows using More ❯
on cutting-edge cloud technologies, contribute to process improvements, and be instrumental in enhancing our service delivery. Key Responsibilities - Lead resolution of complex technical escalations from Tier 2, ensuring rootcauseanalysis and permanent fixes. - Act as a subject matter expert across Azure, M365, Exchange, Intune, and virtualised environments. - Design and implement automation scripts and workflows using More ❯
SQS Implement system monitoring, logging, and alerting to ensure reliability and performance Support CI/CD pipelines and work with developers to streamline deployments Troubleshoot production issues and conduct rootcauseanalysis Maintain security best practices and ensure compliance across cloud platforms Continuously evaluate and optimise infrastructure for performance and cost Key Skills & Experience Proven experience in More ❯
SQS Implement system monitoring, logging, and alerting to ensure reliability and performance Support CI/CD pipelines and work with developers to streamline deployments Troubleshoot production issues and conduct rootcauseanalysis Maintain security best practices and ensure compliance across cloud platforms Continuously evaluate and optimise infrastructure for performance and cost Key Skills & Experience Proven experience in More ❯
Southborough, Kent, United Kingdom Hybrid / WFH Options
Vermelo RPO
Firm understanding of Agile concepts Experienced in Web and Device testing (mobile and cross browser) Excellent written and spoken English Excellent attention to detail Able to troubleshoot issues with rootcauseanalysis Desired Skills: Experience with Azure Experience with Microsoft Visual Studio Experience with .NET Experience with Big Data Database Technologies, DataLake, CosmosDb, SQL Experience with Telemetry More ❯
Tunbridge Wells, Kent, United Kingdom, Southborough Hybrid / WFH Options
Vermelo RPO
Firm understanding of Agile concepts Experienced in Web and Device testing (mobile and cross browser) Excellent written and spoken English Excellent attention to detail Able to troubleshoot issues with rootcauseanalysis Desired Skills: Experience with Azure Experience with Microsoft Visual Studio Experience with .NET Experience with Big Data Database Technologies, DataLake, CosmosDb, SQL Experience with Telemetry More ❯
API and SQL Working with Product, DevOps and QA to plan and deliver projects Improving CI/CD pipelines, coding standards, tooling and secure development practices Managing incident handling, rootcauseanalysis and documentation Collaborating with offshore developers to maintain consistent quality Tech landscape C#, .NET Core, MVC, Web API Azure SQL Server, Entity Framework React/ More ❯
Provide daily administration of the Windows based Virtual Machines and AECC services. Harden the Windows based services per commercial best practices and the required government cybersecurity controls. Troubleshooting and rootcauseanalysis of any Windows issues. Coordinate and perform major OS version upgrades of virtual machines needing upgrades. Develop and maintain system, infrastructure, and process documentation (system More ❯