setup and maintenance of CICD pipelines. Strong experience working with tools and platforms like Jenkins. Excellent knowledge of AWS cloud services to implement highly efficient architecture. Ability to analyze infrastructure and implement security best practices. Experience with infrastructuremonitoring tools like Nagios, Prometheus, Grafana. Expertise in containerization platforms like Docker and container orchestration platforms like Kubernetes and … Rancher. Familiarity with infrastructure as code tools such as Terraform, CloudFormation, Ansible, etc. Experience in writing automation scripts using Bash, Powershell, and Python. Experience with Windows and Linux systems administration. Qualifications: Bachelor's degree in Computer Science or equivalent. Minimum of 5 years of relevant experience. Excellent communication skills, both oral and written. Willingness to explore and learn new More ❯
and ability to maintain continuous integration, delivery, and deployment (CI/CD) process for a complex set of software requirements and products spread across multiple platforms. Monitor and manage infrastructure, ensuring optimal performance, security, and scalability. Define and develop, test, release, update, and support processes for DevOps operations. Troubleshoot and resolve issues related to application development, deployment, and operations. … Strong knowledge of Shell Scripting and any other programming languages such as Python, C, Groovy, Java , YAML. Experience working on Linux infrastructure. Familiarity with IBM z/OS-based infrastructure is advantageous, particularly in hybrid enterprise environments. Experiential learning on Infrastructure as Code tools (like Terraform, Ansible). Hands-On experience on Container & Container Orchestration tools like Docker … AWS ECS, Kubernetes and infrastructuremonitoring tools like Prometheus and Grafana. Experience with designing, building, and maintaining cloud-native applications across major cloud platforms such as AWS, Azure or GCP is a strong plus. Knowledge of Data Protection, Privacy and Security domain. · Understanding of agile methodologies and principles. Knowledge of databases and SQL. Excellent communication and collaboration skills More ❯
London, England, United Kingdom Hybrid / WFH Options
Protegrity
platforms. You will receive comprehensive training on existing zOS product applications and assembly processes, with a focus on developing and implementing CI/CD capabilities tailored for mainframe systems. Monitoring and managing infrastructure, ensuring optimal performance, security, and scalability. Defining and setting development, test, release, update, and support processes for DevOps operations. Troubleshooting and resolving issues related to … Strong knowledge of Shell Scripting and any other programming languages such as Python, C, Groovy, Java, YAML. Experience working on Linux infrastructure. Familiarity with IBM z/OS-based infrastructure is advantageous, particularly in hybrid enterprise environments. Experiential learning on Infrastructure as Code tools (like Terraform, Ansible). Hands-On experience on Container & Container Orchestration tools like Docker … AWS ECS, Kubernetes and infrastructuremonitoring tools like Prometheus and Grafana. Experience with designing, building, and maintaining cloud-native applications across major cloud platforms such as AWS, Azure or GCP is a strong plus. Excellent communication and collaboration skills, as well as the ability to work effectively in cross-functional teams including nearshore and offshore. Why Choose Protegrity More ❯
GitLab CI), configuration management tools (Ansible, Puppet), and containerization technologies (Docker, ECS, Kubernetes) Monitor system performance, identify bottlenecks, and implement optimizations to improve reliability and efficiency Develop and maintain Infrastructure as Code (IaC) using Terraform, Ansible, AWS CloudFormation, ensuring consistency, repeatability, and compliance Identify and automate application deployment, scaling, and security processes, reducing manual effort and improving reliability Work … Expertise in Docker, ECS, EKS, Kubernetes, implementing security best practices like image vulnerability scanning, Kubernetes RBAC, IAM Roles for Service Accounts (IRSA), Pod Security Policies, and automated compliance enforcement Infrastructure Automation & Security: Strong experience in IaC tools (Terraform, CloudFormation, Ansible), applying least privilege IAM policies, role-based access controls (RBAC), automated compliance checks, and zero-trust security principles Monitoring, Logging & Alerting: Expertise in building centralized logging solutions, integrating ELK Stack, Prometheus, Grafana, Splunk, and AWS-native security monitoring tools such as CloudWatch, Security Hub, SIEM integrations CI/CD Security & Automation: Proficient in Jenkins, Git, GitHub Actions, ensuring secure CI/CD pipelines with artifact encryption, automated security scanning, and DevSecOps best practices Cloud-Based Database Security More ❯
Engineer, you will be part of a dynamic team responsible for designing, implementing, and managing cloud solutions for our prestigious clients. You will leverage your expertise in Azure and Infrastructure as Code to deliver robust, scalable, and secure cloud environments. This role offers significant opportunities for professional growth and the chance to work on some of the largest, most …/Experience Essential Minimum of 3 years' extensive experience with Azure cloud services, including advanced resource management, networking, and security implementations 2 years' hands-on experience with Terraform and Infrastructure as Code methodologies in enterprise environments Proven track record of architecting and implementing complex cloud solutions at scale Advanced troubleshooting skills with the ability to diagnose and resolve intricate … technical issues in production environments Strong knowledge in at least one scripting/programming language (PowerShell, Python, Bash) with demonstrable complex automation implementations Experience designing and implementing robust monitoring solutions and advanced alerting strategies Strong background in cloud security principles and implementation of security controls in Azure environments Demonstrated ability to lead technical discussions with stakeholders and translate business More ❯
Engineer, you will be part of a dynamic team responsible for designing, implementing, and managing cloud solutions for our prestigious clients. You will leverage your expertise in Azure and Infrastructure as Code to deliver robust, scalable, and secure cloud environments. This role offers significant opportunities for professional growth and the chance to work on some of the largest, most …/Experience Essential Minimum of 3 years' extensive experience with Azure cloud services, including advanced resource management, networking, and security implementations 2 years' hands-on experience with Terraform and Infrastructure as Code methodologies in enterprise environments Proven track record of architecting and implementing complex cloud solutions at scale Advanced troubleshooting skills with the ability to diagnose and resolve intricate … technical issues in production environments Strong knowledge in at least one scripting/programming language (PowerShell, Python, Bash) with demonstrable complex automation implementations Experience designing and implementing robust monitoring solutions and advanced alerting strategies Strong background in cloud security principles and implementation of security controls in Azure environments Demonstrated ability to lead technical discussions with stakeholders and translate business More ❯
Engineer, you will be part of a dynamic team responsible for, implementing, managing and maintaining cloud solutions for our prestigious clients. You will leverage your expertise in Azure and Infrastructure as Code to deliver robust, scalable, and secure cloud environments. This role offers significant opportunities for professional growth and the chance to work on some of the largest, most … industry. Key Skills/Experience Essential Minimum of 18 months experience with Azure cloud services, including advanced resource management, networking, and security implementations Hands-on experience with Terraform and Infrastructure as Code methodologies in enterprise environments, ability to create and modify infrastructure. Strong troubleshooting skills with the ability to diagnose and resolve intricate technical issues in production environments Experience … in at least one scripting/programming language (PowerShell, Python, Bash) with demonstrable complex automation implementations Experience implementing robust monitoring solutions and advanced alerting strategies Strong background in cloud security principles and implementation of security controls in Azure environments Experience mentoring team mates and contributing to team knowledge growth Strong analytical mindset with exceptional problem-solving abilities Must be More ❯
solutions for third-party products Experience with Agile and DevOps methodologies Experience with Linux operating system Experience with relational and NoSQL databases (Postgres, Dynamo and others) Familiarity with application monitoring, infrastructuremonitoring and log aggregation tools like Datadog. Experience managing source code control and CI/CD tools like GitHub, Jenkins or similar Experience with developing multi More ❯
solutions for third-party products Experience with Agile and DevOps methodologies Experience with Linux operating system Experience with relational and NoSQL databases (Postgres, Dynamo and others) Familiarity with application monitoring, infrastructuremonitoring and log aggregation tools like Datadog. Experience managing source code control and CI/CD tools like GitHub, Jenkins or similar Experience with developing multi More ❯
London, England, United Kingdom Hybrid / WFH Options
Story Terrace Inc
we are already serving some of the world’s largest companies, providing them with superior credit protection and innovative risk management technology About the role: We’re seeking an Infrastructure Engineerto join our technical team. This role is ideal for someone with 2+ years of experience who is passionate about maintaining and improving robust infrastructure systems while contributing … to data management initiatives. You’ll work closely with our CTO and engineering team to ensure our infrastructure remains reliable, cost-efficient, and adaptable to meet the needs of high-value users. This is a fantastic opportunity to join a fast-growing, well-funded start-up that is disrupting the global credit insurance industry, growing and developing with the … company. Responsibilities: Maintain and improve existing infrastructure built on AWS ECS with observability tools in place Improve automation for deployments and infrastructure management Collaborate with development teams to streamline the CI/CD pipeline Maintain and enhance monitoring and alerting systems Monitor system performance to ensure reliability for high-value users Proactively manage alerts, licensing, costs and More ❯
London, England, United Kingdom Hybrid / WFH Options
IdentityE2E
Key Responsibilities Lead and manage the Operational Acceptance Testing (OAT) process, ensuring systems meet operational and business requirements before deployment. Define OAT test strategies, plans, and criteria to validate infrastructure, system reliability, failover mechanisms, and non-functional requirements. Collaborate with DevOps, Infrastructure, and Release Management teams to ensure smooth transition from development to production on AWS cloud. Conduct … Expertise 5+ years of experience in Quality Assurance, or Operational Testing. Deep understanding of OAT methodologies, including but not limited to the following core areas: Build verification Alert and Monitoring Backup and Restore Resilience and Recovery Logging, Audit and House Keeping Release Management Work Instructions OWASP "top ten" security tests Experience with CI/CD pipelines, Jenkins, and test … automation frameworks. Knowledge of cloud platforms (AWS/Azure), infrastructuremonitoring, and IT service management. Familiarity with databases, Linux environments, and network protocols. Experience in working with Agile & DevOps methodologies. Strong analytical and problem-solving skills with attention to detail. Excellent communication and stakeholder management abilities. Why Apply? We invest in your growth with an ACloudGuru subscription and More ❯
London, England, United Kingdom Hybrid / WFH Options
Ikerian
are collectively shaping the future of healthcare. We are looking for a Cloud Engineer to join our team, where you will play a critical role in optimising our cloud infrastructure to meet our customers' needs. This position is pivotal to our mission of providing a seamless, secure, and scalable experience on our platform About Us Ikerian AG (formerly RetinAI … are collectively shaping the future of healthcare. We are looking for a Cloud Engineer to join our team, where you will play a critical role in optimising our cloud infrastructure to meet our customers' needs. This position is pivotal to our mission of providing a seamless, secure, and scalable experience on our platform Key Responsibilities Enhance and maintain highly … available, secure, and scalable AWS cloud environments and services. Manage and prioritise tasks in the cloud infrastructure backlog to address immediate needs and plan long-term improvements. Set up infrastructuremonitoring and observability solutions, proactively addressing availability, performance or security issues. Assess new technologies, systems, and services for production readiness, ensuring seamless and stable integration. Prepare and More ❯
London, England, United Kingdom Hybrid / WFH Options
Cloud Decisions
InfrastructureMonitoring & Support Engineer - 3 Positions Available! To £45,000 + Excellent Benefits (Negotiable for the right person) Hybrid Role with an Exceptional Number of Days off Work Location: London (SE1) or Newport in Wales (NP10) Enterprise Microsoft Consulting Partner | Rapidly growing organization - 50% revenue growth last year | Enviable Company Culture with an Inclusive and Fun Atmosphere “Without … for enterprise level customers, this diverse and multi-skilled partner have fully submerged themselves within the Cloud, giving you exposure to a wide range of cutting-edge technologies. “The InfrastructureMonitoring Engineer – The heartbeat of the support team” The InfrastructureMonitoring Team is responsible for maintaining the health of the organization's estate (both customer and … internal). This is achieved by maintaining a 24/7/365 engineer presence and responding to proactive and real-time alerting from a Remote Monitoring and Management (RMM) platform and carrying our investigation and remediation for areas covering availability, network health, storage capacity, antivirus, patching, backup jobs, performance, hardware issues and other infrastructure related issues. In More ❯
combine Appliance and Datacenter server environments regardless of the underlying infrastructure. The team uses the latest automation technologies to reduce operational toil, improve the stability of our mission critical infrastructure services, and to facilitate maintenance of Bloomberg's server fleet at scale. You'll work closely with Software Developers, Frontline teams, and other key stakeholders to ensure that we … flow charts & procedures, perform capacity planning, and implement changes Diagnose and resolve critical system issues Continuously refine processes and procedures with a focus on standardization and automation. Enhance our monitoring and alerting solutions You’ll Need To Have: Experience programming in Python (or related language) and a good understanding of software development methodologies, open source systems, and familiarity with … and share knowledge We’d Love To See: RHEL System Administrator level of competency An understanding of some or all of the following: configuration management, orchestration, CI/CD, infrastructuremonitoring and telemetry Experience using Agile (e.g. Kanban or Scrum) Familiarity with telemetry tools such as Splunk, Grafana Experience with Web frameworks (BENTO, REACT, Angular, DJANGO) Bloomberg is More ❯
Senior Infrastructure Engineer - Managed Systems Operability Location London Business Area Engineering and CTO Ref # 10043720 Description & Requirements As a Systems Operability Engineer, you will join a highly motivated, experienced, and dynamic team responsible for providing automatic remediation solutions, stability efforts, and self-service tooling to manage Bloomberg’s server fleet. Our team offers a low-friction customer experience … developing tools that unify Appliance and Datacenter server environments, regardless of underlying infrastructure. We leverage the latest automation technologies to reduce operational toil, enhance the stability of mission-critical infrastructure services, and facilitate large-scale maintenance of Bloomberg's server fleet. You will collaborate closely with Software Developers, Frontline teams, and other stakeholders to ensure alignment with business strategies … Draft process flowcharts & procedures, perform capacity planning, and implement changes Diagnose and resolve critical system issues Continuously refine processes and procedures with a focus on standardization and automation Enhance monitoring and alerting solutions You’ll Need To Have: Programming experience in Python (or related languages) and a good understanding of software development methodologies, open-source systems, and familiarity with More ❯
Social network you want to login/join with: Observability/Monitoring Engineer - Grafana Dashboarding, london col-narrow-left Client: Location: london, United Kingdom Job Category: Other - EU work permit required: Yes col-narrow-right Job Views: 4 Posted: 26.06.2025 Expiry Date: 10.08.2025 col-wide Job Description: We’re seeking an experienced contractor to support the delivery of observability … solutions for a new, large-scale infrastructure environment. This role focuses on developing insightful and automated Grafana dashboards, with a strong emphasis on data integration and actionable telemetry. Required Skills Excellent, concise communication skills - essential for collaborating with technical teams to shape observability outputs. Deep experience with Grafana dashboard creation, templating, and performance optimization. Strong understanding of PromQL, VictoriaMetrics … Bonus/Nice-to-Have Skills: Experience deploying Grafana instances via code (provisioning dashboards, datasources). Familiarity with OpenTelemetry, metric instrumentation, and telemetry pipelines. Background in data center environments, infrastructuremonitoring, or SRE practices. Exposure to CI/CD workflows, containers (Podman/Docker), and cloud-native systems. #J-18808-Ljbffr More ❯
We’re seeking an experienced contractor to support the delivery of observability solutions for a new, large-scale infrastructure environment. This role focuses on developing insightful and automated Grafana dashboards, with a strong emphasis on data integration and actionable telemetry. Required Skills Excellent, concise communication skills - essential for collaborating with technical teams to shape observability outputs. Deep experience with … Bonus/Nice-to-Have Skills: Experience deploying Grafana instances via code (provisioning dashboards, datasources). Familiarity with OpenTelemetry, metric instrumentation, and telemetry pipelines. Background in data center environments, infrastructuremonitoring, or SRE practices. Exposure to CI/CD workflows, containers (Podman/Docker), and cloud-native systems. More ❯
We’re seeking an experienced contractor to support the delivery of observability solutions for a new, large-scale infrastructure environment. This role focuses on developing insightful and automated Grafana dashboards, with a strong emphasis on data integration and actionable telemetry. Required Skills Excellent, concise communication skills - essential for collaborating with technical teams to shape observability outputs. Deep experience with … Bonus/Nice-to-Have Skills: Experience deploying Grafana instances via code (provisioning dashboards, datasources). Familiarity with OpenTelemetry, metric instrumentation, and telemetry pipelines. Background in data center environments, infrastructuremonitoring, or SRE practices. Exposure to CI/CD workflows, containers (Podman/Docker), and cloud-native systems. More ❯
We’re seeking an experienced contractor to support the delivery of observability solutions for a new, large-scale infrastructure environment. This role focuses on developing insightful and automated Grafana dashboards, with a strong emphasis on data integration and actionable telemetry. Required Skills Excellent, concise communication skills - essential for collaborating with technical teams to shape observability outputs. Deep experience with … Bonus/Nice-to-Have Skills: Experience deploying Grafana instances via code (provisioning dashboards, datasources). Familiarity with OpenTelemetry, metric instrumentation, and telemetry pipelines. Background in data center environments, infrastructuremonitoring, or SRE practices. Exposure to CI/CD workflows, containers (Podman/Docker), and cloud-native systems. More ❯
Alexander Ash is seeking Network Operations Engineers on behalf of our global, technology-driven financial services client. As part of the global infrastructure team, you will play a key role in managing and maintaining the firm’s network infrastructure and automation systems. The team’s mission is to ensure the availability, reliability, and security of the platform while … driving operational excellence. Key responsibilities include: Supporting and enhancing existing network infrastructure Developing observability tools and self-healing/event-driven automation Performing advanced troubleshooting and incident resolution Contributing to the evolution of a high-performance compute datacentre Skills Required: Proficient in monitoring and resolving incidents across diverse environments Strong diagnostic skills in network infrastructure, collaborating closely … with vendor support teams for in-depth investigations when needed Lead the creation and enhancement of monitoring dashboards, proactively addressing alerts based on priority and driving continuous improvements in alerting mechanisms and system observability Facilitate post-incident reviews to identify and implement improvements that enhance infrastructure reliability and availability Implement BAU changes with a focus on automation, fostering More ❯
Alexander Ash is seeking Network Operations Engineers on behalf of our global, technology-driven financial services client. As part of the global infrastructure team, you will play a key role in managing and maintaining the firm’s network infrastructure and automation systems. The team’s mission is to ensure the availability, reliability, and security of the platform while … driving operational excellence. Key responsibilities include: Supporting and enhancing existing network infrastructure Developing observability tools and self-healing/event-driven automation Performing advanced troubleshooting and incident resolution Contributing to the evolution of a high-performance compute datacentre Skills Required: Proficient in monitoring and resolving incidents across diverse environments Strong diagnostic skills in network infrastructure, collaborating closely … with vendor support teams for in-depth investigations when needed Lead the creation and enhancement of monitoring dashboards, proactively addressing alerts based on priority and driving continuous improvements in alerting mechanisms and system observability Facilitate post-incident reviews to identify and implement improvements that enhance infrastructure reliability and availability Implement BAU changes with a focus on automation, fostering More ❯
data access and management at scale with AI. These solutions include Asset and Risk intelligence, Attack Analysis, Orchestration Automation and Response, User Behavior Analytics, SIEM Enterprise Security, Application Performance Monitoring, InfrastructureMonitoring, Log Analysis, Incident Response, Network Monitoring, Business Risk Observability, AIOps, Digital Experience Monitoring and the ecosystem continues to expand and integrate at a … demonstration skills with an ability to use stories effectively. Ability to actively listen. Goodorganisational, prioritisation and time management skills. Technical Knowledge: Linux or Windows knowledge Basic network connectivity troubleshooting Infrastructure as aService platforms: AWS, GCP and/or Azure IT architecture concepts such as High Availability, Disaster Recovery Desirable Knowledge and Experience: Experience selling SaaS services or data solutions More ❯
Merton, Wisconsin, United States Hybrid / WFH Options
QuadMed
related technologies and relevant processes. This role is ideally filled by someone who is adaptive, organized, strong in project management, has a customer service focus and background in technical infrastructure, networking, hardware and desktop support. The role involves overseeing one or more infrastructure or support teams, fostering a collaborative and positive environment and making decisions that balance the … continuous improvement and world-class customer service to develop and lead team to build Raving Fans through exemplary service and follow through. Accountable for all health center and corporate infrastructure, ensuring uptime, solution delivery, IT inventory and lifecycle management and adherence to site checklists and security protocols. Responsible for annual client infrastructure budget planning, presentation and adherence. Collaborate … with IT, Facilities & Implementation and Operations on setting health center hardware/technology/infrastructure standards, RFP proposals, implementations and closures. Facilitate Big Room Planning and Agile functions for the team, including capacity planning, tracking, and workload management, in partnership with Quad and QuadMed IT. Focus on IT Scorecard metrics and mentor team on establishing and performing proper infrastructureMore ❯
Responsibilities: Provision and rollout of end user technologies (both hardware and software). Take ownership of incidents and projects, liaise with other team members and third parties when required. Infrastructuremonitoring, maintenance and support, cloud environment administration and support. Continual service improvement and project proposals, design, trials, and delivery. Attention to underpinning documentation (policy, process, procedure and best More ❯