learning model training as well as large scale inferencing in a Kubernetes environment. Responsibilities: Experience using the Linux CLI and Linux tools Experience developing with Python Experience with developing highavailability APIs and microservices Experience with container technologies such as Docker and containerd Experience with container orchestration technologies such as Kubernetes Experience creating Helm Charts for Kubernetes Experience More ❯
Kubernetes, as well as Azure Cloud, Azure SQL Database, and Oracle Database with PL/SQL. What tasks await you? Administration, configuration, and implementation of databases in a demanding high-availability environment (Oracle) Planning and provisioning of new databases Automation of existing processes Performing database migrations, patches, and upgrades Consulting and support of projects Ensuring that storage and More ❯
for obtaining results; act independently on new or special assignments and mentor others. Manage the day-to-day operations of the cloud infrastructures for Aptean's Business Systems, Applications, HighAvailability, and BAU. Continuously examine and re-evaluate existing systems to provide efficiencies within the cloud infrastructures. Maintain an efficient environment to keep BAU costs to a minimum. More ❯
date and aligned with industry and internal standards Implement and manage network access control using Cisco ISE for 802.1x and TACACS Manage F5 Big-IP load balancers to ensure highavailability and performance of critical applications Maintain and troubleshoot BGP and OSPF routing protocols and participate in the design and implementation of routing and switching solutions Monitor the More ❯
License Bulk Location New York Business Area Engineering and CTO Ref # Description & Requirements Description & Requirements Are you a software engineer interested in owning, developing and improving mission-critical high-throughput systems that process hundreds of billions of data points daily? Do you want to support Bloomberg's Enterprise Data business, which is situated at the intersection of rapid … growth and high revenue? Would you love to work with technologies such as Python, C++, Celery, RabbitMQ, Kafka, and Comdb2? Then we want you in the Data License Bulk team at Bloomberg! Bloomberg Enterprise Data provides financial institutions a platform to make millions of requests per day for a wide range of financial data. Whether it's ESG, Company … This translates into hundreds of billions of data points flowing through the Data License Bulk Platform daily, that are carved into thousands of financial datasets. With a focus on highavailability, low latency, and resiliency, the Data License Bulk Platform provides a set of managed, highly distributed workflows to create, verify, and deliver these datasets to thousands of More ❯
Lead and mentor the IT systems team, assigning tasks, ensuring adherence to best practices, and fostering professional development. Provide advanced technical support and guidance for system-related issues, ensuring high performance, security, and availability of IT systems. Coordinate and manage IT projects, including system upgrades, network improvements, and integration of new technologies, ensuring projects are delivered on time. … Skills/Experience): Extensive experience with both RedHat Linux and Microsoft Windows servers, including installation, configuration, administration, and troubleshooting. Proven ability to manage, optimize, and secure server environments, ensuring highavailability, performance, and scalability. Familiar with advanced features such as virtualization, automation, and patch management across both platforms, and capable of integrating these systems into a cohesive IT More ❯
REST API's, observability and monitoring, queue technologies and security. Detailed knowledge of best practices such as SOLID principles Experience of building new and evolving microservices with emphasis on highavailability and data integrity. Highly experienced in test methodologies including unit testing with JUnit, use of TDD, integration tests and end-to-end testing, as a way to … promote high quality codebases. Working with relational databases -Postgres would be of particular relevance. Supporting and developing complex architecture including APIs Willing to be part of a 24/7 L3 support team (on call) Management and resolution of tickets relating to incident, change, service request and problem processes in accordance with the agreed PPPT/DDaT operating models … in the organisation. AWS Kubernetes PHP/Drupal WordPress: Caching, CDN integration, and database tuning All profiles will be reviewed against the required skills and experience. Due to the high number of applications we will only be able to respond to successful applicants in the first instance. We thank you for your interest and the time taken to apply More ❯
t just write code-we build mission-driven systems that push the boundaries of what's possible. Join a collaborative team where your expertise in software engineering, DevOps, and high-performance computing contributes to real-world impact. Your ideas are valued, your skills are sharpened, and your contributions drive innovation that matters. What You'll Do: As a Software … Design, implement, and maintain CI/CD pipelines for C++ or Python applications using Kubernetes (K8s), Helm charts, and pod definition. • Kubernetes Administration: Manage and configure Kubernetes clusters for highavailability, scalability, and security. • Debugging and Defect Correction: Troubleshoot and resolve software defects with effective root cause analysis and debugging techniques. • GPU Configuration and Support: Configure and optimize … development processes, and interfaces to support long-term project continuity What You Bring: • Active TS/SCI clearance with Full Scope Polygraph • Experience with GPU/CUDA development for high-performance applications • Familiarity with message queue implementations and communication protocols • Proficiency with Linux system programming and development environments • Strong analytical and problem-solving mindset • Excellent verbal and written communication More ❯
What you'll be doing Operational Leadership: Lead the end-to-end operations of Planet's global network (LAN, WAN, WLAN, SD-WAN, Cloud, and Data Centre interconnects), ensuring highavailability, resilience, and service excellence. SLA & KPI Management: Define, measure, and continuously improve network service SLAs, KPIs, and operational benchmarks across availability, latency, performance, and incident resolution. … consolidation, and improved service efficiency. Automation & Monitoring: Drive adoption of network monitoring, observability, and automation tools to proactively detect, predict, and resolve issues. Team Leadership: Build and mentor a high-performing global Network Operations team, instilling a culture of accountability, service excellence, and continuous learning. Executive Reporting: Provide regular performance, SLA, and risk updates to the Vice President - Infra … AWS, Azure, GCP) environments. Strong track record of vendor management, contract negotiation, and driving vendor accountability through SLA enforcement. Demonstrated ability to improve and sustain operational KPIs and network availability in mission-critical environments. Deep knowledge of routing, switching, firewalls (FortiGate, Check Point), load balancers (F5), and SD-WAN technologies. Experience in managing ISP peering, public IP allocations, redundancy More ❯
architecture), data warehouse, data mart, SQL and NoSQL data stores. • Utilize Microsoft Azure services including Azure Data Factory, Synapse Pipelines, Apache Spark Notebooks, Python, SQL, stored procedures to develop high performing data pipelines. • Redevelop existing SSIS extract, transform, load scripts to Azure Data Factory and Synapse Pipelines. • Identify, create, prepare data required for advanced analytics, visualization, reporting, and AI …/ML. • Implement data migration, data integrity, data quality, metadata management, and data security functions to optimize data pipelines. • Monitor and troubleshoot data related issues to maintain highavailability and performance. • Implement governance, build, deployment and monitoring to automate platform operation. • Actively support Agile DevOps process, including Program Increment planning. • Actively engage in continuous learning to increase relevant More ❯
role in ensuring the smooth and efficient functioning and development of our databases and pipelines. You will be responsible for developing, optimising and managing these databases and ensuring a high-level of data quality and integrity. You will be defining the principles and processes that underpin data management while ensuring we maintain the highest standards for data management, security … databases. Automating the management of new and existing databases to ensure that processes are repeatable and scalable. Create and manage database schemas, tables, stored procedures, indexes, and triggers. Ensuring highavailability of databases through clustering, replication, and other techniques. Implement and maintain data pipelines and ETL processes for real-time and batch data ingestion. Monitor database performance and More ❯
Newton-Le-Willows, Merseyside, North West, United Kingdom Hybrid / WFH Options
Linaker Limited
role in ensuring the smooth and efficient functioning and development of our databases and pipelines. You will be responsible for developing, optimising and managing these databases and ensuring a high-level of data quality and integrity. You will be defining the principles and processes that underpin data management while ensuring we maintain the highest standards for data management, security … databases. Automating the management of new and existing databases to ensure that processes are repeatable and scalable. Create and manage database schemas, tables, stored procedures, indexes, and triggers. Ensuring highavailability of databases through clustering, replication, and other techniques. Implement and maintain data pipelines and ETL processes for real-time and batch data ingestion. Monitor database performance and More ❯
We are seeking an experienced AWS Architect with valid eDV clearance to design, build, and maintain secure, scalable, and high-performing cloud infrastructure. This role involves working across development and operations teams to enable resilient platforms through modern cloud practices, automation, and container orchestration. You will be instrumental in shaping enterprise-ready cloud solutions by applying deep technical expertise … Implement strong security and access controls using IAM, Vault, and Secrets Manager Enhance platform observability using Prometheus, Grafana, and ELK Stack Collaborate with cross-functional teams to deliver robust, high-availability solutions Key Skills & Experience Extensive hands-on experience with AWS (Azure knowledge beneficial) Expertise in Terraform, CloudFormation, and automation tooling Strong containerisation skills with Kubernetes, Docker, and More ❯
We are seeking an experienced AWS Architect with valid eDV clearance to design, build, and maintain secure, scalable, and high-performing cloud infrastructure. This role involves working across development and operations teams to enable resilient platforms through modern cloud practices, automation, and container orchestration. You will be instrumental in shaping enterprise-ready cloud solutions by applying deep technical expertise … Implement strong security and access controls using IAM, Vault, and Secrets Manager Enhance platform observability using Prometheus, Grafana, and ELK Stack Collaborate with cross-functional teams to deliver robust, high-availability solutions Key Skills & Experience Extensive hands-on experience with AWS (Azure knowledge beneficial) Expertise in Terraform, CloudFormation, and automation tooling Strong containerisation skills with Kubernetes, Docker, and More ❯
to redefine how engineering teams detect, investigate and remediate cloud native deployments at scale by creating the industry's first AI SRE assistant. Kubernetes promises agility, elasticity, reliability and highavailability, but it also introduces complexity, high operational overhead, and cost overruns due to over provisioning of workloads. Traditional observability only surfaces the "what" - Komodor goes further … running a complex sales process. Experience building positive professional relationships with DevOps and executive leadership Skilled in managing time and resources Possess aptitude to learn quickly and establish credibility. High EQ and self-aware Passionate about growing your career around an established market with a ton of momentum Developing and maintaining an in-depth understanding of the Komodor platform … a way. You default to " yes and " vs "no" You are Reliable. Your teammates and your customers can depend on you. Our product teams know you will give them high fidelity feedback on how to improve our product. You bias to action and deliver in a timely manner. You have a high EQ; you're able to build More ❯
SCIM for user provisioning and identity management, as well as SAML 2.0 and Single Sign-On protocols. Must be capable of effectively implementing and troubleshooting SSO and SCIM integrations. High-Quality Code: Proven track record of writing clean, testable, and maintainable code that meets high standards of software quality. A developer who consistently levels-up the code base. … Problem-Solving & Scalability: Strong problem-solving skills, with the ability to develop scalable and durable features in high-availability environments. Adaptability & Communication: Ability to thrive in a fast-paced, dynamic environment, with excellent communication skills to support both internal teams and external customers. Must be able to articulate complex concepts to both technical and non-technical audiences effectively. More ❯
AWS Collaborate with development teams to integrate their applications into the infrastructure Monitor and troubleshoot production systems and resolve issues as necessary Continuously improve processes and tools to ensure highavailability and performance Stay current with new technologies and industry trends, continuously exploring new ways to improve our infrastructure Requirements 9+ years of experience in DevOps Engineering or More ❯
IBM Maximo, Azure Cloud, and RedHat OpenShift to support a large-scale CMMS environment. This role focuses on the administration, configuration, maintenance, and performance optimization of enterprise systems, ensuring highavailability, security compliance, and operational efficiency. RESPONSIBILITIES: Administer, configure, and maintain the IBM Maximo Application Suite (MAS 9) within an Azure-hosted and OpenShift containerized environment. Perform upgrades More ❯
installation, upgrades, and support of software and hardware in line with military systems architecture and IT policies. Configure and maintain Active Directory, Group Policy, and Exchange environments for secure, high-availability operations. Support workstation imaging, deployment, and management using enterprise-level tools. Perform HBSS configuration and deployment, and support system compliance through IAVA, STIGs, and ACAS standards. Troubleshoot More ❯
written communication skills, with experience working in a collaborative team environment. U.S. citizenship with a final Secret clearance. Preferred Qualifications: Expertise in setting up and managing Kubernetes clusters for high-availability systems. Experience developing web and RESTful services. Proficiency in cross-platform management (Windows and Unix/Linux). Familiarity with virtual machine environments and test-driven development. More ❯
blockchain monitoring infrastructure, while collaborating closely with security researchers and core contributors to operationalize cutting-edge threat detections. Responsibilities Deploy and manage detection bots across the Forta network, ensuring highavailability and performance. Operate and maintain Forta node infrastructure, including scan nodes, alert relays, and API integrations. Implement monitoring and alerting systems to ensure uptime, performance, and reliability More ❯
As a Senior RHEL Engineer, you will be responsible for leading the deployment and management of Red Hat OpenShift clusters, ensuring highavailability and security. This role involves developing and maintaining automation scripts for system provisioning, configuration management, and deployment. Responsibilities: Architect and deploy containerized applications using Red Hat OpenShift, ensuring optimal performance and scalability. Implement and optimize More ❯
an onsite position in Chantilly, VA. The Forta Administrator is responsible for deploying, configuring, monitoring, and maintaining detection bots and supporting infrastructure on the Forta network. This individual ensures highavailability, performance, and reliability of our bot operations while assisting engineering and security teams in scaling monitoring efforts. Key Responsibilities Deploy and maintain detection bots on the Forta More ❯
Tamworth, Staffordshire, West Midlands, United Kingdom
Oscar Associates (UK) Limited
3rd line infrastructure support, resolving complex technical issues quickly and effectively. Maintaining on-premise and hybrid infrastructure, with involvement in cloud migrations (Azure preferred). Monitoring system performance, ensuring highavailability and security across all platforms. Leading infrastructure upgrades and migrations with minimal operational disruption. Collaborating with internal teams and stakeholders to align IT solutions with business operations. More ❯
low-side network services that support the program. The successful candidate will possess a strong understanding of network architecture, networking protocols, virtualization technologies and security measures essential for maintaining highavailability and performance in hybrid cloud environments and will collaborate with various teams to ensure seamless integration of cloud services, optimizing network infrastructure to support application performance and More ❯