to support low latency applications. * Practical and working knowledge of IP networking and data flow within distributed systems. * Experience integrating ELK with packet capture/analysis tools to enhance observability of real-time systems. * Hands-on expertise in data pipeline creation, ingestion strategies, and performance tuning of Logstash and Beats for scalable telemetry. * Knowledge of Corvil and/or Pico … a deep understanding of the demands of high-frequency and algorithmic trading environments. * Working knowledge of security best practices, including RBAC, TLS, and audit logging in Elasticsearch. * Experience with observability platforms such as ITRS Geneos and their integration with ELK is a strong plus. * Comfortable with multi-site replication, cross-cluster search, and disaster recovery configurations for global deployment. * Strong … understanding of Linux systems, containers, and cloud-native observability stacks. * Organized, self-driven, and able to manage priorities in a dynamic, high-performance trading environment.
with the Enterprise Monitoring & Alerting (EMAS) team to deliver a transformative initiative aimed at maximising Dynatrace capabilities. We are looking for a skilled Dynatrace Admin/Consultant to enable observability across complex, hybrid cloud environments. Essential Skills: To thrive in this role, you must demonstrate extensive experience in designing and configuring within Dynatrace, including: Application Performance Monitoring Proficiency in Dynatrace … assets for monitoring. Work with EMAS to analyse Dynatrace coverage of these critical assets. Identify opportunities for enhancement in monitoring configurations across crucial applications. Review roles and responsibilities concerning observability and propose improvements focused on Operational Resilience. Contribute to establishing an automated end-to-end business flow for key business processes within the Dynatrace toolset. Ensure optimal alerting configurations in … collaboration with Application Stewards and SREs. Participate in workshops with third-party software suppliers to review observability standards. Bonus Points: Skills in correlating events across the full stack for root cause analysis. Key Attributes: Ability to manage competing priorities in a fast-paced environment. Flexibility and a pragmatic approach to problem-solving. A delivery-oriented mindset coupled with a can
hybrid 2/3 days onsite Rate: up to £570 p/d Umbrella inside IR35 Role Overview Our client is looking for a Splunk ITSI expert with expertise in building Observability level 4, including managing events and component-level metrics, metrics, logs and traces, time-series topology, and AI/ML models, to achieve alerts, comprehensive dashboards, root cause analysis, business … impact analysis, and predictive and preventative insights. Qualifications: 10+ years' experience building observability level 4. Expertise in the Splunk platform, including Essentials, Enterprise Security, and ITSI. Strong understanding of ITSM concepts (ITIL, ServiceNow) and service management processes. Proficiency in scripting and programming languages like Python, Shell Scripting, or Perl (beneficial). Excellent analytical and problem-solving skills. Solid understanding of
Northampton, England, United Kingdom Hybrid / WFH Options
eTeam
you. Role Title: Splunk ITSI Expert Location: Birmingham/Sheffield - Hybrid Duration: 28/11/2025 Rate: £559/day (Inside IR35) Role Description: Responsibilities: • Expertise in building Observability level 4, including managing events and component-level metrics, metrics, logs and traces, time-series topology, and AI/ML models, to achieve alerts, comprehensive dashboards, root cause analysis, business … impact analysis, and predictive and preventative insights. Qualifications: 10+ years’ experience building observability level 4. Expertise in the Splunk platform, including Essentials, Enterprise Security, and ITSI. Strong understanding of ITSM concepts (ITIL, ServiceNow) and service management processes. Proficiency in scripting and programming languages like Python, Shell Scripting, or Perl (beneficial). Excellent analytical and problem-solving skills. Solid understanding of
Kettering, Midlands, United Kingdom Hybrid / WFH Options
eTeam
you. Role Title: Splunk ITSI Expert Location: Birmingham/Sheffield - Hybrid Duration: 28/11/2025 Rate: £559/day (Inside IR35) Role Description: Responsibilities: • Expertise in building Observability level 4, including managing events and component-level metrics, metrics, logs and traces, time-series topology, and AI/ML models, to achieve alerts, comprehensive dashboards, root cause analysis, business … impact analysis, and predictive and preventative insights. Qualifications: 10+ years’ experience building observability level 4. Expertise in the Splunk platform, including Essentials, Enterprise Security, and ITSI. Strong understanding of ITSM concepts (ITIL, ServiceNow) and service management processes. Proficiency in scripting and programming languages like Python, Shell Scripting, or Perl (beneficial). Excellent analytical and problem-solving skills. Solid understanding of
Milton Keynes, South East England, United Kingdom Hybrid / WFH Options
eTeam
you. Role Title: Splunk ITSI Expert Location: Birmingham/Sheffield - Hybrid Duration: 28/11/2025 Rate: £559/day (Inside IR35) Role Description: Responsibilities: • Expertise in building Observability level 4, including managing events and component-level metrics, metrics, logs and traces, time-series topology, and AI/ML models, to achieve alerts, comprehensive dashboards, root cause analysis, business … impact analysis, and predictive and preventative insights. Qualifications: 10+ years’ experience building observability level 4. Expertise in the Splunk platform, including Essentials, Enterprise Security, and ITSI. Strong understanding of ITSM concepts (ITIL, ServiceNow) and service management processes. Proficiency in scripting and programming languages like Python, Shell Scripting, or Perl (beneficial). Excellent analytical and problem-solving skills. Solid understanding of
have a strong grounding in SOLID principles, OOP, and TDD (NUnit preferred), alongside advanced Git skills. Nice-to-haves include experience with Entity Framework, CI/CD pipelines, Azure, observability practices, and diagnosing production issues. Benefits 25 days holiday + bank holidays + birthday off (increases with service) £500 annual training budget + 2 conference days per year Profit share … this vacancy only. We look forward to hearing from you! Key Skills: C#, .NET 6+, SQL Server, REST APIs, GraphQL, Git, Agile, TDD (NUnit), Entity Framework, Azure, Design Patterns, Observability, Security Best Practices
ensuring alignment with business needs and regulatory requirements. Own the design and optimisation of the Azure Databricks platform, focusing on performance, resilience, and compliance. Implement change controls, lineage, and observability to detect and resolve issues. Maintain clear documentation of data pipelines, models, and governance processes. Implement tools to improve data quality, efficiency, and adoption. Support AI/ML delivery by … in Azure and Databricks Advanced SQL skills with experience in large-scale environments Proven track record with lineage, documentation, and governance (Unity Catalog, Purview) Experience implementing data quality and observability controls Familiarity with ITIL change and incident management Hands-on experience enabling AI/ML in a data platform Strong ETL/ELT engineering skills Desirable Experience with Python and
deliver impact. Write clean, testable, and maintainable code with a focus on developer-driven quality. Apply strong CS fundamentals to design scalable, reliable, and efficient systems. Contribute to monitoring, observability, and performance optimisation of production systems. Work closely with multidisciplinary teams, sharing knowledge and solving problems collaboratively. Adapt quickly to changing priorities, delivering high-quality results at pace. What We … at scale. Full-stack experience or deep expertise in backend development. Strong CS fundamentals and fluency in Python. Experience owning and operating complex systems in production. Clear grasp of observability, monitoring, and performance tuning. Commitment to writing high-quality, testable code and improving engineering practices. Relevant degree (e.g. Computer Science, Mathematics, Engineering or similar). Pragmatic, collaborative mindset with strong
mentorship on the practical application of AI/LLM techniques, defining best practices for everything from prompt engineering to fine-tuning and RAG. Ensure Reliability: Implement and own LLM observability and evaluation systems to ensure our AI features are reliable, performant, and continuously improving. Collaborate Cross-Functionally: Work closely with data scientists, platform engineers, and research teams to build cohesive … platforms, or reinforcement learning from human feedback (RLHF). An interest or experience in synthetic data, human-in-the-loop systems, or AI alignment. Experience building robust monitoring and observability for AI/ML systems in production. Why Prolific is a great place to work We've built a unique platform that connects researchers and companies with a global pool
A client of Innova Solutions is immediately hiring a Remediation Engineer. Position Type: Full-Time, Contract Duration: 6+ months Location: Maryland Heights, MO 63043 (Hybrid) As a Remediation Engineer, you will spend most of your time analyzing
Northbrook, Illinois, United States Hybrid / WFH Options
Medline Industries - Transportation & Operations
Job Summary The Splunk System Administrator will manage, maintain and support an enterprise-level Splunk Cloud installation. You will act as the Splunk Subject Matter Expert (SME), configure and maintain Splunk, analyze performance data, and provide insights to improve monitoring of Medline
Senior Software Engineer - Robotics (C++/Python) Location: Greater Bristol Area, UK (On-site) Type: Full-time, Permanent Salary: £65,000-£95,000 + equity + benefits About the Opportunity Are you a Senior Software Engineer with strong C++ and
Brighton, East Sussex, England, United Kingdom Hybrid / WFH Options
Harnham - Data & Analytics Recruitment
Data Engineering Manager Hybrid | Brighton (Once Per Week) Up to £90k + Benefits Are you an experienced Data Engineering Manager with a passion for leading high-performing teams and shaping modern data infrastructure? I'm working with an exciting, fast
Per Day Inside IR35 To apply, email: THE OPPORTUNITY We are looking for a Dynatrace Subject Matter Expert (Data Resilience). You'll play a critical role in improving observability, resilience, and performance monitoring across hybrid cloud environments using the Dynatrace platform. THE ROLE Collaborate with Application Stewards & Site Reliability Engineers (SREs) to confirm monitoring requirements for critical assets. Analyse … optimise, and automate monitoring within the Dynatrace toolset. Provide AI-driven insights (via Davis AI) for anomaly detection, root cause analysis, and proactive recommendations. Support operational resilience by embedding observability standards and best practices. Engage in workshops with third-party suppliers to review and improve observability standards. TECH STACK/REQUIREMENTS Strong expertise in Dynatrace (SaaS & On-Premises) including: Application … Anomaly Detection Profiles Alerting Rules & Profiles Synthetic & Log Monitoring Real User Monitoring (RUM) DQL & Grail for advanced data analytics API integrations with complex systems Experience working in resilience/observability engineering. Strong communication skills and ability to work in high-pressure environments. Flexible, pragmatic, and delivery-focused with a can-do attitude. Experience in Financial Services highly desirable Hands on
Overview Intercom is the AI Customer Service company on a mission to help businesses provide incredible customer experiences. Our AI agent Fin, the most advanced customer service AI agent on the market, lets businesses deliver always-on, impeccable customer service
Brooklyn Park, Minnesota, United States Hybrid / WFH Options
Innova
A client of Innova Solutions is immediately hiring for a Senior Native App Engineer position. Position type: Full-time Contract Duration: 6 months Location: Minneapolis, MN (Hybrid) Job Description: We're looking for a Senior Native App Engineer to help shape
We're helping organizations deploy supergraphs at scale using Apollo Federation. Are you passionate about building secure, high-performance, and reliable infrastructure? Do you thrive on designing systems that meet the needs of the most demanding GraphQL workloads in the
Location: UK (occasional on-site) Duration: 6 months (initial) IR35: Inside IR35 Start Date: Immediate Overview: We are seeking an experienced Dynatrace SME to join a major monitoring and observability programme with a globally recognised technology leader. This role will suit someone with strong Dynatrace and DQL expertise, capable of driving performance monitoring strategies across complex, distributed environments. Key Responsibilities … Act as the subject matter expert for Dynatrace monitoring and observability Develop and optimise DQL queries, dashboards, and reports for actionable insights Define and implement end-to-end monitoring strategies to support application and infrastructure performance Drive best practices around alerting, automation, and root cause analysis Collaborate with engineering, operations, and business stakeholders to ensure adoption of monitoring solutions Support … hands-on experience with Dynatrace, including advanced DQL skills Proven track record in application and infrastructure monitoring within complex, distributed environments Experience in designing dashboards, performance monitoring frameworks, and observability solutions Knowledge of incident management and performance tuning best practices Strong communication skills and ability to influence stakeholders What’s on Offer: Initial 6-month contract with potential extension Opportunity
Coralogix is a modern, full-stack observability platform transforming how businesses process and understand their data. Our unique architecture powers in-stream analytics without reliance on expensive indexing or hot storage. We specialize in comprehensive monitoring of logs, metrics, traces, and security events with features such as APM, RUM, SIEM, Kubernetes monitoring, and more, enhancing operational efficiency and reducing observability … strategic accounts, and collaborate closely with Sales, SEs, TAMs, Support, and R&D. The ideal candidate brings both strong project leadership skills and a deep understanding of SaaS or observability customer onboarding. Responsibilities: Lead, mentor, and grow a global team of Onboarding Project Managers across regions and verticals Define and monitor KPIs for onboarding success, time-to-value, and … Salesforce, Jira, Confluence, and Intercom Excellent interpersonal skills with the ability to lead both technical teams and engage directly with customer stakeholders Experience working in B2B software; experience in observability/security platforms - huge advantage PMP or similar project management certification - advantage Customer-centric & Collaboration-driven Mindset Passion for delivering exceptional customer experiences and measurable outcomes Ability to coach teams
high-performance connectivity for over 50,000 users and their devices through the implementation of VPN, SASE, and Zero-Trust Network Access (ZTNA) technologies. With a focus on automation, observability, and user experience, this position plays a vital role in ensuring seamless and secure connectivity across third-party, hybrid, and internal environments. Key Responsibilities: Strategy & Architecture Leadership: Define and lead … provisioning, policy enforcement, and configuration management through Infrastructure as Code (IaC) and zero-touch deployment practices. Apply Site Reliability Engineering (SRE) principles to improve performance, availability, and troubleshooting. Establish observability practices across all access points with real-time metrics, logs, and telemetry. Security, Compliance & Governance: Ensure compliance with corporate security and privacy policies through proactive monitoring and enforcement. Partner with … trust architecture, network segmentation, and endpoint-based access control. Proven ability to scale remote connectivity solutions to tens of thousands of users and devices. Experience with IaC, network automation, observability tooling, and SRE methodologies. Preferred Qualifications: Certifications such as CCNP, CCIE, PCNSE, Zscaler Certified, or equivalent. Familiarity with secure hybrid work and cloud networking models. Background in network performance optimization
Microsoft Defender, Purview, Symantec, CrowdStrike, or equivalent. Platform Integration & Automation: Drive automation for device onboarding, compliance validation, and health monitoring. Ensure endpoint telemetry is integrated with SIEM platforms and observability stacks. Partner with platform and infrastructure teams to enforce secure device baselines and policies. Build self-healing, zero-trust-aligned architectures for secure device management. Observability & Event Management: Implement real … time observability of endpoint health, risk exposure, and threat posture. Integrate with cybersecurity event and incident management pipelines for early detection and rapid response. Collaborate with the cyber and incident response teams to streamline investigation and containment. Ensure high-fidelity logging and alerting from all edge device sources. Compliance & Risk Management: Ensure devices meet internal policies and external regulatory standards … implementing and operating tools such as Microsoft Defender, Purview, Symantec, or CrowdStrike. Deep technical understanding of endpoint protection, mobile security, and compliance tooling. Strong background in automation, scripting, and observability practices. Experience working with large, global device fleets and BYOD models. Preferred Qualifications: Familiarity with zero-trust security models and endpoint detection and response (EDR). Certifications such as CISSP