Remote Distributed Applications Jobs in Southampton

2 of 2 Remote Distributed Applications Jobs in Southampton

Site Reliability Engineer

Southampton, Hampshire, United Kingdom
Hybrid / WFH Options
NICE
Run the production environment by monitoring availability and taking a holistic view of system health Build software and systems to manage platform infrastructure and applications Improve reliability, quality, and time-to-market of our suite of software solutions Measure and optimize system performance, with an eye toward pushing our … capabilities forward, getting ahead of customer needs, and innovating to continually improve Provide primary operational support and engineering for multiple large distributed software applications How will you make an impact? Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault … with monitoring and observability tools (e.g., Prometheus, Grafana, ELK stack, Cloudwatch). Excellent problem-solving skills and the ability to troubleshoot complex issues in distributed systems. Experience of Incident management and blameless postmortems that includes driving the incident response efforts during outages and other critical incidents, resolution, and communication More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Site Reliability Engineer

Southampton, Hampshire, South East, United Kingdom
Hybrid / WFH Options
Spectrum It Recruitment Limited
to anticipate user demands and drive innovation. Additionally, you'll take the lead in providing operational support and technical oversight for several large-scale distributed applications. How You'll Contribute: Monitor and interpret system and application metrics to fine-tune performance and troubleshoot issues effectively Collaborate closely with developers … Skilled in using observability and monitoring tools such as Prometheus, Grafana, ELK stack, or AWS CloudWatch Excellent analytical and troubleshooting abilities, especially within complex distributed systems Proven experience handling incident management and conducting blameless postmortems, including leading cross-functional teams through resolution and communication during critical outages Benefits Life More ❯
Employment Type: Permanent, Work From Home
Posted: