procedures. Implement proactive monitoring measures to detect and prevent issues.

Monitor & Troubleshoot
Troubleshoot system issues using logs, monitoring tools, and a methodical approach. Oversee and enhance system monitoring with Nagios, with a transition to Datadog.

Incident Management
Support incident management processes, including post-mortems and follow-up actions. Communicate outcomes with customers clearly and effectively.

What We’re Looking For
… security best practices. Version control experience (e.g., Git). Strong troubleshooting and root cause analysis skills.

Desirable Skills
Experience with Kubernetes and/or other cloud platforms. Familiarity with Nagios, Datadog, or similar monitoring tools. Exposure to CI/CD systems such as TeamCity, AWS CodeBuild, AWS CodePipeline, or ArgoCD.

Personal Attributes
Proactive, curious, and process-driven. Enjoys collaboration and …