pipelines • AWS S3 RDS Route 53 IAM EKS Secrets Manager ECR • Kubernetes Helm Kops Ingress/Egress • Terraform Deployment of AWS Resources Pipelines OCI • Observability ELK Dynatrace Prometheus • Others Vault Red Hat As an equal opportunities employer, we welcome applications from individuals of all backgrounds. However, for you to be eligible more »
stack developer, before shifting your focus to SRE/Platform Engineering (Java preferred) extensive experience with AWS, Kubernetes, Terraform, CI/CD tools strong observability experience, ideally with more modern approaches like Prometheus, Grafana, OpenTelemetry comfortable with databases exposure to Kafka would be ideal more »
design and implementation experience; Understanding of Infrastructure as Code frameworks e.g. Terraform; Working experience of designing and implementing logging and monitoring stacks to aid observability e.g. Grafana, CloudWatch, PagerDuty, ELK stack Good scripting skills in one of PowerShell, Bash/Shell and Python or another appropriate language; Experience of moving more »
Rate: £650-700/day (negotiable) Location: Remote - Occasional days in the office (London) Duration: 6 months + Extensions IR35: Inside IR35 My client is looking for a network engineer competent in the following technologies: Designing/recommending telemetry infrastructure for more »
Kafka and stream processing services Comfortable with at least one of our other centre-of-gravity languages (Go, Python, Java, Kotlin) Familiarity with our observability stack (Grafana, Prometheus, Honeycomb) Experience in Implementing Site Reliability Engineer (SRE) principles in daily work Product-focused mindset with a strong emphasis on creating impactful more »
required: Strong Cloud experience with AWS and AWS Services Containerisation/Orchestration with Kubernetes Strong understanding of IaC with Terraform Wealth of Monitoring and Observability experience Knowledge of Security/DevSecOps practices This position can offer £90-110K, plus benefits, and operates a hybrid working model (with 3 office more »
Define and follow software standards and processes from peer code reviews to coding standards Follow best DevOps and DevSecOps practices, to ensure successful delivery, observability, operation and security of software in production Work with test and operations teams to troubleshoot and resolve issues. Write unit and automated functional tests. Ensure more »
have knowledge of CI/CD tools. The tools can be TeamCity, Jenkins, IBM Tool Chain etc. Should have knowledge of APM and observability tools. The tools can be Instana, DX APM, App D, New Relic Should have knowledge of server automation tools like Salt, Puppet, Chef, and Ansible more »
Surrey, England, United Kingdom Hybrid / WFH Options
Roc Search
new services and features is optimal in the context of their tech ecosystem, considering various functional and non-functional attributes, such as performance, availability, observability, security and cost. You will maintain a strategic vision, ensuring that a fast-paced development cycle converges on your preferred target architecture. You will become more »
approach ensures high-quality code, fosters knowledge sharing, and strengthens our collective expertise You play a pivotal role in driving automation, fine-tuning, enhancing observability, and ensuring reproducibility across our platform. Your contributions are instrumental in maintaining the platform's excellence and reliability Key Requirements: At least 3 years of more »
is looking for an experienced Site Reliability Engineer to assist with the growing demand for their services. If you're an advocate for monitoring and observability practices who enjoys working closely with product teams to ensure systems are secure, scalable and reliable then this could be the perfect next challenge. What more »
enhancing efficiency. Enforce adherence to digital principles, ensuring the integrity, security, and compliance of solutions while meeting both functional and non-functional requirements. Embed observability into solutions, monitoring production performance, resolving incidents, and addressing underlying risks and issues. Advocate for client requirements while maintaining discretion and confidentiality. Standardise best practices more »
to improve productivity. • Experience of integrating with API services • Sound knowledge of mobile applications - SOLID principles and design patterns • Working experience in monitoring and observability, automated CI/CD pipelines more »
their web platform Work closely with Product teams and Product managers to ensure alignment with the product roadmap Coaching engineers in best practices Advocating observability, serverless architecture and automated testing practices What you'll need: 5+ years in architecting and building web applications Expertise working with React, Node and AWS more »
engineering mindset is crucial so we're much more interested if you can talk about the successes (and failures) of impacting problems like latency, observability, performance tuning etc. If you also can discuss a track record in building B2B SaaS products, that would be excellent. They are a startup and more »
designing, developing, and deploying applications based on microservices. Event-Driven Systems: Hands-on experience with Apache Kafka or similar distributed messaging systems. Monitoring and Observability: Familiarity with monitoring tools like Prometheus, Grafana, or Victoria Metrics. Database Technologies: Experience working with various databases, including: - TSDB: InfluxDB, TimescaleDB - GDB: Dgraph, Neo4j This more »
team of versatile, skilled developers and product/design teams. ⭐ You'll set and maintain best practices for the team around development, security, testability, observability and performance, whilst also identifying and resolving issues as a wider team. ⭐ The development team pride themselves on a family-feel culture, prioritising excellence over more »
shape how everything runs at THINKalpha and be a leading voice in how we work and build our infrastructure. Your Work Configure and maintain observability tooling with Datadog and PagerDuty (Slack channels) Contribute to our IaC codebase by creating and maintaining Terraform and Ansible modules, and participate in the review … tools. Experience with both on-premise/colocated servers as well as cloud infrastructure, and hybrid deployments spanning both types of environments. Experience with observability platforms (e.g., Datadog) and alarm systems (e.g., PagerDuty) >Nice to have< Coding background in at least one language (Node, JavaScript, Python, C++, etc.) Understanding of more »
to join their London technology team. This is a new role for the team and they are looking to hire someone with very strong observability and monitoring skills. The technology stack is rapidly evolving across the company and all of the infrastructure and application stack is now being built using … Microservices architecture. The successful Production Engineer will be embedded within some of the core development teams, finding new ways to improve monitoring and observability workflows using tools like Prometheus and Grafana. This role will involve direct interaction with traders, portfolio managers and senior stakeholders across the business so strong leadership … and interpersonal skills are required for this position. Key Requirements: Expert scripting skills with PowerShell, Bash, Python etc. In-depth knowledge of monitoring & observability tools such as Prometheus, Grafana and OpenTelemetry Strong knowledge of CI/CD tooling Experience with metrics and tracing instrumentation, such as LGTM and PromQL Knowledge more »
Are you a visionary problem-solver with the ability to transform legacy observability setups into cutting-edge systems? Do you excel at designing innovative solutions that drive business value? We're on the hunt for a talented Kubernetes/Monitoring Solutions Architect based in the UK to lead our team in … revolutionizing our observability infrastructure within a data analytics company. The role is fully remote and may require some out-of-hours work to align with the timezone differences. Kubernetes & Monitoring Architect Responsibilities: Assess and comprehend existing legacy observability tools and infrastructure prevalent in the business, including Splunk, AppDynamics, Cribl, Zabbix … ThousandEyes, and ServiceNow Event Management Collaborate closely with cross-functional teams to define requirements and objectives for future observability solutions, with a keen focus on noise reduction, seamless integration of business context, and harnessing the power of AIOps/Self-Healing capabilities Craft and articulate innovative designs for more »
infrastructure as code (IaC) tools (Terraform) Solid understanding of CI/CD pipelines, version control systems, and release management practices. Familiarity with monitoring and observability tools (Prometheus, Grafana, Dynatrace) Knowledge of security best practices, compliance standards, and incident response protocols. Strong analytical and problem-solving skills, with the ability to more »
understanding of Google Cloud (GCP) Deep understanding of SRE ethos and principles Vast amounts of Terraform experience Solid experience with Python Solid experience of Observability tooling. Good experience in dashboard creation/data visualisation using tools such as Google Looker, or Grafana Strong CI/CD experience Strong containerisation experience more »
Site Reliability Engineer (SRE) within the Retirement platform where your main responsibilities will be to work with our existing SRE team to ensure strong observability across our services utilizing tools such as Dynatrace and Splunk. You will work closely with the wider team to embed SRE principles of delivering secure … teams to understand root causes of incidents. Striving to remove manual tasks (toil) through automation and the improvement of processes. Specific activities will include: Observability, investigating & fixing service issues, with an engineering mentality - resolving via code changes in some cases directly and in others by working with software engineers. Increasing … our observability by identifying gaps and serving as an expert in discussions around logging, metadata, and response codes, creating dashboards and custom alerts/metrics in our APM tools (e.g. Dynatrace). Implementing further automation and reducing toil. Hands-on engineering, maintaining our Infrastructure as Code and CI/ more »