GCP Engineer
Job Description: GCP Platform Engineer / SRE
1. In‐Scope Technologies / Products
- GCP Cloud
- GCP API Management (Apigee) – Infrastructure
- GCP Networking
- GCP Cloud Armor
- GCP Load Balancer
- GCP Kubernetes Service (GKE)
- GCP Storage
- HashiCorp Terraform
- HashiCorp Vault
- Harness
- Containers
- Backstage
2. Skillsets Required
Cloud Platform Engineering
- Proven experience designing, building, and operating secure, automated cloud platform capabilities with a focus on Azure (with readiness to cross‐train in GCP as required).
Infrastructure as Code (IaC)
- Strong proficiency with Terraform (minimum).
- Experience with modern CI/CD systems such as GitHub Actions, Harness, and Jenkins.
API Management
- Deep understanding of GCP API Management (Apigee) infrastructure and API Gateway solutions.
- Familiarity with API design and security standards (REST/OpenAPI, authentication/authorisation, mTLS, certificate lifecycle management).
Networking & Security
- Hands‐on experience with GCP Cloud Armor, GCP Networking, and implementation of secure‐by‐design controls across platform lifecycle.
Containers & Orchestration
- Experience with GKE, container runtimes, and service mesh patterns (e.g., Istio).
Automation & Observability
- Ability to implement actionable observability solutions.
- Performance tuning and automation to reduce toil.
- Experience defining and operating to SLOs/SLIs.
Scripting & Tooling
- Proficiency in Bash, PowerShell, or Python.
- Familiarity with HashiCorp Vault, Harness, and Backstage is desirable.
Collaboration & Mentoring
- Ability to mentor engineers, contribute to communities of practice, and uphold platform engineering standards.
Certifications
- GCP certifications are desirable.
3. SRE Role – Additional Requirements
- Relevant GCP certifications aligned to the service.
- Strong DevOps background with experience in IaC and CI/CD pipelines (Terraform, Jenkins).
- Ability to quickly understand, update, and write code (Python, Groovy, Bash, PowerShell).
- Demonstrable expertise in modular, maintainable IaC design.
- Strong understanding of cloud security, networking, and APIs.
- Excellent troubleshooting and problem‐solving capability.
- Hands‐on experience with observability tooling (including Observability‐as‐Code and SLO‐based Dynatrace monitoring).
- Strong understanding of source‐control practices and collaborative engineering.
- Experience developing and administering Kubernetes clusters in production.
- Strong focus on automation and toil reduction.
- Good knowledge of incident management and issue resolution processes.
- Demonstrated passion for continuous learning and engineering excellence.
4. Scope of Services
As Infrastructure Engineers within the Integration GCP team, responsibilities include:
Design, Build & Operate
- Deliver and maintain secure, automated GCP API Management platform capabilities, supporting both API Gateway and broader integration products.
Platform Enablement
- Enable product teams to deliver API‐first services using reusable patterns and robust integration tools.
Infrastructure Automation
- Develop and maintain IaC solutions for provisioning and managing Azure and GCP resources, ensuring consistency, repeatability, and compliance.
Security & Compliance
- Embed security best practices throughout the platform lifecycle to safeguard organisational and customer data.
Performance & Reliability
- Define, monitor, and operate against SLOs/SLIs to ensure availability, performance, and resilience.
Continuous Improvement
- Drive automation, observability enhancement, and performance tuning to reduce manual effort and improve reliability.
Collaboration
- Work closely with architecture and feature teams to evolve the cloud roadmap and platform products.
- Contribute to documentation, technical enablement, and engineering standards.
Mentoring
- Mentor team members and promote a culture of continuous learning.
5. SRE Role – Additional Service Responsibilities
- Hands‐on engineering ownership of IaC and CI/CD pipeline‐based products and services.
- Respond to change requests, implement enhancements, and improve reliability and customer experience.
- Observe, investigate, and resolve service issues using an engineering mindset – resolving issues through code, automation, and root‐cause elimination.
- Implement automation to remove manual toil by leveraging existing cloud tooling or integrating new technologies.