DevOps Engineer (AWS)
Role Overview:
We are looking for a DevOps Engineer to join our DevOps team. You will focus exclusively on AWS and Amazon EKS, automating infrastructure with Terraform, orchestrating containers via EKS, building robust CI/CD with GitOps, and implementing strong monitoring/observability + AWS EKS IAM concepts to support secure, high-performance, cost-optimised systems. You will work closely with developers and SREs in an automation-first, collaborative culture.
Key Responsibilities:
Build and maintain robust CI/CD pipelines for automated testing, deployment, and rollouts in AWS/EKS.
Manage and optimize Amazon EKS clusters for container orchestration in production.
Implement and evolve Infrastructure as Code (IaC) using Terraform on AWS.
Support live production AWS/EKS environments, including on-call rotation, incident response, root cause analysis, and blameless post-mortems.
Automate provisioning, configuration, scaling, observability, and secure workload access across AWS services.
Collaborate on improving system reliability, observability, security (DevSecOps/EKS IAM), and AWS cost optimization.
Qualifications:
Strong Linux system administration (file systems, processes, networking, troubleshooting, bash scripting).
Scripting & programming in Python and Bash/Shell; Go/Typescript is a nice-to-have.
Git workflows, branching strategies, and version control best practices.
Proficiency with AWS (core services like EC2, VPC, IAM, S3, RDS, Lambda, EKS, CloudWatch, etc.; production-grade experience required).
Solid experience with Amazon EKS (cluster creation/management, node groups, add-ons, upgrades, scaling, troubleshooting).
Infrastructure as Code (IaC) expertise, with Terraform strongly preferred (AWS provider/modules experience a must).
Familiarity with Helm charts and Kustomize (for packaging, templating, versioning, and deploying Kubernetes applications/add-ons in EKS environments).
Hands-on building and maintaining CI/CD pipelines using GitHub Actions or GitLab CI/CD, plus ArgoCD (GitOps).
Experience with monitoring/observability tools (Prometheus, Grafana, ELK stack, Datadog, OpenTelemetry) – including metrics, logs, traces, dashboards, alerting, and integration with AWS services (e.g., CloudWatch, X-Ray).
Solid understanding of AWS EKS IAM concepts, especially for EKS, including:
○ IAM Roles for Service Accounts (IRSA)
○ EKS Access Entries and Access Policies (Cluster Access Management API)
○ EKS Cluster IAM Role and Node IAM Role configuration/management
○ Principle of least privilege, temporary credentials, avoiding over-permissive roles, credential isolation, and integration between IAM and Kubernetes RBACExperience analysing and optimising AWS costs using tools such as AWS Cost Explorer, Compute Optimizer, and EKS workload right-sizing.
Skills:
3+ years of hands-on experience in DevOps, Site Reliability Engineering (SRE), infrastructure engineering, or closely related roles (focused on AWS/EKS environments).
Proven track record building and maintaining production-grade CI/CD pipelines with AWS/K8S.
Bachelor's degree in Computer Science, Engineering, IT or equivalent practical experience (degree-optional – we value hands-on AWS/EKS impact most).
Excellent problem-solving and troubleshooting, especially under pressure in production AWS/EKS setups.
Automation-first mindset with a passion.
Growth mindset – eagerness to continuously learn, adapt to evolving AWS/EKS technologies and best practices, embrace challenges as opportunities for improvement, and turn incidents/post-mortems into actionable learning for personal and team growth.
Proven ability to collaborate effectively with cross-functional stakeholders, including engineering, SRE, security, and product teams.
Maintain clear technical documentation, runbooks, and operational playbooks for production systems.
We Offer:
Experience a dynamic and team-orientated work environment.
Opportunities for personal growth and learning.
An open, inclusive and supportive team where you will be valued, and your suggestions will be welcome.
26 days paid holiday per year, in addition to local public holidays.
Competitive salary.
Risk Benefits such as pension, Life Assurance (4x annual salary), Private Medical Insurance.
Team Building activities.
Local discounts and more...!
- Department
- Corporate
- Locations
- Hammersmith
- Remote status
- Hybrid
- Language requirement
- Chinese