6 to 8 Years Relevant Experience
We are seeking an experienced and highly skilled AWS EKS Kubernetes SME to lead and manage our container orchestration environment on Amazon EKS within VMware Cloud on AWS. This role focuses on the design, deployment, automation, and management of secure, scalable, and highly available Kubernetes workloads in production.
The ideal candidate will bring deep expertise in Kubernetes internals, EKS architecture, CI/CD, infrastructure as code, observability, and container security best practices.
Key Responsibilities:
- Design, deploy, and manage Amazon EKS clusters ensuring high availability, security, and scalability.
- Lead workload migrations to EKS from on-prem environments or other container orchestration tools.
- Develop and implement Infrastructure as Code (IaC) using Terraform, AWS CloudFormation, or AWS CDK.
- Architect and optimize CI/CD pipelines for microservices using tools such as Jenkins, GitHub Actions, CodePipeline, and GitLab CI.
- Define and enforce Kubernetes best practices (namespaces, RBAC, resource quotas, affinity/anti-affinity rules).
- Implement and manage a full observability stack, including Prometheus, Grafana, Fluentd, Loki, and ELK for monitoring, logging, and alerting.
- Enforce security through IAM roles for service accounts, OPA/Gatekeeper, Pod Security Policies, image scanning, and more.
- Enable detailed auditing, logging, tracing, and performance tuning of EKS workloads.
- Provide onboarding, deployment models, and technical guidance to application teams.
- Troubleshoot complex Kubernetes/EKS production issues and provide root cause analysis.
- Maintain comprehensive runbooks, operational documentation, and conduct knowledge transfer sessions.
Core Expertise Required:
Kubernetes / EKS:
- In-depth understanding of Kubernetes internals (controllers, API server, etcd, schedulers).
- Hands-on experience managing Amazon EKS clusters and associated best practices.
- Proficiency in authoring Kubernetes manifests (Deployments, StatefulSets, Services, Ingress).
- Familiarity with Helm, Helmfile, or Kustomize for Kubernetes deployment management.
Cloud & DevOps:
- Strong working knowledge of core AWS services (VPC, EC2, IAM, ALB/NLB, Route 53, CloudWatch, EFS).
- Experience with Terraform, CloudFormation, or AWS CDK for infrastructure automation.
- Proficiency with CI/CD tools (e.g., Jenkins, GitLab CI, ArgoCD, FluxCD).
- Understanding of GitOps workflows and DevOps principles.
Security & Observability:
- Expertise in container security: PodSecurityPolicies, SecurityContext, Seccomp, AppArmor.
- Hands-on with logging and monitoring tools such as Prometheus, Fluent Bit, Grafana, ELK, and AWS CloudWatch.
- Familiarity with service meshes (Istio, Linkerd, or AWS App Mesh) is a plus.
- Experience with container image scanning tools: Trivy, Clair, Prisma, or Aqua Security.
Soft Skills:
- Strong problem-solving and analytical skills.
- Excellent communication and collaboration capabilities.
- Ability to work cross-functionally with DevOps, Security, and Application teams.