Job Summary:
We are seeking a highly skilled and experienced Cloud Engineer with expertise in AWS/Azure and Kubernetes. The ideal candidate will be responsible for designing, implementing, and maintaining highly available and resilient cloud infrastructure for microservice applications. This role requires a deep understanding of AWS services and container orchestration with Kubernetes. The successful candidate will collaborate with cross-functional teams to ensure the reliability, scalability, and performance of cloud-based applications.
Responsibilities:
- Design, deploy, and manage highly available, scalable, and fault-tolerant cloud infrastructure using AWS services.
- Develop and implement automation and orchestration tools for deployment, monitoring, and management of cloud resources.
- Collaborate with development teams to design and implement CI/CD pipelines for microservice applications running on Kubernetes.
- Ensure the security and compliance of cloud infrastructure and microservices by implementing best practices and industry standards.
- Troubleshoot and resolve issues related to cloud infrastructure, application deployment, and performance bottlenecks.
- Perform capacity planning, monitoring, and optimization of the cloud-based applications to ensure high availability and cost-effectiveness.
- Collaborate with cross-functional teams to define and implement disaster recovery and business continuity strategies.
- Stay up-to-date with the latest trends and advancements in cloud technologies, DevOps practices, and microservice architectures.
Required Skills and Qualifications:
- Bachelor’s degree in Computer Science, Engineering, or a related field
- AWS EKS or Azure AKS: experience managing large Kubernetes (k8s) clusters and hands-on work with YAML manifests
- Observability: Grafana, Prometheus, ELK, CloudWatch
- Hands-on experience in Terraform or Pulumi coding to set up and tear down cloud infrastructure
- Strong knowledge of EC2 and VPC networking (security groups, internet gateways, NAT, VPC peering)
- Hands-on experience setting up and operating Confluent Kafka, Elastic Stack, and AWS RDS/PostgreSQL/Redis clusters
- Strong DevOps pipeline experience, especially with GitHub Actions
- Site reliability engineering (SRE) experience, including performance tuning, chaos testing, and security penetration testing