Site Reliability Engineer
Salary undisclosed
Apply on
Original
Simplified
Job Description
- Design and implement the architecture of our next generation of automated infrastructure following Infrastructure as a Code model.
- Deliver high-quality code, request and conduct code reviews, and promote the company's development standards.
- Deploying, automating, maintaining and managing Cloud-based systems ensuring the availability, performance or scalability environment
- Be involved in change, release, and incident management, and resolve problems relating to critical service operations.
- Collaborate with engineering teams to improve reliability, stability, and solve scalability challenges.
- Write technical documentation relevant to the project.
- Optimize existing systems, build infrastructure, and reduce work through automation.
- Mentor other engineers, define our technical culture, and help build a fast-growing team.
Qualifications
- Preferably a degree in Computer Science, Software Engineering, Information Technology, or related fields, with a minimum of 3 years of experience with Linux environment.
- Preferable Malaysian Citizen / Resident
- Experience with containerization technologies (e.g Docker) and container orchestration platforms (e.g Kubernetes), GCP, GKE.
- Experience with implementing and improving CI/CD processes (build & deployment pipelines).
- Experience with infrastructure automation & provisioning tools (e.g Terraform & Ansible).
- Experience with monitoring tools usage: Dynatrace, Grafana, Prometheus, etc.
- Having exposure in supporting and administrating Cloud platforms such as AWS, Azure, Google Cloud and etc
- Preferable have experience in languages Go or Java
- Experience with scripting in languages like Python, Shell Scripting, and Bash.
- Have Mandarin speaking knowledge (Preferred)
Similar Jobs