Site Reliability Engineer

Salary undisclosed

Job Description

Design and implement the architecture of our next generation of automated infrastructure, following the Infrastructure as Code (IaC) model.
Deliver high-quality code, request and conduct code reviews, and promote the company's development standards.
Deploy, automate, maintain, and manage cloud-based systems, ensuring the availability, performance, and scalability of the environment.
Be involved in change, release, and incident management, and resolve problems relating to critical service operations.
Collaborate with engineering teams to improve reliability, stability, and solve scalability challenges.
Write technical documentation relevant to the project.
Optimize existing systems, build infrastructure, and reduce manual work through automation.
Mentor other engineers, define our technical culture, and help build a fast-growing team.

Qualifications

Preferably a degree in Computer Science, Software Engineering, Information Technology, or a related field, with a minimum of 3 years of experience in Linux environments.
Malaysian citizen or resident preferred.
Experience with containerization technologies (e.g., Docker) and container orchestration platforms (e.g., Kubernetes), as well as GCP and GKE.
Experience with implementing and improving CI/CD processes (build & deployment pipelines).
Experience with infrastructure automation and provisioning tools (e.g., Terraform and Ansible).
Experience using monitoring tools such as Dynatrace, Grafana, and Prometheus.
Exposure to supporting and administering cloud platforms such as AWS, Azure, and Google Cloud.
Experience with Go or Java is preferred.
Experience with scripting in Python and shell (e.g., Bash).
Mandarin speaking ability (preferred).