Epicareer Might not Working Properly
Learn More

Site Reliability Engineer

Salary undisclosed

Apply on


Original
Simplified

Job Description

  • Design and implement the architecture of our next generation of automated infrastructure following Infrastructure as a Code model.
  • Deliver high-quality code, request and conduct code reviews, and promote the company's development standards.
  • Deploying, automating, maintaining and managing Cloud-based systems ensuring the availability, performance or scalability environment
  • Be involved in change, release, and incident management, and resolve problems relating to critical service operations.
  • Collaborate with engineering teams to improve reliability, stability, and solve scalability challenges.
  • Write technical documentation relevant to the project.
  • Optimize existing systems, build infrastructure, and reduce work through automation.
  • Mentor other engineers, define our technical culture, and help build a fast-growing team.

Qualifications

  • Preferably a degree in Computer Science, Software Engineering, Information Technology, or related fields, with a minimum of 3 years of experience with Linux environment.
  • Preferable Malaysian Citizen / Resident
  • Experience with containerization technologies (e.g Docker) and container orchestration platforms (e.g Kubernetes), GCP, GKE.
  • Experience with implementing and improving CI/CD processes (build & deployment pipelines).
  • Experience with infrastructure automation & provisioning tools (e.g Terraform & Ansible).
  • Experience with monitoring tools usage: Dynatrace, Grafana, Prometheus, etc.
  • Having exposure in supporting and administrating Cloud platforms such as AWS, Azure, Google Cloud and etc
  • Preferable have experience in languages Go or Java
  • Experience with scripting in languages like Python, Shell Scripting, and Bash.
  • Have Mandarin speaking knowledge (Preferred)