System Engineer (GPU infrastructure)
RM 8,000 - RM 8,000 / month
Checking job availability...
Original
Simplified
Key Responsibilities:
- Design and deploy scalable, high-performance systems that leverage GPU resources for AI applications. Ensure optimal configuration for various workloads.
- Manage and maintain the underlying infrastructure, including servers, networking, and storage solutions. Monitor system performance and implement necessary optimizations.
- Work closely with software engineers and customers to understand their requirements and provide the necessary infrastructure support for AI model training and deployment.
- Implement, manage, monitor, and maintain the platform to ensure optimal performance and high reliability.
- Provide technical guidance across complex infrastructure projects.
- Diagnose and resolve system issues related to hardware, software, and network performance. Provide technical support for internal teams and customers as needed.
- Develop automation scripts to streamline system deployment, monitoring, and maintenance tasks.
QUALIFICATIONS
- Bachelor’s degree in Computer Science or a related technical field
- Proven experience (3+ years) as a System Engineer or in a similar role within IT infrastructure or cloud services.
- Introduce technology and software to improve the performance, resiliency, and quality of service in IT infrastructure.
- Strong experience in managing bare metal servers, GPU infrastructure, or high-performance computing systems.
- Familiarity with monitoring tools (e.g., Prometheus, Grafana) and logging frameworks.
- Possess a deep understanding of Linux fundamentals.
- Understand the Kubernetes environments and be able to run the debugging.
Job Types: Full-time, Permanent
Pay: Up to RM8,000.00 per month
Benefits:
- Dental insurance
- Health insurance
- Maternity leave
- Opportunities for promotion
- Parental leave
- Professional development
- Vision insurance
Schedule:
- Monday to Friday
Application Question(s):
- How much is your expected salary?
- Do you have experience in managing bare metal servers, GPU infrastructure, or high-performance computing systems?
- How long is your notice period?
Work Location: In person