Epicareer Might not Working Properly
Learn More

云平台运维工程师 (Cloud Platform Operations Engineer)

  • Full Time, onsite
  • NEXALINK MARKETING SDN. BHD.
  • Kuala Lumpur Networks & Systems Administration (Information & Communication Technology) Full time RM 13, Malaysia
Salary undisclosed

Apply on


Original
Simplified

Job Responsibilities:

  • Responsible for the daily maintenance of production servers, cloud platforms, Windows and Linux operating systems, and middleware.
  • Perform routine inspections, troubleshooting, monitoring, and upgrades of business systems.
  • Manage and maintain the company's information security, system data backup, and daily monitoring.
  • Ensure the reliability and stability of underlying support systems, including monitoring, alerting, and operations management, to guarantee 24/7 system high availability.
  • Job Qualifications:

  • Bachelor’s degree or above, with proficiency in operating and managing cloud platforms like GCP and AWS, and experience in cloud service planning, delivery, and operations.
  • In-depth knowledge of container technologies and Kubernetes, with hands-on experience in implementing projects, particularly in areas such as observability, multi-cluster management, elastic scheduling, and service meshes.
  • Ability to adapt to and excel in the SRE (Site Reliability Engineering) work model, using automation tools to handle routine tasks and improve efficiency.
  • Strong architectural thinking, able to efficiently contribute to company projects by providing comprehensive solutions in networking, security, and deployment.
  • Proficient in Linux operating systems, with skills in diagnosing, analyzing, and resolving common issues, and experience in writing and using Shell or Python scripts to solve business problems.
  • Familiar with TCP/IP networking principles and routing/switching knowledge, capable of basic LAN planning and troubleshooting.
  • Excellent communication skills and a strong sense of teamwork, able to strictly follow business management and process protocols. Proactive in thinking, summarizing, and improving, with a strong sense of responsibility.
  • Proficient in both English and Chinese communication; candidates with a K8S certification will be given preference.
  • 岗位职责:
    1、负责生产服务器、云平台、windows linux操作系统、中间件的日常维护。
    2、负责业务系统的日常巡检、故障排查、日常监控和升级维护。
    3、负责公司信息安全,系统数据备份,公司监控的日常管理和维护
    4、技术保障各底层支撑系统的可靠性与稳定性,监控、报警、运维等管理工作,保证系统7x24高可用
    任职资格:
    1、本科以上学历,熟悉GCP、AWS等云平台的操作和管理,具备云服务规划和交付运维经验。
    2、对容器技术和Kubernetes有深入研究,具备实际项目落地经验,特别是在可观测性、多集群管理、弹性调度、服务网格等方面。
    3、适应并擅长SRE工作模式,能够通过自动化工具解决日常重复性工作,提高工作效率。
    4、具备架构思维,能够高效参与公司项目,提供包括网络、安全、部署在内的全面解决方案。
    5、熟悉Linux操作系统,具备常见故障的诊断、分析和处理能力,能够编写并应用Shell或Python脚本解决业务问题。
    6、熟悉TCP/IP网络原理和路由交换知识,能够进行基本的局域网络规划和问题处理。
    7、具有良好的沟通能力和团队合作意识,能够严格遵从业务管理和流程规范,善于思考、总结和改进,具备强烈的责任心。
    8、能熟练使用中英文进行沟通,有K8S证书优先考虑。

    Benefits

    • Career Growth & Development Opportunities
    • Work-life balance
    • Friendly & Supportive Working Environment
    • 5-days Work Week
    • Amazing work locations
    • Overseas trips
    Similar Jobs