Server Engineer
RM 4,000 - RM 4,999 / Per Mon
Apply on
Availability Status
This job is expected to be in high demand and may close soon. We’ll remove this job ad once it's closed.
Original
Simplified
Server Engineer JD and KRA Key Responsibilities: 1. Server Infrastructure Management: o Design, configure, and manage Windows Server, Linux (RHEL, CentOS), and virtualized environments (VMware, Hyper-V). o Implement and maintain server clusters for high availability (HA) and disaster recovery (DR) using technologies like Microsoft Failover Cluster, vSphere HA, and DR configurations. o Optimize server performance by performing regular resource monitoring and capacity planning (CPU, RAM, disk I/O, network throughput). o Patch management: Regularly update and patch OS and server applications, ensuring compliance with security policies. o Troubleshooting of complex server hardware, OS-level issues, and application failures. 2. Storage Infrastructure (SAN, NAS, DAS): o Administer enterprise storage solutions such as SAN (EMC, NetApp, HPE 3PAR), NAS (NetApp, Isilon), and DAS. o Storage provisioning and management: Create and assign LUNs, volumes, and file systems, ensuring proper tiering, load balancing, and optimization. o Capacity management: Monitor and predict storage capacity utilization trends, implement proactive measures to avoid outages or resource shortages. o Storage replication and high availability: Implement technologies like SnapMirror, VVols, MetroCluster, and Replication Manager for data protection and disaster recovery. o Storage performance tuning: Perform disk I/O optimizations and address performance bottlenecks related to storage subsystems. o Data lifecycle management (ILM): Design and implement strategies for archiving, migration, and backup of data across on-premises and cloud-based storage. 3. Backup Solutions and Disaster Recovery: o Backup software administration: Implement and manage enterprise backup solutions (e.g., Veeam, NetBackup, Commvault, Veritas, DPM). o Backup infrastructure design: Design, configure, and maintain backup servers, storage devices (tape libraries, disk arrays), and backup scheduling. o Data recovery and restoration: Perform full, incremental, and differential backups, and ensure rapid recovery of mission-critical data with minimal RTO and RPO. o Cloud-based backups: Configure and maintain cloud backup solutions such as Azure Backup, AWS S3, or Google Cloud Storage for offsite data protection. o Backup monitoring: Set up alerting and monitoring tools (e.g., SolarWinds, Nagios, Zabbix) to ensure the integrity of backups and the timely completion of scheduled jobs. o Disaster Recovery Testing: Conduct regular testing of DR processes, ensuring systems are restored according to defined RTO/RPO objectives. This includes both virtual and physical disaster recovery plans. 4. Advanced Troubleshooting & Issue Resolution: o Root-cause analysis (RCA) of complex server, storage, and backup failures, providing resolutions that adhere to SLAs. o Advanced diagnostic tools: Utilize tools such as Wireshark, iLO, IMM, ESXi diagnostic utilities, and RAID management interfaces for in-depth issue analysis. o Log analysis: Review system logs (Syslog, event logs, VMware logs, etc.) to identify anomalies and prevent potential failures. Server Engineer JD and KRA 5. Monitoring & Reporting: o Utilize monitoring tools like Prometheus, Zabbix, Nagios, PRTG, and SolarWinds to continuously monitor server, storage, and backup health, ensuring system uptime. o Create custom dashboards and reports on storage usage, backup status, and server performance. o Produce regular backup and storage health reports to leadership, providing insights into system stability and trends. 6. Security & Compliance: o Implement security measures such as encryption (at-rest, in-transit), access control, and rolebased security for storage and backup systems. o Ensure that server, storage, and backup configurations comply with regulatory standards (GDPR, HIPAA, PCI-DSS) and internal security policies. o Perform security audits and vulnerability assessments on backup storage and server environments, addressing identified issues. 7. Documentation & Knowledge Sharing: o Maintain detailed documentation for server, storage, and backup environments, including configurations, troubleshooting steps, and procedures. o Provide knowledge transfer to lower-level engineers and create runbooks for recurring tasks (e.g., server provisioning, backup failure recovery). 8. Collaboration and Coordination: o Collaborate with cross-functional teams (network engineers, cloud teams, database administrators) to ensure storage and backup requirements are met. o Coordinate with external vendors for hardware replacement, warranty support, and software updates for storage/backup systems
Similar Jobs