Epicareer Might not Working Properly
Learn More

Manager, Site Reliability Engineering

Salary undisclosed

Checking job availability...

Original
Simplified
In This Role You Will: -Lead by example for site reliability engineering execution and manage stages from ideation and development to launch and ongoing maintenance, ensuring timely delivery and high-quality technology infrastructure that meet customer needs, and incorporating feedback loops for iterative improvements. -Identify and assist on the stability, scalability, cost optimisation, and security of the technology infrastructure by continuously assessing, upgrading, and implementing best practices, using specific frameworks or standards to support business growth and protect sensitive data. -Manage high-performing site reliability engineering teams by recruiting top talent, providing mentorship, and fostering professional growth, with a focus on long-term team development to create a collaborative and innovative work culture. -Organise collaboration within technology teams by creating efficient communication to enhance collaboration and achieve shared goals. -Assist on identifying risk in technology infrastructure by conducting proactive risk assessments and execute contigency plan to ensure regulatory and security compliance for infrastructure and database deliverables -Execute strategic vision for technology infrastructure improvement by executing the roadmap of initiatives which align with company goals to ensure continuous improvement. -Ensure adherence to the compliance of company policies, industry regulations and legal requirements. You're A Great Fit If You Have: -8+ years of experience in site reliability engineering. -Working experience in managing large distributed teams and driving strategic initiatives. -Strong bias for action with proven ability to prioritise and manage multiple tasks effectively in a fast-paced environment. -Proficiency in version control systems (VCS) like Git. -Advance proficiency in cloud technology especially AWS services. -Advance proficiency in containerisation technology such as Docker and Kubernetes. -Advance proficiency in NoSQL, SQL, event, queue and cache databases such PostgreSQL, MongoDB, Kafka, RabbitMQ and Redis. -Advance proficiency in monitoring technology such as Vector, Loki, Tempo, OpenTelemetry, VictoriaMetrics (Prometheus) and Grafana. -Advance proficiency in networking and security especially around load balancing, firewall, encryptions. -Extensive experience in infrastructure as code (IaC) and CICD technology such as Terraform, ArgoCD and Argo Workflows. -Working experience in cloud architecture or solution architecture. -Ability to handle sensitive information with confidentiality. -Excellent communication and interpersonal skills.