Epicareer Might not Working Properly
Learn More

Site Realibility Engineer

Salary undisclosed

Apply on


Original
Simplified

Work you’ll do:

  • You will perform primary and secondary research, conduct analyses and appropriate modelling tasks that feeds directly into the development of technology-enabled solutions for tackling our clients’ complex business problems.
  • You will leverage on your training in technology, utilize analytical abilities and communication skills to support the project teams in delivery of our digital solution architectures and in development of work products that addresses our clients’ business needs and help achieve their strategic goals.
  • You will support the project teams in developing presentation materials and in coordination of communications with the client.
  • You will assist the project teams in delivery of business-driven, technology-enabled solutions to help our clients meet pressing challenges and seize opportunities in their respective markets.
  • You will work with diverse and talented project team members to solve problems, improve performance, and generate value for our clients across all industries.
  • You will uphold the firm’s standards and ethos in working with fellow team members and in your interactions with the clients.
  • You will support business development efforts by contributing directly to the preparation, development of proposals, presentations, and publications.


Requirements:

  • The right person will have at least 7 years of relevant experience in DevOps, SRE.
  • Should be well versed in the concepts of DevOps and have a full understanding of Site Reliability Engineering (SRE) principles.
  • Knowledge of the correlation between SLIs and SLOs when measuring service reliability
  • Must be familiar with well-known system monitoring and system configuration & management tools such as ElasticSearch, Grafana, Prometheus, Ansible and Saltstack
  • Must be familiar with Linux system, Administration, Linux Shell Programming (Bash)
  • Possesses Programming skills in one more of these languages: Java, Python
  • Experience addressing production issues with effective solutions, demonstrated strong ability in debugging/troubleshooting issues on application/infrastructure/operating system levels
  • Experience in administration, deployment, configuration, management and troubleshooting Kubernetes cluster and related application (e.g. Istio, Consul).
  • Experience in automating the deployment, configuration, management and troubleshooting containerized, cloud native applications running on Kubernetes.
  • Familiar with message queue systems (e.g. Kafka, RabbitMQ) and other distributed systems (e.g., Consul, Zookeeper, MongoDB, Redis etc.)
  • Experience in conducting system tests for security, performance, availability, and reliability.
  • Experience in coordinating with development teams to streamline code deployment with CICD and IAC pipelines, possesses the ability in building automated solutions through code.
  • Preferred Qualifications : CNCF Certification (CKA-Certified Kubernetes Administrator), AWS Certification (AWS Solution Architect Associate, AWS Solution Architect Professional)
  • Demonstrated portfolio of work showcasing technical competence
  • An appreciation of the consulting lifestyle and ability to travel (both locally and abroad) is a pre-requisite to fit to our short-term and long-term project assignment.