Cloud System Reliability Engineer (SRE) (Mandarin speaker)
Our client is a global leader in the next generation of digital services and consulting. This role is to support Tencent project. This role is required to work on-call/ standby.
- Face our customers and services support teams, troubleshoot and analyze production problems, provide effective preventive solutions.
- Inspect and maintain our monitoring metrics, identify hidden problems and work with backend engineers to solve it.
- Complete daily operation tasks, e.g. service release, setting of monitoring metrics, etc.
- Analyze production problems and optimize the service to improve its availability.
- Owning and improving the scalability and reliability of our services products.
- Working directly with product engineering teams and infrastructure teams.
- Hands on designing, coding, configuring, debugging, and monitoring.
- Advocating for DevOps and SRE culture and best practices in cross-functional teams.
- Maintaining a positive and supportive team work culture.
- Good Mandarin proficiency.
- This role is required to work on call/ standby.