Site Reliability Engineer, SRE, Monitoring, DevOps
Your new company
Our client is a leading global retailer known for its commitment to innovation and excellence in technology solutions. They are currently seeking a Site Reliability Engineer (SRE) to join their dynamic team. In this role, you will play a key part in enhancing system reliability and performance.
Your new role
- Design and maintain observability tools to enhance system monitoring.
- Collaborate with application owners and infrastructure teams to identify potential issues.
- Build dashboards and alerts tailored to application requirements.
- Review Service Level Objectives (SLOs) for critical applications.
- Mentor users on observability best practices.
What you'll need to succeed
- Strong problem-solving skills with a focus on identifying performance issues.
- Detail-oriented, with the ability to break down complex systems and the communication skills to translate user experiences into measurable indicators.
- Experience with APM and observability tools (e.g., OpenTelemetry) is highly preferred.
- Experience with automation tools (e.g., Ansible).
- Background in application development (e.g., Python, Java, JavaScript) is a plus.
What you need to do now
If you're interested in this role, click 'Apply Now' to forward your updated CV, or call Cherry Ho (+852 2230 7493) for a confidential discussion.
All applications submitted through our system will be delivered directly to the advertiser, and the privacy of applicants' personal data will be protected.