(Senior) Infrastructure Manager (Cloud) - Annual Salary Package Up to 1M

PeopleLink Services Limited

Key Responsibilities:

AWS Infrastructure Management:

  • Design, deploy, and manage scalable, secure, and cost-effective AWS infrastructure.

Production Support & Incident Management:

  • Provide 24/7 production support for critical systems, ensuring high availability and minimal downtime.  
  • Respond to and resolve production incidents promptly, conducting root cause analysis (RCA) and implementing preventive measures.  
  • Participate in on-call rotations to address system outages and performance issues.  

Monitoring & Observability:

  • Implement and maintain monitoring and alerting systems using AWS CloudWatch, Prometheus, Grafana, or similar tools.  
  • Analyze system metrics and logs to identify trends, anomalies, and areas for improvement.  
  • Ensure comprehensive visibility into system health and performance.  

Automation & CI/CD:

  • Develop automation scripts and tools to streamline deployment, scaling, and operational tasks.  
  • Automate repetitive tasks to improve efficiency and reduce human error.  

Disaster Recovery & High Availability:

  • Design and implement disaster recovery plans to ensure business continuity in the event of system failures.  
  • Configure and maintain high-availability architectures, including load balancing, auto-scaling, and failover mechanisms.  

Collaboration & Best Practices:

  • Work closely with development teams to ensure applications are designed for reliability, scalability, and observability.  
  • Collaborate with security teams to ensure systems are secure and compliant with industry standards.  
  • Promote SRE best practices across the organization, including monitoring, logging, and incident management.  

 

Qualifications:

  • Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent experience).   
  • Experience in production support and incident management, preferably in the fintech or financial industry.  

 

Technical Skills:

  • Knowledge of AWS services.
  • Experience with containerization and orchestration tools.
  • Familiarity with monitoring and observability tools (e.g., Prometheus, Grafana).  

 

All applications submitted through our system are delivered directly to the advertiser, and applicants' personal data is kept secure.

More Information

Salary
  • N/A
Job Function
Location
  • Kowloon City
Work Model
  • On-site / At the workplace
Industry
Employment Term
  • Full-time
Experience
  • N/A
Education
  • Degree
