Key Responsibilities:
Site Reliability Engineering: Ensure robust and stable IT systems to support business operations.
Automation and DevOps: Implement and manage automation tools and platforms such as Azure DevOps, Terraform, and Ansible Automation Platform.
Automated cloud provisioning (Azure and Alibaba Cloud, using Terraform and ServiceNow)
End-to-end automation for business-as-usual (BAU) tasks and incident remediation (Ansible and Datadog)
Documentation: Create technical documentation, runbooks, architecture diagrams, and automation standards.
Technical Projects: Implement and deliver projects in hybrid environments (cloud and on-premises) with a focus on high availability, performance, and security.
Enterprise Platforms Management: Oversee cloud computing resources, servers, databases, SAN storage, data protection, and disaster recovery.
Operational Support: Manage and monitor housekeeping jobs, conduct regular system and hardware health checks, and perform disaster recovery drills.
After-Hours Support: Provide support outside of office hours when required.
Problem Diagnosis: Perform first-level diagnosis to identify and resolve issues, improving reliability.
Additional Duties: Perform other tasks as assigned by the team leader.
Requirements:
Education: Degree or diploma in Computer Science, Information Technology, or equivalent.
Experience: 5+ years of experience in cloud engineering, automation, or platform operations.
Certifications: IT professional certification (e.g., Azure Administrator Associate, Terraform Associate, or Alibaba Cloud ACA/ACP) is advantageous.
Technical Skills:
Proficiency in automation and DevOps tools such as Azure DevOps, Terraform, and Ansible Automation Platform.
Strong experience with scripting languages like PowerShell, Bash, and Python.
Familiarity with ServiceNow catalog and workflows is a plus.
Familiarity with monitoring tools such as Datadog, and with event-driven automation.
Experience working across multiple cloud platforms, such as Azure and Alibaba Cloud.
Experience in administering and troubleshooting Microsoft Windows, Linux, MS SQL databases, and networking.
Knowledge of Active Directory, ADFS, MS cluster, SAN Storage, Commvault Backup, MS SCCM/SCOM, HP SIM/iLO, or System Management Homepage is a plus.
Language Proficiency: Proficiency in both spoken and written English.
Interested candidates, please send your resume, expected salary, and notice period via CTgoodjobs.