Job Description: We are hiring multiple Site Reliability Engineers (SREs) to join our growing team. The SREs will work closely with the DevOps team to implement standardized tools and practices to ensure high reliability and scalability of our systems.
Responsibilities:
Maintain and enhance the reliability, availability, and performance of large-scale systems.
Follow established DevOps guidelines and standards for tool development and system management.
Develop automation scripts for monitoring, alerting, and incident response.
Collaborate with the DevOps team to improve infrastructure and platform tools (e.g., spug.cc).
Design and implement CI/CD pipelines using GitLab for application and infrastructure deployment.
Manage containerized environments using Kubernetes.
Monitor and analyze system metrics to optimize performance and efficiency.
Implement disaster recovery and high-availability strategies to ensure system resilience.
Requirements:
3-8 years of experience in SRE or DevOps roles.
Proficiency in Infrastructure as Code (IaC) using Terraform.
Strong expertise in Kubernetes for container orchestration.
Hands-on experience with CI/CD pipelines in GitLab.
Proficiency in scripting languages like Python and Bash.
Familiarity with cloud platforms such as AWS, including services like EC2, KMS, and VPC.
Strong problem-solving and collaboration skills.