Job Description
We are seeking a highly skilled and motivated DevOps/Site Reliability Engineer (SRE) to join our engineering team at Aether Digital Indonesia. As an SRE, you will play a critical role in ensuring the reliability, scalability, and performance of our production systems. You will work closely with cross-functional teams to design and implement robust infrastructure solutions, automate operational tasks, and maintain high availability of services. This is a contract position based in West Jakarta.
At Aether Digital Indonesia, we believe in leveraging cutting-edge technology to deliver exceptional digital experiences. Our engineering team is dedicated to building resilient systems that can handle millions of users. We are looking for a passionate engineer who thrives in a fast-paced environment and is excited about solving complex infrastructure challenges.
In this role, you will be responsible for managing our cloud infrastructure on AWS, implementing CI/CD pipelines, monitoring system health, and ensuring disaster recovery plans are in place. You will also mentor junior engineers and contribute to the overall architecture of our platform.
If you are a proactive problem-solver with a strong background in DevOps and SRE practices, we would love to hear from you. Join us in shaping the future of digital technology in Indonesia.
Responsibilities
- Design, implement, and maintain cloud infrastructure on AWS/GCP to support scalable web applications.
- Develop and maintain CI/CD pipelines to automate software deployments and infrastructure changes.
- Monitor system performance and reliability, ensuring SLAs are met through proactive incident response and root cause analysis.
- Manage container orchestration using Kubernetes and Docker for microservices deployment.
- Collaborate with software engineers to architect resilient systems and optimize resource utilization.
- Implement security best practices and ensure compliance with industry standards.
- Document system configurations, runbooks, and operational procedures.
- Provide on-call support for production incidents and drive continuous improvement.
Qualifications
- Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent experience).
- 3+ years of experience in DevOps, SRE, or similar roles.
- Strong proficiency with cloud providers (AWS or GCP) and infrastructure-as-code tools like Terraform.
- Experience with containerization (Docker) and orchestration (Kubernetes).
- Solid understanding of CI/CD tools (Jenkins, GitLab CI, GitHub Actions).
- Proficiency in scripting languages (Python, Bash, or Go).
- Excellent problem-solving skills and the ability to work in a team environment.
- Good communication skills in English.