Deskripsi Pekerjaan
Are you a seasoned Infrastructure Engineer with a passion for mission-critical stability? TG Group Pte Ltd is seeking a highly skilled Infrastructure Engineer (Day 2) to join our dynamic technical team in the North-East Region. In this role, you will be the backbone of our operational excellence, ensuring that our infrastructure remains resilient, high-performing, and secure in a fast-paced 24/7 environment.
As an Infrastructure Engineer, you will focus on the 'Day 2' lifecycle—optimizing, scaling, and maintaining production systems. You will collaborate with cross-functional teams to resolve complex technical challenges, enhance system reliability, and implement automation strategies that drive efficiency. If you possess deep troubleshooting expertise and thrive under pressure, we invite you to take the next step in your career with us.
Tanggung Jawab
- Manage, monitor, and maintain mission-critical infrastructure to ensure 99.9% uptime.
- Perform deep-dive troubleshooting and root cause analysis (RCA) for complex system incidents.
- Execute Day 2 operational tasks including performance tuning, capacity planning, and resource optimization.
- Collaborate with engineering teams to deploy patches, updates, and configuration changes in a 24/7 production environment.
- Develop and maintain automated scripts to streamline repetitive operational workflows.
- Document system architectures, operational procedures, and incident recovery playbooks.
- Participate in on-call rotations to provide timely technical support for urgent system issues.
Kualifikasi
- Singaporean citizens only.
- Minimum of 3-5 years of experience in Infrastructure Engineering or Site Reliability Engineering (SRE).
- Strong hands-on expertise in Linux/Windows server administration and network troubleshooting.
- Proven ability to operate effectively within high-stakes, 24/7 production environments.
- Solid understanding of cloud platforms (AWS/Azure/GCP) and virtualization technologies.
- Experience with monitoring tools and observability stacks (e.g., Prometheus, Grafana, ELK).
- Strong analytical mindset with excellent problem-solving and communication skills.
- Certification in relevant infrastructure technologies is highly regarded.