Deskripsi Pekerjaan
Join the Agency for Science, Technology and Research (A*STAR) as a High Performance Computing (HPC) System Administrator and play a pivotal role in managing cutting-edge computational infrastructure that drives groundbreaking scientific research. In this position, you will be responsible for ensuring the stability, security, and optimal performance of our HPC systems, supporting researchers in their pursuit of innovation across various scientific disciplines.
As a key member of our IT infrastructure team, you will work with state-of-the-art technology and collaborate with leading scientists and researchers. This role offers an exceptional opportunity to contribute to Singapore's scientific advancement while developing expertise in high-performance computing systems, which are increasingly vital in today's research landscape.
If you are passionate about technology, enjoy solving complex system challenges, and want to make a meaningful impact in scientific research, we invite you to apply for this exciting position at A*STAR.
Tanggung Jawab
- Manage and maintain high-performance computing (HPC) systems, ensuring optimal performance and availability for researchers
- Implement and maintain security protocols to protect sensitive research data and computational resources
- Monitor system performance, troubleshoot issues, and implement solutions to ensure stability
- Install, configure, and update software and hardware components to meet research requirements
- Provide technical support and training to researchers on HPC systems and best practices
- Develop and document system procedures, policies, and disaster recovery plans
- Collaborate with research teams to understand their computational needs and provide tailored solutions
- Stay current with emerging HPC technologies and recommend improvements to existing infrastructure
Kualifikasi
- Bachelor's degree in Computer Science, Information Technology, or related field
- Minimum of 3 years of experience in system administration, preferably with HPC environments
- Strong knowledge of Linux/Unix operating systems and networking fundamentals
- Experience with cluster management systems and job schedulers (e.g., Slurm, PBS)
- Familiarity with storage systems and high-speed interconnects (InfiniBand, etc.)
- Excellent problem-solving skills and ability to work under pressure
- Strong communication skills and ability to work effectively with researchers from diverse backgrounds
- Certifications in relevant technologies (e.g., Linux, Cloud) are advantageous