Deskripsi Pekerjaan
Are you ready to power the backbone of a global technology giant? ByteDance is seeking a skilled Datacenter Operations Engineer (DCO) to join our Infrastructure Engineering team in Singapore. As we continue to scale our hyperscale datacenter footprint to support millions of users worldwide, we need dedicated professionals to ensure our infrastructure remains performant, resilient, and cutting-edge.
In this role, you will be at the intersection of hardware deployment, network operations, and capacity management. You will work within high-pressure environments, collaborating with cross-functional global teams to maintain the heartbeat of our platforms. If you are passionate about large-scale server architecture, troubleshooting complex hardware issues, and optimizing datacenter efficiency, this is the perfect career move for you.
Tanggung Jawab
- Manage and maintain physical server infrastructure, including rack installation, cabling, and power distribution within hyperscale datacenter environments.
- Perform regular health checks and preventative maintenance on server clusters to ensure 99.99% availability.
- Troubleshoot and resolve complex hardware failures, working closely with vendor support and internal engineering teams to minimize downtime.
- Document technical procedures, incident reports, and infrastructure schematics to maintain operational standards.
- Coordinate with global teams for datacenter capacity planning, server refreshes, and site-wide upgrades.
- Monitor environmental and thermal conditions to ensure optimal performance of datacenter hardware.
- Participate in on-call rotations to manage urgent infrastructure incidents and escalations.
Kualifikasi
- Bachelor’s degree in Computer Science, Electrical Engineering, or a related technical discipline.
- 3+ years of professional experience in datacenter operations or critical infrastructure management.
- In-depth knowledge of server hardware architecture (Dell, HP, Supermicro) and rack-mount storage solutions.
- Strong understanding of structured cabling, power management (PDU/UPS), and cooling technologies.
- Experience with Linux OS administration and basic scripting (Python or Bash) for automation tasks.
- Proven ability to thrive in a fast-paced, high-growth environment while adhering to strict safety and security protocols.
- Excellent communication skills with the ability to collaborate effectively across distributed global teams.