Information & Communication Technology 🏢 Full Time ⭐️ Verified

Program Manager II, Data Center Incidents and Availability

Google
Singapore
Estimated Salary
SGD 140,000 – SGD 200,000
Latest
Live Update
27 June 2026
Deadline
27 Jun 2027

Job Description

Are you ready to make a significant impact on the reliability and availability of Google’s global infrastructure? We are seeking a Program Manager II to lead our data center incident management and prevention efforts in Singapore. In this pivotal role, you will be responsible for overseeing the entire incident lifecycle, from detection and response to resolution and post-mortem analysis. You will work closely with engineering teams to drive continuous improvement, ensuring that our data centers operate with peak efficiency and resilience. If you thrive in a fast-paced environment and have a passion for solving complex technical challenges, we want to hear from you.

As a Program Manager in our Technical Infrastructure team, you will bridge the gap between technical execution and business strategy. You will manage cross-functional projects, coordinate with vendors, and ensure that availability targets are met while maintaining the highest standards of safety and security.

Responsibilities

  • Lead the incident management lifecycle for critical data center outages, ensuring rapid response and resolution.
  • Implement and maintain high availability standards and reliability engineering processes across infrastructure.
  • Collaborate with engineering, operations, and support teams to resolve complex technical issues and prevent recurrence.
  • Drive root cause analysis (RCA) and develop preventative measures to minimize future incidents.
  • Manage cross-functional project plans, timelines, and resource allocation for infrastructure initiatives.
  • Monitor system performance and key metrics to proactively identify potential risks before they impact availability.
  • Communicate status updates clearly to stakeholders and executive leadership, both during incidents and in routine operations.

Qualifications

  • Bachelor’s degree in Computer Science, Engineering, or a related technical field (Master’s degree preferred).
  • 5+ years of experience in program management, incident management, or systems engineering.
  • Proven track record of managing high-availability systems, disaster recovery, and crisis management.
  • Strong understanding of data center infrastructure, networking, and hardware components.
  • Excellent analytical and problem-solving skills with the ability to work under pressure.
  • Experience with incident response frameworks and crisis management tools.
  • Fluency in English with strong written and verbal communication skills.

Required Skills

Program Management, Incident Management, Data Center Operations, Availability Management, Cross-functional Collaboration, Root Cause Analysis, Reliability Engineering, Crisis Management
