Job Description
Are you a seasoned infrastructure specialist who thrives in the high-stakes, fast-paced environment of a scaling startup? Our team is looking for a Senior Platform Infrastructure Engineer to help architect and maintain the backbone of our globally distributed operations. As a startup, we operate differently, prioritizing speed, ownership, and technical excellence over bureaucracy. We are a remote-first organization, so you will have the autonomy to solve complex problems while collaborating with elite talent from around the world.
In this role, you won't just be 'managing servers'; you will be building a resilient, self-healing platform that empowers our developers to deploy with confidence. You will bridge the gap between software engineering and systems operations, ensuring our infrastructure is as code-driven and automated as possible. If you are passionate about Kubernetes, Infrastructure as Code (IaC), and building developer-centric platforms, this is the perfect opportunity for you.
We value engineers who are proactive and can navigate the ambiguity of a startup environment. You will have a direct impact on our technical roadmap, security posture, and overall system reliability. Join us to build the future of our platform and scale your career alongside a company that is redefining its industry.
Responsibilities
- Design, implement, and maintain scalable cloud infrastructure using AWS, GCP, or Azure.
- Architect and manage production-grade Kubernetes clusters, ensuring high availability and performance.
- Develop and maintain Infrastructure as Code (IaC) using Terraform, Pulumi, or CloudFormation.
- Build and optimize CI/CD pipelines to streamline the developer experience and reduce time-to-market.
- Implement comprehensive monitoring, logging, and observability stacks (Prometheus, Grafana, ELK) to ensure 24/7 system health.
- Lead security hardening initiatives across the platform, including IAM, networking, and secret management.
- Collaborate with cross-functional product teams to provide architectural guidance and troubleshooting support.
- Participate in a blameless on-call rotation and lead post-mortem analyses to prevent recurring issues.
Qualifications
- Minimum of 5 years of experience in Platform Engineering, SRE, or Infrastructure roles.
- Expert-level proficiency with Kubernetes (K8s) and container orchestration.
- Strong background in Linux systems administration and networking fundamentals.
- Proven experience with Infrastructure as Code (Terraform preferred).
- Advanced scripting or programming skills in Go, Python, or Ruby.
- Experience working in a globally distributed, remote-first startup environment.
- Deep understanding of cloud security best practices and compliance frameworks.
- Excellent communication skills and the ability to articulate complex technical concepts to non-technical stakeholders.