Deskripsi Pekerjaan
Capgemini is looking for a visionary GenAI Architect to join our dynamic team in Kuala Lumpur. As a global leader in consulting and technology services, we are at the forefront of the artificial intelligence revolution. In this strategic role, you will bridge the gap between complex business challenges and cutting-edge generative AI solutions.
You will work closely with C-suite stakeholders and portfolio leaders to identify, architect, and deploy high-ROI GenAI and agentic use cases. Whether it is transforming knowledge work automation or enabling data-driven decision support, you will be the technical lead driving innovation. If you are passionate about Large Language Models (LLMs), prompt engineering, and scalable AI infrastructure, this is your opportunity to shape the future of enterprise intelligence.
Tanggung Jawab
- Partner with business and product leaders to identify and prioritize high-value GenAI use cases.
- Design scalable architectures for LLM integration, RAG (Retrieval-Augmented Generation) pipelines, and agentic workflows.
- Lead the end-to-end development lifecycle of AI solutions, from conceptualization to enterprise-grade production.
- Evaluate and select appropriate AI frameworks, vector databases, and cloud infrastructure (Azure, AWS, or GCP).
- Collaborate with cross-functional data science and engineering teams to ensure ethical AI deployment and robust data governance.
- Define and implement prompt engineering strategies and fine-tuning methodologies to optimize model performance.
- Drive continuous improvement by monitoring model efficacy, latency, and cost-efficiency of deployed AI agents.
Kualifikasi
- Bachelor’s or Master’s degree in Computer Science, AI, Data Science, or a related technical field.
- 5+ years of experience in software architecture, with a minimum of 2 years focused specifically on Generative AI and LLMs.
- Deep understanding of LLM architectures (GPT-4, Claude, Llama 3) and framework expertise (LangChain, LlamaIndex).
- Hands-on experience with vector databases (e.g., Pinecone, Milvus, Weaviate) and MLOps practices.
- Proficiency in Python and familiarity with modern cloud AI services (Azure OpenAI, AWS Bedrock).
- Proven track record of stakeholder management and communicating technical AI concepts to non-technical business leaders.
- Ability to thrive in a fast-paced, collaborative, and global consultancy environment.