Job Description
We are at the forefront of the AI revolution leading into 2026, building the next generation of autonomous agents and generative models. We are seeking a visionary Senior AI Engineer to help define the technical roadmap that will scale our infrastructure to meet the demands of a post-2024 era.
In this role, you will bridge the gap between cutting-edge research and production-grade systems. You will work on deploying Large Language Models (LLMs) with enhanced reasoning capabilities and optimizing inference pipelines for real-time applications.
Why join us?
- Future-First Culture: We are not just building for today; we are architecting the solutions for 2026 and beyond.
- Top-Tier Talent: Collaborate with PhDs and industry veterans in a state-of-the-art facility.
- Impactful Work: Your code will power the next generation of intelligent systems.
Responsibilities
- Architect and implement scalable machine learning pipelines for Generative AI and LLM applications.
- Optimize model inference performance using techniques like quantization, pruning, and distillation.
- Lead the technical strategy for our 2026 roadmap, evaluating emerging paradigms like Agentic AI.
- Collaborate with data scientists to fine-tune foundation models on proprietary datasets.
- Ensure production safety and ethical AI compliance across all deployed models.
- Conduct code reviews and mentor junior engineers on best practices for MLOps.
Qualifications
- Masterβs or PhD in Computer Science, Mathematics, or a related field.
- 5+ years of professional experience in machine learning engineering or applied research.
- Deep proficiency in Python, PyTorch, or TensorFlow.
- Extensive experience with vector databases (e.g., Pinecone, Milvus) and RAG architectures.
- Strong understanding of distributed systems and cloud infrastructure (AWS/GCP).
- Track record of deploying production-ready ML models.