Job Description
We are seeking a visionary Senior AI Architect to lead the next wave of Generative Intelligence at Nexus Horizon Labs. In this pivotal role, you will not just use AI; you will architect the systems that define its future. We are building a platform that scales to millions of users, requiring robust, efficient, and ethically sound machine learning infrastructure.
Join a team of world-class researchers and engineers dedicated to pushing the boundaries of Large Language Models (LLMs), diffusion models, and multimodal AI. If you are passionate about building scalable architectures and have a knack for solving complex data challenges, we want to hear from you.
Responsibilities
- Design and implement scalable, high-performance inference pipelines for Large Language Models and diffusion architectures.
- Lead research initiatives to improve model efficiency, reduce latency, and lower inference costs.
- Architect robust Retrieval-Augmented Generation (RAG) systems to enhance model accuracy and context awareness.
- Collaborate closely with product and engineering teams to translate AI capabilities into user-centric features.
- Mentor junior engineers and data scientists, fostering a culture of technical excellence and innovation.
- Ensure code quality, security, and compliance within all AI model deployments.
Qualifications
- Masterβs or PhD in Computer Science, Mathematics, or a related field with a focus on Machine Learning or Deep Learning.
- 5+ years of professional experience in software engineering and machine learning, specifically with LLMs or Generative AI.
- Strong proficiency in Python, PyTorch, TensorFlow, or JAX.
- Deep understanding of Transformer architectures, Attention mechanisms, and fine-tuning strategies.
- Experience deploying models to production environments using cloud infrastructure (AWS, GCP, or Azure).
- Proven track record of optimizing model inference performance and managing large-scale datasets.