Information Technology · Full Time · Verified

Senior AI Architect: Generative Models

Nexus Horizon Labs
San Francisco
Estimated Salary
USD 180,000 – USD 260,000
Live Update
1 Jul 2026
Deadline
1 Jul 2027

Job Description

We are seeking a visionary Senior AI Architect to lead the next wave of Generative Intelligence at Nexus Horizon Labs. In this pivotal role, you will not just use AI; you will architect the systems that define its future. We are building a platform that scales to millions of users, requiring robust, efficient, and ethically sound machine learning infrastructure.

Join a team of world-class researchers and engineers dedicated to pushing the boundaries of Large Language Models (LLMs), diffusion models, and multimodal AI. If you are passionate about building scalable architectures and have a knack for solving complex data challenges, we want to hear from you.

Responsibilities

  • Design and implement scalable, high-performance inference pipelines for Large Language Models and diffusion architectures.
  • Lead research initiatives to improve model efficiency, reduce latency, and lower inference costs.
  • Architect robust Retrieval-Augmented Generation (RAG) systems to enhance model accuracy and context awareness.
  • Collaborate closely with product and engineering teams to translate AI capabilities into user-centric features.
  • Mentor junior engineers and data scientists, fostering a culture of technical excellence and innovation.
  • Ensure code quality, security, and compliance within all AI model deployments.

Qualifications

  • Master’s or PhD in Computer Science, Mathematics, or a related field with a focus on Machine Learning or Deep Learning.
  • 5+ years of professional experience in software engineering and machine learning, specifically with LLMs or Generative AI.
  • Strong proficiency in Python and at least one of PyTorch, TensorFlow, or JAX.
  • Deep understanding of Transformer architectures, Attention mechanisms, and fine-tuning strategies.
  • Experience deploying models to production environments using cloud infrastructure (AWS, GCP, or Azure).
  • Proven track record of optimizing model inference performance and managing large-scale datasets.

Required Skills

Python, PyTorch, TensorFlow, Machine Learning, Deep Learning, Generative AI, LLMs, NLP, Cloud Computing, AWS, GCP, System Design

Ready to Take This Challenge?

Make sure your resume is ready. Submit your application now before the deadline.
