Job Description
We are seeking a visionary Senior Generative AI Engineer to join our elite engineering team in San Francisco. As we look toward the 2026 technological horizon, we need a builder who understands the intricacies of Large Language Models (LLMs) and wants to shape the future of human-AI interaction.
In this role, you will be at the forefront of innovation, deploying state-of-the-art AI solutions that drive our product roadmap. You will work in a fast-paced environment where your code will impact millions of users globally.
Why Join Us?
- Competitive compensation and equity packages.
- Access to cutting-edge GPU clusters and cloud infrastructure.
- Collaborative culture with industry leaders in AI and machine learning.
Responsibilities
- Model Development: Design, train, and fine-tune large-scale generative models (LLMs) using PyTorch and TensorFlow.
- Production Deployment: Deploy AI models to scalable cloud environments (AWS/Azure) ensuring high availability and low latency.
- Optimization: Implement techniques such as quantization, pruning, and distillation to optimize model inference performance.
- Research: Stay abreast of the latest academic papers and industry trends in NLP and Generative AI.
- Integration: Integrate AI capabilities into existing software ecosystems using LangChain and API development.
- Mentorship: Guide junior engineers and data scientists, fostering a culture of technical excellence.
Qualifications
- Education: Masterβs degree or PhD in Computer Science, Mathematics, or a related field.
- Experience: 5+ years of professional experience in software engineering or machine learning.
- Technical Skills: Deep expertise in Python, PyTorch, or JAX.
- Frameworks: Proven experience with Hugging Face Transformers, LangChain, or similar NLP frameworks.
- Production Maturity: Demonstrable history of deploying ML models to production environments.
- Problem Solving: Strong analytical skills with a focus on solving complex, ambiguous problems.