Job Description
About Us
We are Horizon Systems, a leading innovator in artificial intelligence and machine learning infrastructure. We are building the next generation of autonomous agents and large-scale generative models designed to transform industries. We are seeking a visionary Senior Generative AI Engineer to join our elite engineering team in Seattle.
The Role
In this role, you will architect, train, and deploy Large Language Models (LLMs) and multimodal systems. You will work at the intersection of research and production engineering, optimizing model performance and scalability for real-world applications.
Why Join Us?
- Work on projects that define the future of AI technology.
- Competitive compensation and equity packages.
- Flexible remote-first culture with a dynamic Seattle hub.
- Access to top-tier computing resources and the latest research.
Responsibilities
- Model Development: Design and implement state-of-the-art generative models, including LLMs and diffusion models, using PyTorch and TensorFlow.
- Optimization: Fine-tune pre-trained models for specific domains, focusing on efficiency, latency, and memory consumption.
- Deployment: Lead the deployment of models into production environments using Kubernetes and cloud infrastructure (AWS/GCP).
- R&D: Track and evaluate emerging research in AI safety, alignment, and novel architectures.
- Collaboration: Partner with product managers and data scientists to translate complex research into scalable software solutions.
Qualifications
- Experience: 5+ years in software engineering, including at least 3 years in machine learning or AI.
- Technical Skills: Proficiency in Python, C++, and deep learning frameworks (PyTorch or TensorFlow).
- Education: MS or PhD in Computer Science, Artificial Intelligence, or a related quantitative field.
- Knowledge: Strong understanding of NLP, transformer architectures, and prompt engineering best practices.
- Problem Solving: Ability to debug complex distributed systems and optimize training pipelines.