Job Description
Shape the Future of Synthetic Intelligence.
Nexus AI Labs is pioneering the next evolution in generative AI. We are seeking a visionary Senior AI Engineer to lead the architecture, training, and deployment of the 2026.5 Large Language Model. This is not just a job; it is an opportunity to define how machines understand and generate human-like reasoning.
Our Mission: We aim to bridge the gap between narrow AI and Artificial General Intelligence (AGI) by creating models that are not only capable but also safe, efficient, and deeply integrated into enterprise workflows.
Why Nexus?
- Work with cutting-edge infrastructure (NVIDIA H100 clusters).
- Competitive equity and comprehensive benefits package.
- Unlimited PTO and a culture of radical transparency.
Responsibilities
- Model Engineering: Design and implement fine-tuning strategies for the 2026.5 architecture, focusing on hallucination reduction and factual accuracy.
- System Optimization: Reduce inference latency and memory footprint through quantization techniques and model distillation.
- MLOps Pipeline: Build and maintain robust CI/CD pipelines for model training, evaluation, and deployment on Kubernetes.
- Research Integration: Collaborate with our research team to integrate novel attention mechanisms and transformer variants.
- Performance Tuning: Conduct rigorous benchmarking to verify that the model outperforms competing models in logic, coding, and creative writing tasks.
- Technical Leadership: Mentor junior engineers and conduct code reviews to maintain high engineering standards.
Qualifications
- Education: MS or PhD in Computer Science, Mathematics, Statistics, or a related field.
- Experience: 5+ years of professional experience in deep learning, specifically with LLMs or other transformer-based models.
- Tools: Proficiency in PyTorch, TensorFlow, or JAX; strong experience with the Hugging Face ecosystem.
- Infrastructure: Deep understanding of distributed training, GPU optimization, and cloud platforms (AWS/GCP/Azure).
- Programming: Expert-level Python skills, with experience in C++ for performance-critical components.
- Soft Skills: Excellent communication skills and a passion for solving complex problems.