Job Description
Be at the Forefront of the AI Revolution
We are seeking a visionary Senior Generative AI Engineer to join Nexus AI Labs in San Francisco. In this pivotal role, you will architect scalable Large Language Model (LLM) applications, implement sophisticated Retrieval-Augmented Generation (RAG) architectures, and fine-tune foundation models to solve complex, high-impact business problems. You will work closely with cross-functional teams of data scientists, researchers, and product managers to deploy ethical, reliable, and performant AI solutions.
Why Join Us?
- Work on cutting-edge AI research and production deployments.
- Competitive compensation package and equity options.
- Flexible remote and hybrid work culture.
Responsibilities
- Architecture & Development: Design and implement robust, scalable LLM pipelines and RAG systems using Python and modern cloud infrastructure (AWS/GCP).
- Model Fine-tuning: Fine-tune open-source LLMs (e.g., LLaMA, Mistral, Falcon) and customize models served through commercial APIs (OpenAI, Anthropic) for domain-specific tasks.
- Performance Optimization: Reduce model inference latency and cost using techniques such as quantization and distillation.
- Data Engineering: Build and maintain high-quality datasets for training and evaluation, ensuring data privacy and security compliance.
- Collaboration: Partner with product and engineering teams to define AI product requirements and translate them into technical specifications.
Qualifications
- Education: Bachelor’s or Master’s degree in Computer Science, Machine Learning, or a related technical field.
- Experience: 5+ years in software engineering or machine learning, including at least 2 years focused specifically on Generative AI or NLP.
- Technical Skills: Deep proficiency in Python and in PyTorch or TensorFlow. Experience with Hugging Face Transformers and LangChain.
- Frameworks: Familiarity with vector databases (Pinecone, Milvus, Weaviate) and with generating and querying vector embeddings.
- Communication: Excellent written and verbal communication skills with the ability to explain complex technical concepts to non-technical stakeholders.