Information Technology 🏢 Full Time ⭐️ Verified

Senior Generative AI Engineer (LLM Specialist)

Nexus AI Labs
San Francisco
Estimated Salary
USD 180,000 – USD 250,000
Live Update
14 May 2026
Deadline
14 May 2027

Job Description

Be at the Forefront of the AI Revolution

We are seeking a visionary Senior Generative AI Engineer to join Nexus AI Labs in San Francisco. In this pivotal role, you will architect scalable Large Language Model (LLM) applications, implement sophisticated Retrieval-Augmented Generation (RAG) architectures, and fine-tune foundation models to solve complex, high-impact business problems. You will work closely with cross-functional teams of data scientists, researchers, and product managers to deploy ethical, reliable, and performant AI solutions.

Why Join Us?

  • Work on cutting-edge AI research and production deployments.
  • Competitive compensation package and equity options.
  • Flexible remote and hybrid work culture.

Responsibilities

  • Architecture & Development: Design and implement robust, scalable LLM pipelines and RAG systems using Python and modern cloud infrastructure (AWS/GCP).
  • Model Fine-tuning: Fine-tune open-source LLMs (e.g., LLaMA, Mistral, Falcon) and adapt commercial models accessed through APIs (OpenAI, Anthropic) for domain-specific tasks.
  • Performance Optimization: Optimize model inference latency and cost-efficiency using techniques like quantization and distillation.
  • Data Engineering: Build and maintain high-quality datasets for training and evaluation, ensuring data privacy and security compliance.
  • Collaboration: Partner with product and engineering teams to define AI product requirements and translate them into technical specifications.

Qualifications

  • Education: Bachelor’s or Master’s degree in Computer Science, Machine Learning, or a related technical field.
  • Experience: 5+ years of experience in software engineering or machine learning, with at least 2 years specifically focused on Generative AI or NLP.
  • Technical Skills: Deep proficiency in Python and in PyTorch or TensorFlow. Experience with Hugging Face Transformers and LangChain.
  • Frameworks: Familiarity with vector databases (Pinecone, Milvus, Weaviate) and vector embeddings.
  • Communication: Excellent written and verbal communication skills with the ability to explain complex technical concepts to non-technical stakeholders.

Required Skills

Python, Machine Learning, NLP, Large Language Models, LLMs, Deep Learning, PyTorch, TensorFlow, RAG, Prompt Engineering, Generative AI, OpenAI API, Vector Databases

Ready to Take This Challenge?

Make sure your resume is ready. Submit your application now before the deadline.

