Job Description
We are seeking a visionary Senior AI Research Engineer to join our elite team at FutureScale Labs. As we architect the intelligent systems of tomorrow, we are looking for a technical leader who is passionate about pushing the boundaries of Generative AI. In this role, you will be instrumental in developing the foundational models and algorithms that will define the AI landscape of 2026 and beyond.
You will work in a high-performance environment focused on cutting-edge research, safety, and scalability. If you are obsessed with Large Language Models (LLMs), multimodal architectures, and ethical AI deployment, this is the opportunity to build the future.
Responsibilities
- Design, train, and fine-tune proprietary Generative AI models, including LLMs and diffusion models, tailored for enterprise-grade applications.
- Lead research initiatives focused on improving model reasoning, context retention, and hallucination reduction for 2026-era capabilities.
- Implement and optimize inference pipelines using GPU clusters to ensure low-latency, high-throughput deployment.
- Collaborate with product teams to translate complex AI research into robust, user-facing features.
- Establish best practices for model evaluation, safety alignment, and continuous learning systems.
- Mentor junior engineers and researchers, fostering a culture of technical excellence and innovation.
Qualifications
- Masterβs or PhD degree in Computer Science, Mathematics, or a related field, with a focus on Machine Learning or Artificial Intelligence.
- 5+ years of professional experience in building, deploying, or researching deep learning models (PyTorch or TensorFlow).
- Deep understanding of Transformer architectures, attention mechanisms, and natural language processing.
- Proven track record of publishing research in top-tier conferences (NeurIPS, ICML, ACL) or contributing to open-source LLM communities.
- Strong programming skills in Python, with experience in distributed computing frameworks (e.g., Ray, Spark) and cloud infrastructure (AWS/GCP/Azure).
- Experience with prompt engineering, RAG (Retrieval-Augmented Generation), and fine-tuning techniques.