Job Description
About the Opportunity
We are pioneering the Technology Stack of 2026, focusing on autonomous agents, neural interfaces, and next-generation generative AI. As a Lead AI Architect, you will be at the forefront of building the infrastructure that powers the future. You won't just be maintaining systems; you will be defining the architectural standards for the next era of digital intelligence.
Why Join Us?
- Work on cutting-edge projects that define the roadmap for 2026 and beyond.
- Competitive compensation package with equity opportunities.
- Remote-first culture with hubs in San Francisco.
Responsibilities
- Architect and implement high-performance inference engines for Large Language Models (LLMs) and Agentic AI workflows.
- Design scalable distributed systems capable of processing petabytes of multimodal data in real-time.
- Lead the migration towards edge computing and decentralized AI networks to reduce latency.
- Optimize model latency, memory usage, and resource utilization across cloud and hybrid environments.
- Collaborate with product leaders to translate futuristic concepts into concrete, deployable technical solutions.
Qualifications
- 8+ years of experience in software engineering with a strong focus on AI/ML infrastructure and architecture.
- Deep expertise in Python, C++, and Rust for performance-critical applications.
- Strong proficiency in Kubernetes, Docker, and container orchestration.
- Experience deploying, fine-tuning, and serving open-source LLMs (Llama 3, Mistral, etc.).
- Excellent knowledge of cloud platforms (AWS, GCP, or Azure) and serverless architectures.