Job Description
We are seeking a visionary Senior AI Engineer to join our elite technical team in San Francisco. At Nexus Future Systems, we are building the next generation of autonomous intelligent agents and large-scale language models for 2026 and beyond. If you are passionate about pushing the boundaries of what is possible with Generative AI, PyTorch, and Deep Learning, we want to hear from you.
In this role, you will bridge the gap between theoretical research and production-grade software, ensuring our AI systems are scalable, efficient, and ethically sound.
Responsibilities
- Model Development: Design, train, and fine-tune state-of-the-art Large Language Models (LLMs) and generative AI agents.
- Infrastructure Optimization: Deploy models on scalable cloud infrastructure (AWS/GCP) using Docker and Kubernetes to ensure high availability.
- RAG Pipelines: Architect and maintain Retrieval-Augmented Generation systems to enhance model accuracy and reduce hallucinations.
- Code Review & Mentorship: Lead code reviews for junior engineers and provide technical mentorship to foster a culture of excellence.
- Performance Tuning: Continuously monitor model inference latency and optimize computational efficiency.
- Research Integration: Stay ahead of the curve by integrating cutting-edge research papers and open-source innovations into our production stack.
Qualifications
- Education: Masterβs or PhD in Computer Science, Mathematics, or a related field, or equivalent practical experience.
- Programming: Proficiency in Python with deep understanding of PyTorch or TensorFlow.
- Experience: 5+ years of experience in machine learning engineering, NLP, or deep learning.
- Frameworks: Extensive experience with Hugging Face Transformers, LangChain, or similar agent frameworks.
- Cloud: Strong experience deploying models on AWS or GCP.
- Tools: Familiarity with MLOps tools (MLflow, Weights & Biases) and CI/CD pipelines.