This job is in your area. Enjoy a short commute and work close to home.
Job Description
What You Will Be Doing
- Design and implement production-ready generative AI applications that serve millions of users, from initial architecture through deployment and monitoring
- Build advanced RAG (Retrieval-Augmented Generation) pipelines that combine vector databases, hybrid search, and intelligent caching to deliver sub-second response times
- Develop multimodal AI systems that seamlessly integrate text, vision, and audio capabilities using state-of-the-art models
- Architect scalable microservices that handle thousands of concurrent AI requests while optimizing for cost, latency, and reliability
- Lead code reviews and technical design sessions, establishing best practices and architectural patterns that elevate the entire team's capabilities
- Optimize large language models through fine-tuning techniques to achieve domain-specific performance improvements
- Implement comprehensive MLOps practices including automated testing, ...