
Overview
Through the codelab, you will employ a step-by-step approach as follows: → → Utilize an existing Hugging Face Dataset of Yoga poses (JSON format). → Enhance the dataset with descriptions generated by the Gemini API. → Use Langchain and Firestore integration to create a collection with vector embeddings in Firestore. → Create a composite index in Firestore for efficient vector search. → Build an interactive Flask web application featuring: → Vector search for pose recommendations with metadata filtering (e.g., expertise level). → Real-time audio instruction generation using the Gemini Live API with voice selection. → Conversational follow-up capabilities for audio instructions. → Web search integration using the Gemini Live API and Google Search tool for broader yoga queries with audio responses. → Text-to-image generation related to web search conversations. → Deploy the application (optionally) to Google Cloud Run.
Intro to Gemini Live API and Chirp Voices and Imagen3 API
Build out an enhanced version of the Yoga Poses recommender application that uses vector search incorporating multimodal features like audio instructions, web search integration, and image generation.
Published At: June 4, 2025
Last Updated At: June 7, 2025
4 Likes
Get Started with Gradus
Join the Gradus and create codelabs to help developers grow, enhance their skills, and contribute to building a stronger developer ecosystem within your network.
Sign Up Now Sign In