Overview

Through the codelab, you will employ a step-by-step approach as follows: → → Utilize an existing Hugging Face Dataset of Yoga poses (JSON format). → Enhance the dataset with descriptions generated by the Gemini API. → Use Langchain and Firestore integration to create a collection with vector embeddings in Firestore. → Create a composite index in Firestore for efficient vector search. → Build an interactive Flask web application featuring: → Vector search for pose recommendations with metadata filtering (e.g., expertise level). → Real-time audio instruction generation using the Gemini Live API with voice selection. → Conversational follow-up capabilities for audio instructions. → Web search integration using the Gemini Live API and Google Search tool for broader yoga queries with audio responses. → Text-to-image generation related to web search conversations. → Deploy the application (optionally) to Google Cloud Run.

Google Vertex AI
Google Cloud
Firebase
Gemini
Beginner

Intro to Gemini Live API and Chirp Voices and Imagen3 API

Build out an enhanced version of the Yoga Poses recommender application that uses vector search incorporating multimodal features like audio instructions, web search integration, and image generation.

Published At: June 4, 2025

Last Updated At: June 7, 2025

4 Likes

Author

Deepansh Singh

@deepansharya1111

Get Started with Gradus

Join the Gradus and create codelabs to help developers grow, enhance their skills, and contribute to building a stronger developer ecosystem within your network.

Sign Up Now Sign In