In this video, I walk through using the new Gemini 2.5 Pro for audio transcription and audio analysis tasks, and show you how to get the best results out of it.
Colab: https://dripl.ink/mXQLh
Pricing: https://blog.google/products/gemini/g...
Gemini 2.5 Pro Capabilities: https://cloud.google.com/vertex-ai/ge...
For more tutorials on using LLMs and building agents, check out my Patreon
Patreon: / samwitteveen
Twitter: https://x.com/Sam_Witteveen
🕵️ Interested in building LLM Agents? Fill out the form below
Building LLM Agents Form: https://drp.li/dIMes
👨‍💻 GitHub:
https://github.com/samwit/llm-tutorials
⏱️ Timestamps:
00:00 Intro
00:19 Gemini 2.5 Pro Experimental Blog
01:03 Gemini 2.5 Pro Capabilities
01:27 Output Tokens
02:01 Pricing
02:30 Supported Audio Formats
02:43 Technical Details About Audio
05:25 Demo (Colab)
06:43 Audio Diarization Process