Videnza

Videnza

AI Application
2026

Turn any video into actionable intelligence. Paste a YouTube link or pick a local file — Videnza transcribes it, distills key insights, and lets you ask questions through a RAG-powered chatbot.

Videnza is a RAG-powered AI video assistant that transforms video content into structured intelligence. It supports YouTube URLs and local video/audio files, generating accurate transcriptions via Groq Whisper (English) and Sarvam AI (Hinglish). A map-reduce summarization pipeline powered by Mistral AI distills key insights, action items, and decisions, while a Pinecone vector store enables natural-language Q&A over the content. Results are surfaced through a clean dark-themed Streamlit UI with real-time pipeline progress tracking and PDF export for transcripts and insights.

Technologies Used

PythonStreamlitLangChainMistralAIPineconeFastAPI

Key Features

  • YouTube URL & local file upload support
  • English transcription via Groq Whisper
  • Hinglish transcription via Sarvam AI
  • Map-reduce summarization with Mistral AI
  • Automatic extraction of action items, key decisions, and open questions
  • RAG-powered chatbot for querying video content
  • Dark-themed Streamlit UI with pipeline progress tracking
  • PDF export for transcripts and insight reports
  • Audio processing pipeline with yt-dlp, FFmpeg, and pydub
  • Vector-based retrieval using Pinecone and Mistral embeddings