Video to Text: Natural language description generator for some given video. [Video Captioning]
-
Updated
May 3, 2022 - Python
Video to Text: Natural language description generator for some given video. [Video Captioning]
A desktop application that transcribes audio from files, microphone input or YouTube videos with the option to translate the content and create subtitles.
A real-time video caption to conversation bot that captures frames generates captions and creates conversational responses using a Large Language Models base to create interactive video descriptions.
Everything is very simple: you either download a picture file or specify its link when running a python script, and output you get a text file, and you can immediately view on the command line how it will look the result of your conversion.
Generate captions for videos using the power of OpenAI's Whisper API
Source code of the paper titled *Improving Video Captioning with Temporal Composition of a Visual-Syntactic Embedding*
Generating video descriptions using deep learning in Keras
Generate subtitles for all the videos in a folder with OpenAI's Whisper privately in your computer.
Convert images or videos to ASCII in the terminal
Source code of the paper titled *Attentive Visual Semantic Specialized Network for Video Captioning*
Chrome extension that helps students learn from YouTube videos
Python program able to transcribe a Youtube video to text with the help of AI.
Convert videos into colourful ASCII art for terminal display using Python and OpenCV.
A Python tool for transcribing videos using Whisper
This repository is an implementation of the Wav2Vec2 model for converting speech into text through a series of speech recognition, noise removal and STT to transcribe the text from a video file.
📼 A streamlit web interface designed to extract words from video/audio files into text • Python, FFmpeg, Whisper, YT-DLP
Convert YouTube videos to text files. Why spend 30 minutes watching a video when you can skim the transcript in a couple minutes?
Streamline your video/audio intake by transforming multimedia content into navigable collections of transcribed text and summaries!
Convert a video file or camera captured to display as text.
Add a description, image, and links to the video-to-text topic page so that developers can more easily learn about it.
To associate your repository with the video-to-text topic, visit your repo's landing page and select "manage topics."