An official implementation for " UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"
-
Updated
Jul 25, 2024 - Python
An official implementation for " UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"
Video-Text Representation Learning via Differentiable Weak Temporal Alignment (CVPR 2022)
Capstone project for UPSchool AI First Developer Program
Add a description, image, and links to the video-text topic page so that developers can more easily learn about it.
To associate your repository with the video-text topic, visit your repo's landing page and select "manage topics."