mbzuai-oryx

Professional software vendor delivering innovative solutions on the Softono platform. Specialized in both open-source and proprietary software development.

Open Source

Video-ChatGPT

Video-ChatGPT is an advanced video conversation model designed to generate meaningful and detailed discussions about video content. Introduced by researchers at the Mohamed bin Zayed University of Artificial Intelligence and presented at ACL 2024, it bridges the gap between visual perception and language understanding. The architecture combines the generative capabilities of large language models with a pretrained visual encoder specifically adapted for spatiotemporal video representation. This integration enables the system to interpret complex visual dynamics and answer questions, summarize events, or engage in dialogue regarding video inputs. The project establishes a rigorous quantitative evaluation benchmark to assess performance across diverse video-based conversational tasks. It has demonstrated state-of-the-art results in zero-shot video question answering on datasets such as MSVD-QA, MSRVTT-QA, TGIF-QA, and ActivityNet, as well as in generative performance benchmarks. The software facilitates applica

LLM Tools & Chat UIs ML Frameworks

1.5K Github Stars

mbzuai-oryx

Software by mbzuai-oryx

Video-ChatGPT