RLAIF-V
RLAIF-V is an open-source framework for aligning Multimodal Large Language Models to achieve super GPT-4V trustworthiness. Presented as a highlight at CVPR 2025, the system utilizes a fully open-source paradigm that combines high-quality AI-generated feedback data with online feedback learning algorithms. The project provides access to the RLAIF-V-7B and RLAIF-V-12B model weights, along with the RLAIF-V-Dataset, which contains over 83,000 high-quality preference pairs generated across diverse tasks and models like LLaVA and MiniCPM-V. Key capabilities include significantly reducing hallucinations in generative and discriminative tasks, offering scalable inference rewards that improve performance with increased budget, and supporting LoRA training for efficient fine-tuning. The framework is designed to minimize reliance on closed-source models by leveraging robust open-source feedback to enhance model reliability and generalization.