DV Lab

LISA Public

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Python 1.5k 103

LongLoRA Public

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Python 2.5k 255

MGM Public

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3k 274

LLaMA-VID Public

Official Implementation for LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models

Python 604 38

Video-P2P Public

Video-P2P: Video Editing with Cross-attention Control

Python 339 23

LLMGA Public

This project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant'

Python 261 17

Provide feedback