Skip to content
@dvlab-research

DV Lab

Deep Vision Lab

Pinned

  1. LISA LISA Public

    Project Page for "LISA: Reasoning Segmentation via Large Language Model"

    Python 1.5k 103

  2. LongLoRA LongLoRA Public

    Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

    Python 2.5k 255

  3. MGM MGM Public

    Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

    Python 3k 274

  4. LLaMA-VID LLaMA-VID Public

    Official Implementation for LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models

    Python 604 38

  5. Video-P2P Video-P2P Public

    Video-P2P: Video Editing with Cross-attention Control

    Python 339 23

  6. LLMGA LLMGA Public

    This project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant'

    Python 261 17

Repositories

Showing 10 of 63 repositories
  • LongLoRA Public

    Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

    Python 2,504 Apache-2.0 255 42 2 Updated Jun 2, 2024
  • MR-GSM8K Public

    Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs

    Python 34 0 2 0 Updated Jun 1, 2024
  • MGM Public

    Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

    Python 3,036 Apache-2.0 274 49 2 Updated May 4, 2024
  • LISA Public

    Project Page for "LISA: Reasoning Segmentation via Large Language Model"

    Python 1,513 Apache-2.0 103 54 1 Updated Apr 8, 2024
  • GroupContrast Public

    [CVPR 2024] GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding

    38 MIT 1 2 0 Updated Mar 15, 2024
  • Video-P2P Public

    Video-P2P: Video Editing with Cross-attention Control

    Python 339 23 5 0 Updated Mar 12, 2024
  • Parametric-Contrastive-Learning Public

    Parametric Contrastive Learning (ICCV2021) & GPaCo (TPAMI 2023)

    Python 231 MIT 29 5 0 Updated Feb 29, 2024
  • Prompt-Highlighter Public

    [CVPR 2024] Prompt Highlighter: Interactive Control for Multi-Modal LLMs

    Python 104 MIT 2 2 0 Updated Jan 25, 2024
  • LLMGA Public

    This project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant'

    Python 261 Apache-2.0 17 3 0 Updated Jan 23, 2024
  • LLaMA-VID Public

    Official Implementation for LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models

    Python 604 Apache-2.0 38 29 0 Updated Jan 10, 2024