LaVi-Lab/Visual-Table

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/LaVi-Lab/Visual-Table)

LaVi-Lab / Visual-Table

[EMNLP 2024] Official code for "Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models"

☆20

Alternatives and similar repositories for Visual-Table

Users that are interested in Visual-Table are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

LaVi-Lab / Rethink_CoT_Video
View on GitHub
Official code for "Rethinking Chain-of-Thought Reasoning for Videos"
☆21Dec 14, 2025Updated 7 months ago
Seeing-Fast-and-Slow / Seeing-Fast-and-Slow
View on GitHub
☆16May 28, 2026Updated last month
Han-Zongbo / Skip-n
View on GitHub
This repository contains the code of our paper 'Skip \n: A simple method to reduce hallucination in Large Vision-Language Models'.
☆15Feb 12, 2024Updated 2 years ago
Gary-code / Awesome-LVLM-paper
View on GitHub
List of papers about Large Multimodal model
☆30May 31, 2025Updated last year
HanSolo9682 / CounterCurate
View on GitHub
This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.
☆19Jun 27, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
alibaba / alimama-video-narrator
View on GitHub
Research code for ACL2024 paper: "Synchronized Video Storytelling: Generating Video Narrations with Structured Storyline"
☆42Dec 27, 2024Updated last year
boyazeng / understand_bias
View on GitHub
Code release for "Understanding Bias in Large-Scale Visual Datasets"
☆25Dec 4, 2024Updated last year
pkunlp-icler / MIC
View on GitHub
MMICL, a state-of-the-art VLM with the in context learning ability from ICL, PKU
☆49Jul 13, 2025Updated last year
yfzhang114 / SliME
View on GitHub
✨✨Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models
☆163Dec 26, 2024Updated last year
GasolSun36 / MVP
View on GitHub
Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning
☆24Sep 9, 2024Updated last year
locuslab / llava-token-compression
View on GitHub
☆47Nov 8, 2024Updated last year
Ken-Chy129 / student-course-choosing
View on GitHub
基于 Spring Boot + Redis + RabbitMQ 的高并发学生选课系统，支持选退课、课程管理、实时消息通知
☆11Mar 31, 2026Updated 3 months ago
Gary-code / KECVQG
View on GitHub
[ACM MM 2023] The released code of paper "Deconfounded Visual Question Generation with Causal Inference"
☆10Sep 3, 2024Updated last year
Yuqifan1117 / HalluciDoctor
View on GitHub
HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction Data (Accepted by CVPR 2024)
☆52Jul 16, 2024Updated 2 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
whwu95 / FreeVA
View on GitHub
FreeVA: Offline MLLM as Training-Free Video Assistant
☆69Jun 9, 2024Updated 2 years ago
takomc / amp
View on GitHub
【NeurIPS 2024】The official code of paper "Automated Multi-level Preference for MLLMs"
☆22Sep 26, 2024Updated last year
BeSpontaneous / Proteus-pytorch
View on GitHub
Proteus (ICLR2025)
☆61Mar 26, 2025Updated last year
Luodian / nano-hevc
View on GitHub
A minimal, educational HEVC (H.265) encoder written in Python.
☆53Feb 23, 2026Updated 4 months ago
umd-huang-lab / perceptionCLIP
View on GitHub
Code for our ICLR 2024 paper "PerceptionCLIP: Visual Classification by Inferring and Conditioning on Contexts"
☆80May 5, 2024Updated 2 years ago
apple2373 / figure-separator
View on GitHub
compound figure separation tool
☆22Jun 13, 2024Updated 2 years ago
zhangjiewu / awesome-t2i-eval
View on GitHub
A curated list of papers and resources for text-to-image evaluation.
☆30Sep 6, 2023Updated 2 years ago
AlenUbuntu / Awesome-Vision-and-Language-PreTrain-Papers
View on GitHub
☆14Dec 25, 2020Updated 5 years ago
findalexli / mllm-dpo
View on GitHub
[ACL 2024] Multi-modal preference alignment remedies regression of visual instruction tuning on language model
☆48Nov 10, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ChenDelong1999 / instruct-flamingo
View on GitHub
🚀 Codebase and Fondation Models for Visual Instruction Tuning
☆14Aug 19, 2023Updated 2 years ago
ImperialNLP / BertGen
View on GitHub
Training and evaluation codes for the BertGen paper (ACL-IJCNLP 2021)
☆11Sep 17, 2023Updated 2 years ago
IVUL-KAUST / VideoAuto-R1
View on GitHub
[CVPR2026] VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice
☆88Feb 27, 2026Updated 4 months ago
penghao-wu / visual_jigsaw
View on GitHub
☆78Apr 9, 2026Updated 3 months ago
nytopop / csm
View on GitHub
A Conversational Speech Generation Model
☆14Mar 16, 2025Updated last year
shubhamprshr27 / NeglectedTailsVLM
View on GitHub
This repository houses the code for the paper - "The Neglected of VLMs"
☆30Dec 31, 2025Updated 6 months ago
tsb0601 / MultiMon
View on GitHub
☆25Jun 22, 2023Updated 3 years ago
MonolithFoundation / Bumblebee
View on GitHub
A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.
☆38Sep 9, 2024Updated last year
JinchaoLove / CUHK-PhD-Thesis-Template
View on GitHub
Latex template for CUHK PhD Thesis
☆14Jun 29, 2025Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
bronyayang / HallE_Control
View on GitHub
HallE-Control: Controlling Object Hallucination in LMMs
☆32Apr 10, 2024Updated 2 years ago
utter-project / fairseq
View on GitHub
This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.
☆21Nov 19, 2024Updated last year
wllmzhu / G-VUE
View on GitHub
General-purpose Visual Understanding Evaluation
☆20Dec 21, 2023Updated 2 years ago
ys-zong / FoolyourVLLMs
View on GitHub
[ICML 2024] Fool Your (Vision and) Language Model With Embarrassingly Simple Permutations
☆15Oct 28, 2023Updated 2 years ago
cutopia-labs / CUtopia
View on GitHub
Course review and timetable planning platform used by thousands of CUHK students
☆13Aug 19, 2024Updated last year
IntMeGroup / MINT-IQA
View on GitHub
[TMM] MINT-IQA: Quality Assessment for AI Generated Images with Instruction Tuning
☆21Nov 21, 2025Updated 8 months ago
liang2kl / simpledb
View on GitHub
清华大学计算机系《数据库系统概论》2022 年大作业项目 DBMS，支持基础 SQL 的解析和执行。
☆12Jan 12, 2023Updated 3 years ago