jindongli-Ai / LLM-Discrete-Tokenization-Survey
The official GitHub page for the survey paper "Discrete Tokenization for Multimodal LLMs: A Comprehensive Survey". The paper is under review.
☆77 · Feb 3, 2026 · Updated last week
Alternatives and similar repositories for LLM-Discrete-Tokenization-Survey
Users that are interested in LLM-Discrete-Tokenization-Survey are comparing it to the libraries listed below
- ☆54 · Feb 3, 2026 · Updated last week
- The demo page for ALMTokenizer ☆58 · Apr 14, 2025 · Updated 10 months ago
- Meta-Prompting for Automating Zero-shot Visual Recognition with LLMs (ECCV 2024) ☆19 · Jul 15, 2024 · Updated last year
- Explore how to get VQ-VAE models efficiently! ☆67 · Jul 24, 2025 · Updated 6 months ago
- Repository containing codebase for "FaceOff: A Video-to-Video Face Swapping Network" accepted at WACV 2023 ☆31 · Jan 22, 2023 · Updated 3 years ago
- ☆14 · Mar 12, 2023 · Updated 2 years ago
- [ACL 2025 Main] UniCodec: a unified audio codec with a single codebook to support multi-domain audio data, including speech, music, and s… ☆154 · May 30, 2025 · Updated 8 months ago
- Training, inference, and testing of the SAC speech codec model. ☆96 · Nov 1, 2025 · Updated 3 months ago
- [CVPRW 2025] UniToken is an auto-regressive generation model that combines discrete and continuous representations to process visual inpu… ☆105 · Apr 23, 2025 · Updated 9 months ago
- Codebase for the paper "Elucidating the design space of language models for image generation" ☆46 · Nov 17, 2024 · Updated last year
- Thesis Template ☆10 · Jan 26, 2026 · Updated 3 weeks ago
- ☆13 · Aug 28, 2024 · Updated last year
- Anki add-on that adds Pinyin and Zhuyin readings above Chinese characters in any field. ☆12 · Sep 23, 2025 · Updated 4 months ago
- ☆15 · Nov 27, 2025 · Updated 2 months ago
- ☆10 · Apr 13, 2022 · Updated 3 years ago
- Official Implementation of VarDrop (AAAI25) ☆20 · Oct 23, 2025 · Updated 3 months ago
- [CVPR 2024] LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation ☆13 · Jun 17, 2024 · Updated last year
- ☆303 · May 29, 2025 · Updated 8 months ago
- [CVPR 2024] Dual Prototype Attention for Unsupervised Video Object Segmentation ☆39 · Apr 21, 2024 · Updated last year
- ☆22 · Nov 18, 2025 · Updated 2 months ago
- ☆13 · Jan 22, 2025 · Updated last year
- [🔥ACM MM2025] EchoMask: Speech-Queried Attention-based Mask Modeling for Holistic Co-Speech Motion Generation ☆23 · Dec 30, 2025 · Updated last month
- A small library which can parse TextGrid into json and json into TextGrid ☆14 · Dec 14, 2021 · Updated 4 years ago
- Source codes for the paper "Personalized Dynamic Music Emotion Recognition with Dual-Scale Attention-Based Meta-Learning" (PDMER) which p… ☆14 · Mar 24, 2025 · Updated 10 months ago
- [ACL 2025 Main] (🏆 Outstanding Paper Award) Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Proba… ☆15 · Aug 15, 2025 · Updated 6 months ago
- Nankai University School of Cyberspace Security: Principles of Computer Organization, Spring 2023 ☆13 · Jan 22, 2024 · Updated 2 years ago
- [CVPR 2023] "TrojViT: Trojan Insertion in Vision Transformers" by Mengxin Zheng, Qian Lou, Lei Jiang ☆14 · Jan 5, 2024 · Updated 2 years ago
- Test queries against the State Taxation Administration's national VAT invoice verification platform (https://inv-veri.chinatax.gov.cn/) ☆11 · Jan 3, 2023 · Updated 3 years ago
- [AAAI 2025] Official Implementation of "HDT: Hierarchical Discrete Transformer for Multivariate Time Series Forecasting" ☆16 · Feb 17, 2025 · Updated 11 months ago
- A simple showstart script ☆11 · May 6, 2024 · Updated last year
- Agentic Keyframe Search for Video Question Answering ☆15 · Apr 7, 2025 · Updated 10 months ago
- ☆11 · Sep 27, 2023 · Updated 2 years ago
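- Automatically books appointment slots at Civil Affairs Bureau marriage registration offices; limited to the Guangdong Provincial Civil Affairs Department ☆10 · Jun 25, 2023 · Updated 2 years ago
- ChatGPT/Claude/Midjourney/API/Grok sharing platform with Linux DO support: frontend project ☆12 · Apr 30, 2025 · Updated 9 months ago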
- Reversi AI based on Monte Carlo search algorithm ☆10 · Apr 2, 2025 · Updated 10 months ago
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers" ☆14 · Feb 24, 2025 · Updated 11 months ago
- CIKM 23 Oral - HoLe: Homophily-enhanced Structure Learning for Graph Clustering ☆10 · Feb 29, 2024 · Updated last year
- Phonemes and durations labeling based on whisper small ☆11 · Jul 7, 2024 · Updated last year
- This project is a demonstration of a content-based recommendation system for Spotify that leverages user's preferences and audio features… ☆16 · Apr 4, 2023 · Updated 2 years ago