CVPR'24, Official Codebase of our Paper: "Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation".
☆322Apr 13, 2024Updated last year
Alternatives and similar repositories for CLoT
Users that are interested in CLoT are comparing it to the libraries listed below
Sorting:
- ACL 2024 (SRW), Official Codebase of our Paper: "MoExtend: Tuning New Experts for Modality and Task Extension"☆14Dec 3, 2024Updated last year
- The official implementation of paper "Drop-Activation: Implicit Parameter Reduction and Harmonious Regularization".☆10May 30, 2019Updated 6 years ago
- The official implementation of two AI-enhanced numerical solvers: NeurVec (Sci. Rep.) and AttNS (ICML'24)☆27May 21, 2024Updated last year
- [CVPR 2024] Dynamic Prompt Optimizing for Text-to-Image Generation☆86Jul 13, 2024Updated last year
- ☆37Jan 25, 2024Updated 2 years ago
- Just for debug☆56Feb 15, 2024Updated 2 years ago
- 🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"☆110Nov 24, 2025Updated 3 months ago
- This repo contains the official PyTorch implementation of vLMIG: Improving Visual Commonsense in Language Models via Multiple Image Gener…☆17Jul 1, 2024Updated last year
- (ECCV 2024) Code for V-IRL: Grounding Virtual Intelligence in Real Life☆367Dec 2, 2024Updated last year
- The implementation of paper ''Efficient Attention Network: Accelerate Attention by Searching Where to Plug''.☆20Jun 16, 2023Updated 2 years ago
- A Framework for Decoupling and Assessing the Capabilities of VLMs☆43Jun 28, 2024Updated last year
- [IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models☆948Nov 13, 2024Updated last year
- Code for our Paper "All in an Aggregated Image for In-Image Learning"☆29Apr 9, 2024Updated last year
- [NAACL 2025] Representing Rule-based Chatbots with Transformers☆23Feb 9, 2025Updated last year
- This is the official repository for M2UGen☆514Jan 2, 2025Updated last year
- Official Repo for the Paper: CHATANYTHING: FACETIME CHAT WITH LLM-ENHANCED PERSONAS☆381Nov 26, 2023Updated 2 years ago
- Code and data of "Controllable Unsupervised Event-based Video Generation" (accepted as ICIP oral and invited by WACV workshop)☆19Nov 5, 2024Updated last year
- [ACL 2024] A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset☆25May 29, 2025Updated 9 months ago
- 中文Mixtral-8x7B(Chinese-Mixtral-8x7B)☆654Aug 17, 2024Updated last year
- Emu Series: Generative Multimodal Models from BAAI☆1,768Jan 12, 2026Updated last month
- [CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation☆784May 24, 2024Updated last year
- ☆210Apr 15, 2024Updated last year
- Touchstone: Evaluating Vision-Language Models by Language Models☆83Jan 18, 2024Updated 2 years ago
- ☆19Dec 20, 2024Updated last year
- Official Code for DiffMorpher: Unleashing the Capability of Diffusion Models for Image Morphing (CVPR 2024)☆504Apr 24, 2024Updated last year
- Source code of "Reasons to Reject? Aligning Language Models with Judgments"☆58Feb 29, 2024Updated 2 years ago
- Source code for paper Conservative Uncertainty Estimation By Fitting Prior Networks (ICLR 2020)☆22Nov 28, 2022Updated 3 years ago
- [ACM MM 2025] MLLMs for Aesthetics Reasoning☆23Jan 5, 2026Updated 2 months ago
- Official implementation of DreaMoving☆1,802Jan 9, 2024Updated 2 years ago
- ☆20Feb 16, 2026Updated 2 weeks ago
- [NeurIPS 2025] Elevating Visual Perception in Multimodal LLMs with Visual Embedding Distillation☆71Oct 17, 2025Updated 4 months ago
- Math24o: 高中奥林匹克数学竞赛测评集 High School Olympiad Mathematics Chinese Benchmark☆11Mar 27, 2025Updated 11 months ago
- MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer☆249Apr 3, 2024Updated last year
- ☆46Oct 28, 2025Updated 4 months ago
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)☆1,844Feb 1, 2025Updated last year
- ☆123Jun 6, 2024Updated last year
- Official implementation of FouriScale (ECCV2024)☆159Jul 27, 2024Updated last year
- Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024]☆24Aug 13, 2024Updated last year
- ☆24Jul 31, 2024Updated last year