[ACL 2026 Oral] Official implementation of LaMI: Augmenting Large Language Models via Late Multi-Image Fusion
☆19May 18, 2026Updated 3 weeks ago
Alternatives and similar repositories for LaMI
Users that are interested in LaMI are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official repo of the paper "StressTest: Can YOUR Speech LM Handle the Stress?"☆20Jul 9, 2025Updated 11 months ago
- Official implementation of "Dataset Size Recovery from LoRA Weights" paper.☆34Jun 30, 2024Updated last year
- Official implementation of "Describing Sets of Images with Textual-PCA".☆16Feb 13, 2023Updated 3 years ago
- This repo contains the official PyTorch implementation of "Analyzing Discrete Self Supervised Speech Representation For Spoken Language M…☆20Jan 3, 2023Updated 3 years ago
- The official code for the SALMon🍣 benchmark (ICASSP 2025 - Oral)☆50Aug 15, 2025Updated 9 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- [InterSpeech 2023] The official PyTorch implementation of: "AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Imag…☆89May 18, 2026Updated 3 weeks ago
- [AAAI 2025] Official Implementation for "Click2Mask: Local Editing with Dynamic Mask Generation" Paper.☆21Jan 22, 2026Updated 4 months ago
- This repo contains the official PyTorch implementation of "A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement" (…☆28Aug 8, 2022Updated 3 years ago
- Code for our paper: "Where's Waldo: Diffusion Features For Personalized Segmentation and Retrieval".☆14Feb 26, 2025Updated last year
- ☆16Sep 6, 2024Updated last year
- [EMNLP'23 Oral] ReSee: Responding through Seeing Fine-grained Visual Knowledge in Open-domain Dialogue PyTorch Implementation☆12Dec 4, 2023Updated 2 years ago
- Official Pytorch implementation of the paper: "Locally Shifted Attention With Early Global Integration"☆15Dec 20, 2021Updated 4 years ago
- Official PyTorch Implementation for the "Recovering the Pre-Fine-Tuning Weights of Generative Models" paper (ICML 2024).☆86Apr 15, 2025Updated last year
- An official implementation of ProbeGen☆13Oct 20, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Official This-Is-My Dataset published in CVPR 2023☆16Jul 18, 2024Updated last year
- Official PyTorch implementation of the paper: "Deep Audio Waveform Prior" (Interspeech 2022) https://arxiv.org/abs/2207.10441☆12Oct 25, 2022Updated 3 years ago
- Official implementation of "DGD: Dynamic 3D Gaussians Distillation".☆69Aug 16, 2024Updated last year
- ☆20Mar 5, 2026Updated 3 months ago
- A spoken version of the textual story cloze benchmark☆22Aug 6, 2023Updated 2 years ago
- ☆16Jun 14, 2024Updated last year
- Automatic Measurement of Vowel Duration for Consonant Vowel Consonant (CVC) sound files (JASA 2016)☆14Feb 25, 2017Updated 9 years ago
- ☆28Nov 25, 2024Updated last year
- Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11…☆46Jul 2, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [AAAI 2024] The official PyTorch implementation of "Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation"☆129May 18, 2026Updated 3 weeks ago
- An official PyTorch implementation for CLIPPR☆31Jul 22, 2023Updated 2 years ago
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning☆33Jan 9, 2025Updated last year
- This paper presents our winning submission to Subtask 2 of SemEval 2024 Task 3 on multimodal emotion cause analysis in conversations.☆24Aug 2, 2024Updated last year
- Code for our papers : "Generating images of rare concepts using pre-trained diffusion models" (AAAI 24) and "Norm-guided latent space exp…☆87Dec 27, 2023Updated 2 years ago
- Official implemention of "Make It Count: Text-to-Image Generation with an Accurate Number of Objects" (CVPR 2025)☆96Mar 12, 2025Updated last year
- ☆16Jul 23, 2024Updated last year
- ☆12Apr 25, 2026Updated last month
- rmp data ranking☆13Nov 4, 2025Updated 7 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- [NeurIPS2023] Official implementation of the paper "Large Language Models are Visual Reasoning Coordinators"☆106Nov 9, 2023Updated 2 years ago
- Implementation of the paper Silent Killer☆25Mar 18, 2024Updated 2 years ago
- DSTC8-AVSD: Sentence generation task for Audio Visual Scene-aware Dialog☆14Jun 10, 2021Updated 5 years ago
- ☆32Oct 18, 2024Updated last year
- Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024]☆24Aug 13, 2024Updated last year
- TREE-G: Decision Trees Contesting Graph Neural Networks, specialized for graph data.☆13Feb 28, 2024Updated 2 years ago
- Pytorch implementation of of "Domain Intersection and Domain Difference" (ICCV 2019)☆39Mar 30, 2020Updated 6 years ago