GuidedGenerationGroup / crisp-dl-readLinks
DL reading group for people on Threads
☆17Updated last month
Alternatives and similar repositories for crisp-dl-read
Users that are interested in crisp-dl-read are comparing it to the libraries listed below
Sorting:
- The code for the NeurIPS 2021 paper "A Unified View of cGANs with and without Classifiers".☆24Updated 3 years ago
- Speech2Vec Reality Check☆84Updated 2 years ago
- Code for “Pretrained Language Models as Visual Planners for Human Assistance”☆61Updated 2 years ago
- Code for "Don't trust your eyes: on the (un)reliability of feature visualizations" (ICML 2024)☆33Updated last year
- ☆17Updated last year
- This repository is associated with the research paper titled ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large…☆14Updated 4 months ago
- Official code implementation for the work Preference Alignment with Flow Matching (NeurIPS 2024)☆58Updated 11 months ago
- Sentiment analysis of song lyrics compared to auditory track features and valence☆13Updated 2 years ago
- ☆13Updated 3 years ago
- SMILE: A Multimodal Dataset for Understanding Laughter☆12Updated 2 years ago
- Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?☆15Updated 4 months ago
- [ICLR2022] Code for "Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph"☆54Updated 2 years ago
- Official code for the paper "Image generation with shortest path diffusion" accepted at ICML 2023.☆23Updated 2 years ago
- Implementation of RQ Transformer, proposed in the paper "Autoregressive Image Generation using Residual Quantization"☆121Updated 3 years ago
- [CVPR 2025] Parallel Sequence Modeling via Generalized Spatial Propagation Network☆106Updated 2 months ago
- ☆44Updated last year
- Code for Novel View Acoustic Synthesis paper☆51Updated 2 years ago
- This is an implementation of the paper "Are We Done with Object-Centric Learning?"☆12Updated last month
- VQVAE for video prediction☆28Updated 3 years ago
- Official code for the paper: [ICCV2023] Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation☆40Updated last year
- Official implementation of DIP: Unsupervised Dense In-Context Post-training of Visual Representations☆44Updated last month
- Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch☆65Updated 3 years ago
- Implementation of the proposed LVMAE, from the paper, Extending Video Masked Autoencoders to 128 frames, in Pytorch☆54Updated 10 months ago
- ☆55Updated 2 years ago
- An official pytorch implementation of EACL2024 short paper "Flow Matching for Conditional Text Generation in a Few Sampling Steps"☆25Updated 2 months ago
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation☆88Updated last year
- SIEVE: Multimodal Dataset Pruning using Image-Captioning Models (CVPR 2024)☆17Updated last year
- [NeurIPS 2022] FiLM-Ensemble: Probabilistic Deep Learning via Feature-wise Linear Modulation☆24Updated 2 years ago
- The official repo of continuous speculative decoding☆30Updated 6 months ago
- [ICLR 2025 & COLM 2025] Official PyTorch implementation of the Forgetting Transformer and Adaptive Computation Pruning☆131Updated 2 weeks ago