GuidedGenerationGroup / crisp-dl-read
DL reading group arrangements
☆21 Updated 5 months ago
Alternatives and similar repositories for crisp-dl-read
Users that are interested in crisp-dl-read are comparing it to the libraries listed below
- The code for the NeurIPS 2021 paper "A Unified View of cGANs with and without Classifiers". ☆23 Updated 4 years ago
- Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs? ☆15 Updated 8 months ago
- ☆19 Updated last year
- Code for “Pretrained Language Models as Visual Planners for Human Assistance” ☆61 Updated 2 years ago
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns … ☆16 Updated 8 months ago
- Official code implementation for the work Preference Alignment with Flow Matching (NeurIPS 2024) ☆66 Updated last year
- Code for "Don't trust your eyes: on the (un)reliability of feature visualizations" (ICML 2024) ☆34 Updated 2 years ago
- This is an implementation of the paper "Are We Done with Object-Centric Learning?" ☆12 Updated 5 months ago
- The official repo of continuous speculative decoding ☆31 Updated 10 months ago
- Implementation of RQ Transformer, proposed in the paper "Autoregressive Image Generation using Residual Quantization" ☆124 Updated 3 years ago
- Official implementation of DIP: Unsupervised Dense In-Context Post-training of Visual Representations ☆46 Updated 5 months ago
- Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto ☆57 Updated last year
- Speech2Vec Reality Check ☆88 Updated 2 years ago
- ☆14 Updated 10 months ago
- This repository is associated with the research paper titled ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large… ☆14 Updated 8 months ago
- ☆14 Updated 8 months ago
- [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts ☆17 Updated 10 months ago
- [ICLR 2025 & COLM 2025] Official PyTorch implementation of the Forgetting Transformer and Adaptive Computation Pruning ☆137 Updated last month
- Contrastive Reinforcement Learning ☆59 Updated last week
- ☆13 Updated 4 years ago
- Official code for the paper "Attention as a Hypernetwork" ☆47 Updated last year
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation… ☆22 Updated 2 years ago
- SMILE: A Multimodal Dataset for Understanding Laughter ☆13 Updated 2 years ago
- Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24] ☆61 Updated last year
- ☆55 Updated 3 years ago
- Implementation of the proposed LVMAE, from the paper, Extending Video Masked Autoencoders to 128 frames, in Pytorch ☆55 Updated last year
- The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization ☆18 Updated 11 months ago
- Code release for the paper "Egocentric Video Task Translation" (CVPR 2023 Highlight) ☆34 Updated 2 years ago
- Data-Efficient Multimodal Fusion on a Single GPU ☆68 Updated last year
- ☆16 Updated 9 months ago