GuidedGenerationGroup / crisp-dl-readLinks
DL reading group arrangements
☆21Updated 5 months ago
Alternatives and similar repositories for crisp-dl-read
Users that are interested in crisp-dl-read are comparing it to the libraries listed below
Sorting:
- The code for the NeurIPS 2021 paper "A Unified View of cGANs with and without Classifiers".☆23Updated 4 years ago
- Code for “Pretrained Language Models as Visual Planners for Human Assistance”☆61Updated 2 years ago
- Contrastive Reinforcement Learning☆59Updated last week
- Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto☆57Updated last year
- Official code implementation for the work Preference Alignment with Flow Matching (NeurIPS 2024)☆66Updated last year
- SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022☆118Updated 3 years ago
- An official pytorch implementation of EACL2024 short paper "Flow Matching for Conditional Text Generation in a Few Sampling Steps"☆31Updated 6 months ago
- Code release for the paper "Egocentric Video Task Translation" (CVPR 2023 Highlight)☆34Updated 2 years ago
- Implementation of RQ Transformer, proposed in the paper "Autoregressive Image Generation using Residual Quantization"☆124Updated 3 years ago
- Speech2Vec Reality Check☆88Updated 2 years ago
- Official code for the paper: [ICCV2023] Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation☆41Updated 2 years ago
- Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group☆37Updated last year
- VQVAE for video prediction☆31Updated 3 years ago
- ☆21Updated 3 years ago
- [ICLR2022] Code for "Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph"☆54Updated 3 years ago
- [ICLR 2025 & COLM 2025] Official PyTorch implementation of the Forgetting Transformer and Adaptive Computation Pruning☆137Updated last month
- PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023)☆33Updated 3 years ago
- ☆55Updated 3 years ago
- Official implementation for the paper "A Cheaper and Better Diffusion Language Model with Soft-Masked Noise"☆59Updated 2 years ago
- Official Implementation (Pytorch) of "Constant Acceleration Flow", NeurIPS 2024☆35Updated 3 months ago
- The official implementation of MAGVLT: Masked Generative Vision-and-Language Transformer (CVPR'23)☆28Updated 2 years ago
- [CVPR 2025] Parallel Sequence Modeling via Generalized Spatial Propagation Network☆111Updated 6 months ago
- Sentiment analysis of song lyrics compared to auditory track features and valence☆13Updated 2 years ago
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation☆90Updated last year
- ☆48Updated last year
- Graph learning framework for long-term video understanding☆71Updated 6 months ago
- ☆144Updated last year
- Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch☆66Updated 3 years ago
- SAAVN Code release for paper "Sound Adversarial Audio-Visual Navigation,ICLR2022" (In PyTorch)☆20Updated 3 years ago
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch☆104Updated 2 years ago