LCM-Lab / Bridge_Gap_DiffusionLinks
☆34Updated 2 years ago
Alternatives and similar repositories for Bridge_Gap_Diffusion
Users that are interested in Bridge_Gap_Diffusion are comparing it to the libraries listed below
Sorting:
- Code for paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning"☆83Updated last year
- Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control☆75Updated 2 years ago
- ☆25Updated 2 months ago
- Text Diffusion Model with Encoder-Decoder Transformers for Sequence-to-Sequence Generation [NAACL 2024]☆98Updated 2 years ago
- The official codebase for "Empowering Diffusion Models on the Embedding Space for Text Generation" (NAACL 2024)☆55Updated last year
- ☆141Updated last year
- [IJCAI'23] The official Github page of the paper "Diffusion Models for Non-autoregressive Text Generation: A Survey".☆31Updated last year
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging"☆37Updated last year
- ☆107Updated 2 years ago
- [NeurIPS 2023] Make Your Pre-trained Model Reversible: From Parameter to Memory Efficient Fine-Tuning☆32Updated 2 years ago
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"☆32Updated last year
- DiffusER: Discrete Diffusion via Edit-based Reconstruction (Reid, Hellendoorn & Neubig, 2022)☆54Updated 2 months ago
- [ICML 2022] Latent Diffusion Energy-Based Model for Interpretable Text Modeling☆66Updated 3 years ago
- Language modeling via stochastic processes. Oral @ ICLR 2022.☆138Updated 2 years ago
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆27Updated last year
- Reparameterized Discrete Diffusion Models for Text Generation☆101Updated 2 years ago
- ☆130Updated 3 years ago
- Official implementation for the paper "A Cheaper and Better Diffusion Language Model with Soft-Masked Noise"☆59Updated 2 years ago
- Mixture of Attention Heads☆49Updated 2 years ago
- ☆18Updated 2 years ago
- Official implementation of Hierarchical Context Merging: Better Long Context Understanding for Pre-trained LLMs (ICLR 2024).☆43Updated last year
- ☆51Updated 2 years ago
- Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning with LLMs☆39Updated last year
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards☆44Updated 5 months ago
- [ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal…☆55Updated 2 years ago
- ☆27Updated 2 years ago
- Online Adaptation of Language Models with a Memory of Amortized Contexts (NeurIPS 2024)☆69Updated last year
- Directional Preference Alignment☆57Updated last year
- This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception o…☆27Updated 2 months ago
- Unofficial PyTorch implementation of "Step-unrolled Denoising Autoencoders for Text Generation"☆24Updated 2 years ago