allenai / tess-diffusion
☆16Updated last year
Alternatives and similar repositories for tess-diffusion
Users that are interested in tess-diffusion are comparing it to the libraries listed below
- PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023☆20Updated 2 years ago
- The official codebase for "Empowering Diffusion Models on the Embedding Space for Text Generation" (NAACL 2024)☆54Updated last year
- ☆97Updated 2 years ago
- Unofficial PyTorch implementation of "Step-unrolled Denoising Autoencoders for Text Generation"☆24Updated 2 years ago
- Reparameterized Discrete Diffusion Models for Text Generation☆98Updated 2 years ago
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"☆73Updated 7 months ago
- Yet another random morning idea to be quickly tried and architecture shared if it works; to allow the transformer to pause for any amount…☆54Updated last year
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation☆40Updated 7 months ago
- A simple Diffusion LM approach.☆24Updated 2 years ago
- Text Diffusion Model with Encoder-Decoder Transformers for Sequence-to-Sequence Generation [NAACL 2024]☆97Updated last year
- Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control☆70Updated 2 years ago
- ☆34Updated last year
- Few-shot Learning with Auxiliary Data☆28Updated last year
- Official Code for Paper "Think While You Generate: Discrete Diffusion with Planned Denoising" [ICLR 2025]☆61Updated last month
- ☆14Updated last year
- Inference-Time Alignment in Protein Diffusion Models☆37Updated 4 months ago
- Official Repo of Paper "Eliminating Position Bias of Language Models: A Mechanistic Approach"☆14Updated 9 months ago
- ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignment☆55Updated 11 months ago
- DiffusER: Discrete Diffusion via Edit-based Reconstruction (Reid, Hellendoorn & Neubig, 2022)☆54Updated 2 years ago
- This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception o…☆23Updated 2 months ago
- Official implementation of "BERTs are Generative In-Context Learners"☆28Updated 2 months ago
- Latent Diffusion Language Models☆68Updated last year
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging"☆37Updated last year
- [NeurIPS 2022] Your Transformer May Not be as Powerful as You Expect (official implementation)☆34Updated last year
- Codebase for Context-aware Meta-learned Loss Scaling (CaMeLS). https://arxiv.org/abs/2305.15076.☆25Updated last year
- Remasking Discrete Diffusion Models with Inference-Time Scaling☆22Updated 2 months ago
- Exploration of automated dataset selection approaches at large scales.☆42Updated 3 months ago
- ☆22Updated 7 months ago
- ☆32Updated 4 months ago
- Efficient Scaling laws and collaborative pretraining.☆16Updated 4 months ago