DaShenZi721 / HRALinks
☆28Updated last week
Alternatives and similar repositories for HRA
Users that are interested in HRA are comparing it to the libraries listed below
Sorting:
- Official implementation for our paper "Scaling Diffusion Transformers Efficiently via μP".☆63Updated 2 weeks ago
- SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024)☆31Updated 7 months ago
- Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation, ICML 2024☆22Updated 11 months ago
- A list of papers for group meeting☆16Updated 3 weeks ago
- Listing some diffusion papers in NLP domain I have read, text generation is main, table will continue to be updated.☆44Updated 2 months ago
- The this is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"☆38Updated 7 months ago
- LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning☆31Updated last year
- Code for GFlowNet-EM, a novel algorithm for fitting latent variable models with compositional latents and an intractable true posterior.☆40Updated last year
- Official Code Repository for the paper "Continuous Diffusion Model for Language Modeling".☆29Updated 2 months ago
- ☆19Updated 2 months ago
- PyTorch implementation for our ICLR 2024 paper "Diffusion Generative Flow Samplers: Improving learning signals through partial trajectory…☆24Updated last year
- Official Jax Implementation of MD4 Masked Diffusion Models☆100Updated 3 months ago
- source code for paper "Riemannian Preconditioned LoRA for Fine-Tuning Foundation Models"☆26Updated 11 months ago
- The official codebase for "Empowering Diffusion Models on the Embedding Space for Text Generation" (NAACL 2024)☆54Updated last year
- Official PyTorch implementation for "Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data" (ICLR…☆46Updated last week
- Intriguing Properties of Data Attribution on Diffusion Models (ICLR 2024)☆30Updated last year
- Interpretable Diffusion Via Information Decomposition☆28Updated 10 months ago
- ☆146Updated 8 months ago
- Deep Learning & Information Bottleneck☆60Updated last year
- Reparameterized Discrete Diffusion Models for Text Generation☆98Updated 2 years ago
- Official Code for Paper "Think While You Generate: Discrete Diffusion with Planned Denoising" [ICLR 2025]☆61Updated last month
- A Collection of Papers on Diffusion Language Models☆60Updated this week
- Auto get diffusion nlp papers in Axriv. More papers Information can be found in another repository "Diffusion-LM-Papers".☆132Updated this week
- [ICLR2023] NTK-SAP: Improving neural network pruning by aligning training dynamics☆18Updated 2 years ago
- Pytorch implementation of ICML-2024 "Navigating Complexity: Toward Lossless Graph Condensation via Expanding Window Matching"☆24Updated 11 months ago
- A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards.☆152Updated 3 months ago
- Official PyTorch implementation for "Unifying Bayesian Flow Networks and Diffusion Models through Stochastic Differential Equations"☆39Updated last year
- [NeurIPS 2024] VeLoRA : Memory Efficient Training using Rank-1 Sub-Token Projections☆20Updated 7 months ago
- Generative Equilibrium Transformer☆18Updated last year
- source code of (quasi-)Givens Orthogonal Fine Tuning integrated to peft lib☆16Updated 2 months ago