☆185Jan 16, 2024Updated 2 years ago
Alternatives and similar repositories for gfn-lm-tuning
Users that are interested in gfn-lm-tuning are comparing it to the libraries listed below
Sorting:
- ☆43Jul 26, 2024Updated last year
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples☆122Jan 31, 2026Updated last month
- Code for GFlowNet-EM, a novel algorithm for fitting latent variable models with compositional latents and an intractable true posterior.☆42Feb 9, 2024Updated 2 years ago
- Generative Flow Networks - GFlowNet☆317Updated this week
- Learning energy decompositions for partial inference in GFlowNets☆16Jun 4, 2024Updated last year
- ☆36Mar 29, 2025Updated 11 months ago
- ☆19May 11, 2023Updated 2 years ago
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"☆18Mar 15, 2024Updated last year
- a benchmark to evaluate the situated inductive reasoning☆15Jan 7, 2025Updated last year
- [ICLR 2025] Code for the paper "Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning"☆87Feb 14, 2025Updated last year
- [NIPS 2023] AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation☆12May 19, 2023Updated 2 years ago
- Repository for "Generative Flow Networks as Entropy-Regularized RL" (AISTATS-2024, Oral)☆40Apr 21, 2024Updated last year
- ☆15Jul 9, 2025Updated 8 months ago
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆27Apr 17, 2024Updated last year
- [ICML 2023] "Data Efficient Neural Scaling Law via Model Reusing" by Peihao Wang, Rameswar Panda, Zhangyang Wang☆14Jan 4, 2024Updated 2 years ago
- The original Shared Recurrent Memory Transformer implementation☆33Jul 11, 2025Updated 7 months ago
- [ICLR 2024 Spotlight] Text2Reward: Reward Shaping with Language Models for Reinforcement Learning☆199Dec 17, 2024Updated last year
- ☆123Feb 21, 2025Updated last year
- Code for our paper "Generative Flow Networks for Discrete Probabilistic Modeling"☆87Feb 22, 2023Updated 3 years ago
- PyTorch implementation for our ICLR 2024 paper "Diffusion Generative Flow Samplers: Improving learning signals through partial trajectory…☆26Dec 21, 2023Updated 2 years ago
- Official implementation of the transformer (TF) architecture suggested in a paper entitled "Looped Transformers as Programmable Computers…☆35Apr 8, 2023Updated 2 years ago
- Code for the paper "Evading Black-box Classifiers Without Breaking Eggs" [SaTML 2024]☆21Apr 15, 2024Updated last year
- ☆74Apr 27, 2024Updated last year
- ☆32Mar 27, 2025Updated 11 months ago
- ☆19Oct 2, 2023Updated 2 years ago
- ☆20Nov 3, 2024Updated last year
- Official PyTorch Implementation for Meaning Representations from Trajectories in Autoregressive Models (ICLR 2024)☆22May 14, 2024Updated last year
- LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification☆74Jul 14, 2025Updated 7 months ago
- [EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".☆83Jan 14, 2025Updated last year
- ☆20Oct 25, 2022Updated 3 years ago
- Code for "Bayesian Structure Learning with Generative Flow Networks"☆96Mar 28, 2022Updated 3 years ago
- ☆342Jun 5, 2025Updated 9 months ago
- [EMNLP'24 (Main)] DRPO(Dynamic Rewarding with Prompt Optimization) is a tuning-free approach for self-alignment. DRPO leverages a search-…☆24Nov 17, 2024Updated last year
- ☆16Feb 22, 2025Updated last year
- Implementations of the renormalization group-based diffusion model (RGDM).☆16Mar 10, 2025Updated 11 months ago
- The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024.☆13Jun 17, 2024Updated last year
- QLoRA with Enhanced Multi GPU Support☆38Aug 8, 2023Updated 2 years ago
- Official Code for Local Search GFlowNets (ICLR 2024 Spotlight)☆24Feb 27, 2025Updated last year
- Model-Based Transfer Learning for Contextual Reinforcement Learning (NeurIPS 2024)☆26Jan 21, 2026Updated last month