GFNOrg/gfn-lm-tuning

GFNOrg / gfn-lm-tuning

☆191

Alternatives and similar repositories for gfn-lm-tuning

Users who are interested in gfn-lm-tuning are comparing it to the repositories listed below.

zdhNarsil / Awesome-GFlowNets
A curated list of resources about generative flow networks (GFlowNets).
☆503 · Oct 1, 2024 · Updated last year
GFNOrg / torchgfn
A modular, easy to extend GFlowNet library
☆312 · Jul 11, 2026 · Updated 2 weeks ago
GFNOrg / diffusion-finetuning
☆43 · Jul 26, 2024 · Updated 2 years ago
Yu-Fangxu / FoR
[ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples
☆126 · Jan 31, 2026 · Updated 5 months ago
GFNOrg / gflownet
Generative Flow Networks
☆684 · Feb 28, 2023 · Updated 3 years ago
alexhernandezgarcia / gflownet
Generative Flow Networks - GFlowNet
☆339 · Updated this week
GFNOrg / GFlowNet-EM
Code for GFlowNet-EM, a novel algorithm for fitting latent variable models with compositional latents and an intractable true posterior.
☆42 · Feb 9, 2024 · Updated 2 years ago
GFNOrg / gfn-diffusion
☆38 · Mar 29, 2025 · Updated last year
hsjang0 / LED-GFN
Learning energy decompositions for partial inference in GFlowNets
☆16 · Jun 4, 2024 · Updated 2 years ago
d-tiapkin / gflownet-rl
Repository for "Generative Flow Networks as Entropy-Regularized RL" (AISTATS-2024, Oral)
☆41 · Apr 21, 2024 · Updated 2 years ago
zdhNarsil / Diffusion-Generative-Flow-Samplers
PyTorch implementation for our ICLR 2024 paper "Diffusion Generative Flow Samplers: Improving learning signals through partial trajectory…
☆26 · Dec 21, 2023 · Updated 2 years ago
chang-github-00 / LLM-Predictive-Decoding
☆16 · Jul 9, 2025 · Updated last year
XiaojuanTang / Mars
A benchmark to evaluate situated inductive reasoning
☆16 · Jan 7, 2025 · Updated last year
yifeiwang77 / Self-Correction
☆20 · Nov 3, 2024 · Updated last year
bpwu1 / confidence-regulation-neurons
Confidence Regulation Neurons in Language Models (NeurIPS 2024)
☆15 · Feb 1, 2025 · Updated last year
tristandeleu / jax-dag-gflownet
Code for "Bayesian Structure Learning with Generative Flow Networks"
☆95 · Mar 28, 2022 · Updated 4 years ago
GFNOrg / EB_GFN
Code for our paper "Generative Flow Networks for Discrete Probabilistic Modeling"
☆85 · Feb 22, 2023 · Updated 3 years ago
llm4html / llm4html
☆18 · Sep 29, 2022 · Updated 3 years ago
portal-cornell / muCode
☆33 · Oct 2, 2025 · Updated 9 months ago
MARIO-Math-Reasoning / Super_MARIO
☆341 · Jun 5, 2025 · Updated last year
shadowkiller33 / Contrast-Instruction
☆19 · Oct 2, 2023 · Updated 2 years ago
wutong4012 / AR-Diffusion
[NIPS 2023] AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation
☆12 · May 19, 2023 · Updated 3 years ago
dbsxodud-11 / ls_gfn
Official Code for Local Search GFlowNets (ICLR 2024 Spotlight)
☆25 · Feb 27, 2025 · Updated last year
probcomp / LLaMPPL
A domain-specific probabilistic programming language for modeling and inference with language models
☆143 · Apr 29, 2025 · Updated last year
zdhNarsil / GFlowNet-CombOpt
PyTorch implementation for our NeurIPS 2023 spotlight paper "Let the Flows Tell: Solving Graph Combinatorial Optimization Problems with G…
☆68 · May 30, 2023 · Updated 3 years ago
xlang-ai / text2reward
[ICLR 2024 Spotlight] Text2Reward: Reward Shaping with Language Models for Reinforcement Learning
☆210 · Dec 17, 2024 · Updated last year
rmshin / llm-mcts
☆40 · Jun 19, 2024 · Updated 2 years ago
ARM-gradient / ARSM
Low-variance and unbiased gradient for backpropagation through categorical random variables, with application in variational auto-encoder…
☆17 · Jul 1, 2020 · Updated 6 years ago
SalesforceAIResearch / LaTRO
☆127 · Jun 2, 2026 · Updated last month
wiio12 / POETRY
Code for the paper: Proving Theorems Recursively
☆12 · May 23, 2024 · Updated 2 years ago
alexrame / rewardedsoups
Rewarded soups official implementation
☆64 · Sep 27, 2023 · Updated 2 years ago
VITA-Group / Data-Efficient-Scaling
[ICML 2023] "Data Efficient Neural Scaling Law via Model Reusing" by Peihao Wang, Rameswar Panda, Zhangyang Wang
☆14 · Jan 4, 2024 · Updated 2 years ago
dangxingyu / rnn-icrag
Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"
☆27 · Apr 17, 2024 · Updated 2 years ago
bbartoldson / TBA
Official implementation of TBA for async LLM post-training.
☆32 · Nov 5, 2025 · Updated 8 months ago
HKUNLP / diffusion-vs-ar
[ICLR 2025] Code for the paper "Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning"
☆94 · Feb 14, 2025 · Updated last year
SparkJiao / dpo-trajectory-reasoning
[EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".
☆84 · Jan 14, 2025 · Updated last year
Dahoas / QDSyntheticData
☆14 · Aug 15, 2024 · Updated last year
nirgreshler / bayesian-online-planning
The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024.
☆13 · Jun 17, 2024 · Updated 2 years ago
pranavAL / DART
Official Code Repo for the paper "Learning to Play Atari in a World of Tokens" accepted at ICML, 2024
☆11 · Jun 6, 2024 · Updated 2 years ago