RLHFlow/GVM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/RLHFlow/GVM)

RLHFlow / GVM

☆16

Alternatives and similar repositories for GVM

Users that are interested in GVM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

microsoft / implicit_languagemodels
View on GitHub
☆10Jul 7, 2025Updated last year
RLHFlow / Minimal-RL
View on GitHub
☆275May 14, 2025Updated last year
tmlr-group / CoPA
View on GitHub
[NeurIPS 2024] "Mind the Gap between Prototypes and Images in Cross-domain Finetuning"
☆11Nov 15, 2024Updated last year
tmlr-group / BayesianLM
View on GitHub
[NeurIPS 2024 Oral] "Bayesian-Guided Label Mapping for Visual Reprogramming"
☆12Dec 20, 2024Updated last year
morning9393 / ETPO
View on GitHub
☆14Mar 5, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
reissbaker / clevergpt
View on GitHub
Training GPTs to solve interaction nets
☆18Aug 14, 2024Updated last year
tmlr-group / SCT
View on GitHub
[NeurIPS 2024] "Self-Calibrated Tuning of Vision-Language Models for Out-of-Distribution Detection"
☆13Oct 28, 2024Updated last year
WeiXiongUST / Decentralized-Proximal-Algorithm-with-Variance-Reduction
View on GitHub
This is the code used for the paper "PMGT-VR: A decentralized proximal-gradient algorithmic framework with variance reduction", prepint.
☆15Jul 2, 2022Updated 4 years ago
dinobby / Skill-MoE
View on GitHub
The code implementation of Skill-MoE
☆46May 22, 2026Updated last month
uiuc-kang-lab / leap
View on GitHub
[VLDB'2025] LEAP: LLM-powered End-to-end Automatic Library for Processing Social Science Queries on Unstructured Data
☆20Nov 3, 2025Updated 8 months ago
LFhase / HIGHT
View on GitHub
[ICML 2025] Hierarchical Graph Tokenization for Molecule-Language Alignment
☆16Aug 18, 2025Updated 11 months ago
HazyResearch / aioli
View on GitHub
Aioli: A unified optimization framework for language model data mixing
☆33Jan 17, 2025Updated last year
microsoft / RLHF-APA
View on GitHub
RL algorithm: Advantage induced policy alignment
☆66Aug 11, 2023Updated 2 years ago
HazyResearch / smoothie
View on GitHub
☆15Dec 10, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
zkshan2002 / RTO
View on GitHub
☆22Jun 4, 2025Updated last year
alon-albalak / TLiDB
View on GitHub
Transfer Learning in Dialogue Benchmarking Toolkit
☆14Mar 31, 2023Updated 3 years ago
VirtuosoResearch / Multi-source-learning-repository
View on GitHub
Recent papers and projects in multitask learning and their applications
☆10Jul 8, 2026Updated last week
general-preference / general-preference-model
View on GitHub
[ICML 2025] Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment (https://arxiv.org/abs/2410.02197)
☆43Jun 15, 2026Updated last month
GAUnion / LeetCode-Daily
View on GitHub
☆10May 25, 2020Updated 6 years ago
qiancheng0 / EscapeBench
View on GitHub
This is the repository for paper EscapeBench: Pushing Language Models to Think Outside the Box
☆18Dec 19, 2024Updated last year
chao1224 / SGNN-EBM
View on GitHub
Structured Multi-task Learning for Molecular Property Prediction, AISTATS'22 (https://proceedings.mlr.press/v151/liu22e.html)
☆14Jul 6, 2022Updated 4 years ago
stratisMarkou / sample-efficient-bayesian-rl
View on GitHub
Source for the sample efficient tabular RL submission to the 2019 NIPS workshop on Biological and Artificial RL
☆25Apr 14, 2022Updated 4 years ago
frankxwang / dpo-prefix-sharing
View on GitHub
DPO, but faster 🚀
☆52Dec 6, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
GauravGajbhiye / SCAMET_RSIC
View on GitHub
This is tensorflow 2.2 based SCAMET framework for remote sensing image captioning.
☆13Aug 10, 2023Updated 2 years ago
kschweig / OfflineRL
View on GitHub
Experiment for Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning
☆26Jan 16, 2023Updated 3 years ago
LLaMafia / SFT_function_learning
View on GitHub
Explore what LLMs are really leanring over SFT
☆28Mar 30, 2024Updated 2 years ago
CogComp / TAWT
View on GitHub
Weighted Training for Cross-Task Learning
☆15Feb 12, 2023Updated 3 years ago
alecwangcq / f-divergence-dpo
View on GitHub
Direct preference optimization with f-divergences.
☆17Nov 3, 2024Updated last year
yanxue7 / E3T-Overcooked
View on GitHub
☆15May 4, 2024Updated 2 years ago
sail-sg / ActivePRM
View on GitHub
☆21Apr 16, 2025Updated last year
WeiXiongUST / Building-Math-Agents-with-Multi-Turn-Iterative-Preference-Learning
View on GitHub
This is an official implementation of the paper ``Building Math Agents with Multi-Turn Iterative Preference Learning'' with multi-turn DP…
☆32Dec 5, 2024Updated last year
dongxinshuai / RIFT-NeurIPS2021
View on GitHub
☆11Mar 6, 2022Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Arenaa / Accelerated-Generation-Techniques
View on GitHub
This repository contains papers for a comprehensive survey on accelerated generation techniques in Large Language Models (LLMs).
☆11May 24, 2024Updated 2 years ago
jejjohnson / research_notebook
View on GitHub
My personal research notebook with notes, tutorials, and resources written in Jupyterbook.
☆21Jun 30, 2026Updated 2 weeks ago
Zanette-Labs / speed-rl
View on GitHub
☆18Feb 2, 2026Updated 5 months ago
tmlr-group / NegLabel
View on GitHub
[ICLR 2024 Spotlight] "Negative Label Guided OOD Detection with Pretrained Vision-Language Models"
☆21Oct 23, 2024Updated last year
shirley-wu / daco
View on GitHub
[NeurIPS 2024 D&B Track] DACO: Towards Application-Driven and Comprehensive Data Analysis via Code Generation
☆14Mar 5, 2025Updated last year
VirtuosoResearch / Generalization-in-graph-neural-networks
View on GitHub
Measuring generalization properties of graph neural networks
☆15Aug 11, 2025Updated 11 months ago
vaguenebula / AlpacaDataReflect
View on GitHub
An experiment to see if chatgpt can improve the output of the stanford alpaca dataset
☆12Mar 29, 2023Updated 3 years ago