chai-research/lmgym

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/chai-research/lmgym)

chai-research / lmgym

Code base for internal reward models and PPO training

☆24

Alternatives and similar repositories for lmgym

Users that are interested in lmgym are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lampts / chatgpt-mle-interview
View on GitHub
ChatGPT solutions for the MLE interview
☆14Dec 9, 2022Updated 3 years ago
anhvth / WKaraokeMaker
View on GitHub
☆12Dec 15, 2022Updated 3 years ago
yihedeng9 / DuoGuard
View on GitHub
DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails
☆34Feb 26, 2025Updated last year
ad-freiburg / whitespace-correction
View on GitHub
Fast whitespace correction with Transformers
☆18Aug 22, 2025Updated 10 months ago
kyegomez / VisionLLaMA
View on GitHub
Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta
☆15Nov 11, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ai-wand / concise-reasoning
View on GitHub
Concise Reasoning via Reinforcement Learning
☆13Apr 16, 2025Updated last year
fishiatee / yawullm
View on GitHub
Yet Another (LLM) Web UI, made with Gemini
☆12Dec 25, 2024Updated last year
zarakiquemparte / zaraki-tools
View on GitHub
☆28Aug 30, 2023Updated 2 years ago
vancoder1 / AsukaAI
View on GitHub
Local AI companion
☆17May 22, 2025Updated last year
LazerCuber / Waifu-AI-Dev
View on GitHub
Live2D Waifu with TTS support (Please use the Beta Branch)
☆11Apr 5, 2026Updated 3 months ago
ikegami-yukino / coding-tips
View on GitHub
ど忘れしたときのためのメモ
☆10Mar 13, 2026Updated 3 months ago
xuerongchuan / ICME2019competition
View on GitHub
短视频内容理解与推荐竞赛
☆12Feb 18, 2019Updated 7 years ago
shisa-ai / shaberi
View on GitHub
Lightblue LLM Eval Framework: tengu, elyza100, ja-mtbench, rakuda
☆19Apr 29, 2026Updated 2 months ago
opencog / pln
View on GitHub
[NO LONGER MAINTAINED, SUPERSEDED BY https://github.com/trueagi-io/pln-experimental and https://github.com/trueagi-io/PLN]. Probabilisti…
☆16Sep 20, 2025Updated 9 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
minhpqn / nlp_100_drill_exercises_ver_2020
View on GitHub
Bản dịch tiếng Việt của 100 bài luyện tập NLP (cập nhật bản 2020) dịch từ 言語処理100本ノック 2020 (https://nlp100.github.io/ja)
☆24Jun 8, 2020Updated 6 years ago
zhiao777774 / awesome-personalized-lm
View on GitHub
A curated list of personalized Language model / Large language model (continually updated)
☆10Nov 17, 2023Updated 2 years ago
allenai / learning_from_interaction
View on GitHub
Learning about objects and their properties by interacting with them
☆12Oct 21, 2020Updated 5 years ago
hanningzhang / ER-PRM
View on GitHub
☆20Dec 14, 2024Updated last year
yangarbiter / rare-spurious-correlation
View on GitHub
Understanding Rare Spurious Correlations in Neural Network
☆12Jun 5, 2022Updated 4 years ago
sb-jang / kodialogbench
View on GitHub
Code and data for "KoDialogBench: Evaluating Conversational Understanding of Language Models with Korean Dialogue Benchmark" (LREC-COLING…
☆18Apr 15, 2025Updated last year
peppertaco / Tavern
View on GitHub
☆31Apr 30, 2023Updated 3 years ago
mengdi-li / internally-rewarded-rl
View on GitHub
[ICML 2023] Code for paper "Internally Rewarded Reinforcement Learning"
☆13Jul 21, 2023Updated 2 years ago
gabrieletiboni / random-envs
View on GitHub
Collection of gym environments with support for domain randomization
☆10Dec 11, 2024Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
sarrrrry / PyTorchDL_GTA5
View on GitHub
This is the repository for loading GTA5 Dataset with PyTorch
☆12Nov 22, 2022Updated 3 years ago
huu4ontocord / MDEL
View on GitHub
Multi-Domain Expert Learning
☆67Jan 23, 2024Updated 2 years ago
PicoCreator / RWKV-LM-LoRA
View on GitHub
RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …
☆10Nov 3, 2023Updated 2 years ago
Makkarakiska / MaSuiteCore
View on GitHub
☆11Dec 28, 2020Updated 5 years ago
gianfelton / RFM-Segmentation-with-Quartiles-Jenks-Natural-Breaks-and-HDBSCAN
View on GitHub
☆10Jul 12, 2019Updated 6 years ago
SamLynnEvans / LSTM_with_attention
View on GitHub
Seq2seq using LSTM with attention from Luong et al
☆10Oct 2, 2018Updated 7 years ago
senya-ashukha / simple-gradient-boosting
View on GitHub
Very simple and short implementation of gradient boosting in 18 lines of code
☆10Sep 17, 2020Updated 5 years ago
naitri / SFM
View on GitHub
Structure From Motion : A python implementation to reconstruct a 3D scene and obtain camera poses with respect to scene
☆11Nov 16, 2022Updated 3 years ago
oKatanaaa / kolibrify
View on GitHub
Curriculum training of instruction-following LLMs with Unsloth
☆14Dec 15, 2025Updated 6 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
xfactlab / orpo
View on GitHub
Official repository for ORPO
☆481May 31, 2024Updated 2 years ago
RuisongZhou / icme2019
View on GitHub
字节跳动短视频理解比赛，代码带详细注释
☆18Apr 3, 2019Updated 7 years ago
avidale / dialogic
View on GitHub
Yet another common Python wrapper for Alice and Salut skills and bots in Telegram, VK, and Facebook
☆29Mar 16, 2023Updated 3 years ago
kvn219 / cluttered-mnist
View on GitHub
Experiments on cluttered mnist dataset with Tensorflow.
☆20Feb 3, 2017Updated 9 years ago
chrreisinger / OpenVC
View on GitHub
OpenVC, an open source VHDL compiler/simulator
☆20Oct 7, 2012Updated 13 years ago
staltz / mutant-attachable
View on GitHub
Utility for React components to easily subscribe to Mutant streams
☆13Dec 9, 2017Updated 8 years ago
hanggao-gh / InteractiveMemorySharingLLM
View on GitHub
☆22Oct 12, 2024Updated last year