chai-research / lmgym
Code base for internal reward models and PPO training
☆24 · Oct 1, 2023 · Updated 2 years ago
Alternatives and similar repositories for lmgym
Users who are interested in lmgym are comparing it to the libraries listed below.
- ChatGPT solutions for the MLE interview ☆14 · Dec 9, 2022 · Updated 3 years ago
- Concise Reasoning via Reinforcement Learning ☆13 · Apr 16, 2025 · Updated 10 months ago
- ☆12 · Dec 15, 2022 · Updated 3 years ago
- Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta ☆16 · Nov 11, 2024 · Updated last year
- DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails ☆30 · Feb 26, 2025 · Updated 11 months ago
- ☆18 · Dec 18, 2022 · Updated 3 years ago
- Chat data cleaning, filtering and deduplication pipeline. ☆21 · Jul 25, 2023 · Updated 2 years ago
- ☆20 · Oct 12, 2024 · Updated last year
- ☆27 · Aug 30, 2023 · Updated 2 years ago
- ☆32 · Mar 30, 2023 · Updated 2 years ago
- GPI-Space: Memory Driven Computing and Big Data ☆10 · Jan 2, 2025 · Updated last year
- Tools for content datamining and NLP at scale ☆44 · Jun 20, 2024 · Updated last year
- First-of-its-kind AI benchmark for evaluating the protection capabilities of large language model (LLM) guard systems (guardrails and saf… ☆50 · Dec 3, 2025 · Updated 2 months ago
- Self-evaluating RAG application on LangCheck docs ☆11 · Sep 10, 2025 · Updated 5 months ago
- Demo repository showcasing how to use reusable workflows to build artifact attestations ☆13 · Feb 2, 2026 · Updated 2 weeks ago
- ☆14 · Mar 21, 2024 · Updated last year
- Teaching a humanoid to walk(ish), then displaying in your browser (using tensorflow.js and reinforcement learning) ☆10 · Sep 7, 2020 · Updated 5 years ago
- Open-source Human Feedback Library ☆11 · Oct 25, 2023 · Updated 2 years ago
- A framework for few-shot evaluation of autoregressive language models. ☆12 · Jul 14, 2025 · Updated 7 months ago
- A collection of heat engines, based on the OpenAI Gym environment framework for use with reinforcement learning applications. ☆15 · Dec 20, 2021 · Updated 4 years ago
- PSI-MOD ontology for modified and unmodified amino acid residues ☆14 · Jan 8, 2026 · Updated last month
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best … ☆10 · Nov 3, 2023 · Updated 2 years ago
- ☆16 · Feb 22, 2025 · Updated 11 months ago
- Progress Web App template for Scripture App Builder ☆13 · Updated this week
- The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024. ☆13 · Jun 17, 2024 · Updated last year
- Simple getting started procedure for SciCat ☆11 · Updated this week
- Amplify your coding capabilities with AI - your smart co-pilot for an elevated coding experience. ☆14 · Feb 9, 2026 · Updated last week
- DreamSmooth: Improving Model-Based RL with Reward Smoothing (ICLR 2024) ☆12 · May 6, 2024 · Updated last year
- Code release for "Imagination Mechanism: Mesh Information Propagation for Enhancing Data Efficiency in Reinforcement Learning" ☆13 · Oct 7, 2023 · Updated 2 years ago
- code for polite ☆11 · Feb 28, 2024 · Updated last year
- A static website for a Chatbot with Azure OpenAI, Azure Text to Speech Services and Live2D ☆13 · Sep 4, 2024 · Updated last year
- A Python client library for accessing IQM quantum computers ☆13 · Mar 26, 2025 · Updated 10 months ago
- ☆37 · Nov 12, 2025 · Updated 3 months ago
- paper on dexpilot ☆15 · Oct 14, 2019 · Updated 6 years ago
- Open-source npm package to quickly create a basic set-up for applications using Express and Socket.io ☆11 · Dec 15, 2020 · Updated 5 years ago
- A Rust implementation of the Handshake and Lightning Network secure messaging protocol - based on Noise. ☆14 · Dec 9, 2019 · Updated 6 years ago
- Top 9 private leaderboard & Top 17 public leaderboard ☆10 · Dec 1, 2022 · Updated 3 years ago
- Quandela iQuHACK 2024 Remote Challenge ☆10 · Feb 4, 2024 · Updated 2 years ago
- Python package to infer tensions and pressures from 3D microscopy images of embryos and tissues ☆12 · Nov 4, 2025 · Updated 3 months ago