Code base for internal reward models and PPO training
☆24Oct 1, 2023Updated 2 years ago
Alternatives and similar repositories for lmgym
Users that are interested in lmgym are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ChatGPT solutions for the MLE interview☆14Dec 9, 2022Updated 3 years ago
- ☆12Dec 15, 2022Updated 3 years ago
- DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails☆32Feb 26, 2025Updated last year
- Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta☆16Nov 11, 2024Updated last year
- Concise Reasoning via Reinforcement Learning☆13Apr 16, 2025Updated 11 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆27Aug 30, 2023Updated 2 years ago
- ☆18Dec 18, 2022Updated 3 years ago
- ☆12Sep 21, 2024Updated last year
- Scripts to create speech corpora from open.bible☆13Jan 3, 2022Updated 4 years ago
- Lightblue LLM Eval Framework: tengu, elyza100, ja-mtbench, rakuda☆18Jan 6, 2026Updated 2 months ago
- An interactive and API powered CLIPDraw☆26Nov 4, 2022Updated 3 years ago
- Tools for content datamining and NLP at scale☆44Jun 20, 2024Updated last year
- ☆29Apr 30, 2023Updated 2 years ago
- Collection of gym environments with support for domain randomization☆10Dec 11, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Multi-Domain Expert Learning☆67Jan 23, 2024Updated 2 years ago
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆10Nov 3, 2023Updated 2 years ago
- Models and Codes for the paper Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions☆14Aug 6, 2018Updated 7 years ago
- Seq2seq using LSTM with attention from Luong et al☆10Oct 2, 2018Updated 7 years ago
- A unified framework to evaluate path reasoning methods across multiple beyond accuracy dimension and path (explanation) quality perspecti…☆13Mar 20, 2024Updated 2 years ago
- Code for the DataPipes article☆15Jun 14, 2022Updated 3 years ago
- Curriculum training of instruction-following LLMs with Unsloth☆14Dec 15, 2025Updated 3 months ago
- Official repository for ORPO☆473May 31, 2024Updated last year
- Lossless normalization of uppercase characters☆11Jul 3, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- TensorFlow implementation of the Traversal Network (TNet) architecture, presented in "Hard-Attention for Scalable Image Classification" (…☆17Jun 17, 2022Updated 3 years ago
- Utility for React components to easily subscribe to Mutant streams☆13Dec 9, 2017Updated 8 years ago
- ☆20Oct 12, 2024Updated last year
- Attempt at cog wrapper for segmind/SSD-1B☆10Dec 11, 2023Updated 2 years ago
- Top 9 private leaderboard & Top 17 public leaderboard☆10Dec 1, 2022Updated 3 years ago
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆47Mar 20, 2025Updated last year
- Effortlessly Create Engaging and Informative Threads in Minutes☆14Feb 3, 2023Updated 3 years ago
- Gradio UI for RWKV LLM☆27Feb 21, 2023Updated 3 years ago
- Collection of Hashicorp's Nomad☆13Apr 3, 2017Updated 8 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆11Dec 21, 2017Updated 8 years ago
- code for CoRL 2020 paper "Contrastive Variational Model-Based Reinforcement Learning for Complex Observations"☆24Dec 29, 2021Updated 4 years ago
- NanoGPT (124M) quality in 2.67B tokens☆28Sep 17, 2025Updated 6 months ago
- USB Hid handler for nodejs☆11Sep 30, 2022Updated 3 years ago
- Rust AV1 Decoder☆15Jun 19, 2019Updated 6 years ago
- ☆37Nov 12, 2025Updated 4 months ago
- Rich text editor with AI text generation using TipTap project.☆12Nov 15, 2023Updated 2 years ago