Code base for internal reward models and PPO training
☆24Oct 1, 2023Updated 2 years ago
Alternatives and similar repositories for lmgym
Users who are interested in lmgym are comparing it to the libraries listed below
- Concise Reasoning via Reinforcement Learning☆13Apr 16, 2025Updated 10 months ago
- ChatGPT solutions for the MLE interview☆14Dec 9, 2022Updated 3 years ago
- ☆12Dec 15, 2022Updated 3 years ago
- Fast whitespace correction with Transformers☆17Aug 22, 2025Updated 6 months ago
- Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta☆16Nov 11, 2024Updated last year
- DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails☆31Feb 26, 2025Updated last year
- ☆18Dec 18, 2022Updated 3 years ago
- ☆20Oct 12, 2024Updated last year
- Algorithms for optimization tasks (operations research)☆19Sep 11, 2023Updated 2 years ago
- Vietnamese translation of the 100 NLP exercises (updated for the 2020 edition), translated from 言語処理100本ノック 2020 (https://nlp100.github.io/ja)☆24Jun 8, 2020Updated 5 years ago
- ☆32Mar 30, 2023Updated 2 years ago
- Yet another common Python wrapper for Alice and Salut skills and bots in Telegram, VK, and Facebook☆28Mar 16, 2023Updated 2 years ago
- Tools for content datamining and NLP at scale☆44Jun 20, 2024Updated last year
- First-of-its-kind AI benchmark for evaluating the protection capabilities of large language model (LLM) guard systems (guardrails and saf…☆51Dec 3, 2025Updated 3 months ago
- Self-evaluating RAG application on LangCheck docs☆11Sep 10, 2025Updated 5 months ago
- Simple getting started procedure for SciCat☆11Updated this week
- code for polite☆11Feb 28, 2024Updated 2 years ago
- Amplify your coding capabilities with AI - your smart co-pilot for an elevated coding experience.☆14Feb 18, 2026Updated 2 weeks ago
- A framework for few-shot evaluation of autoregressive language models.☆12Jul 14, 2025Updated 7 months ago
- The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024.☆13Jun 17, 2024Updated last year
- The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"☆30Updated this week
- ☆14Mar 21, 2024Updated last year
- ☆11Jan 11, 2022Updated 4 years ago
- Paper dataset for "Factored Verification: Detecting and Reducing Hallucination in Summaries of Academic Papers"☆12Oct 20, 2024Updated last year
- Open-source Human Feedback Library☆11Oct 25, 2023Updated 2 years ago
- A collection of heat engines, based on the OpenAI Gym environment framework for use with reinforcement learning applications.☆15Dec 20, 2021Updated 4 years ago
- PSI-MOD ontology for modified and unmodified amino acid residues☆14Jan 8, 2026Updated 2 months ago
- Code release for "Imagination Mechanism: Mesh Information Propagation for Enhancing Data Efficiency in Reinforcement Learning"☆13Oct 7, 2023Updated 2 years ago
- gammcor code☆11Sep 25, 2025Updated 5 months ago
- A Python client library for accessing IQM quantum computers☆12Mar 26, 2025Updated 11 months ago
- ☆16Feb 22, 2025Updated last year
- A NOMAD plugin containing base sections for material processing.☆11Jan 20, 2026Updated last month
- DreamSmooth: Improving Model-Based RL with Reward Smoothing (ICLR 2024)☆12May 6, 2024Updated last year
- IonQ iQuHACK 2024 Remote Challenge☆11Feb 3, 2024Updated 2 years ago
- ☆11Jan 12, 2023Updated 3 years ago
- Neural Networks for penetration testing. Part of active research.☆13Jun 21, 2022Updated 3 years ago
- ☆10Jul 12, 2019Updated 6 years ago
- ☆14Nov 11, 2024Updated last year
- A Toolkit for Fine-Tuning Large Language Models with LoRA and DeepSpeed☆11Apr 14, 2023Updated 2 years ago