SCoRe: Training Language Models to Self-Correct via Reinforcement Learning
☆16 · Jan 24, 2025 · Updated last year
Alternatives and similar repositories for SCoRe
Users who are interested in SCoRe are comparing it to the repositories listed below.
- Concise Reasoning via Reinforcement Learning ☆13 · Apr 16, 2025 · Updated 10 months ago
- ☆19 · Mar 25, 2025 · Updated 11 months ago
- Unofficial Implementation of Selective Attention Transformer ☆20 · Oct 31, 2024 · Updated last year
- Codebase for fine-tuning Llama2 70B to generate math test questions and answers. ☆11 · Aug 30, 2024 · Updated last year
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models ☆40 · Jun 10, 2024 · Updated last year
- ☆11 · May 16, 2025 · Updated 9 months ago
- Concurrency library ☆17 · Oct 13, 2024 · Updated last year
- ☆11 · Dec 23, 2024 · Updated last year
- Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models". ☆12 · Oct 14, 2024 · Updated last year
- ☆44 · May 6, 2025 · Updated 9 months ago
- Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data ☆45 · Feb 18, 2025 · Updated last year
- pdf to markdown with Python3 ☆11 · Oct 30, 2019 · Updated 6 years ago
- A collection of heat engines, based on the OpenAI Gym environment framework for use with reinforcement learning applications. ☆15 · Dec 20, 2021 · Updated 4 years ago
- Material parsers and other tools and scripts, initially developed for Grobid Superconductor ☆13 · Feb 21, 2025 · Updated last year
- LLM Skirmish ☆44 · Feb 3, 2026 · Updated 3 weeks ago
- An active inference model of Lacanian psychoanalysis ☆15 · Jun 7, 2025 · Updated 8 months ago
- ☆11 · Jan 11, 2022 · Updated 4 years ago
- CANdle - a library for using USB-FDCAN dongle and communicating with md80 drives ☆15 · Sep 15, 2025 · Updated 5 months ago
- Python Inference Script(PyIS) ☆19 · Aug 30, 2022 · Updated 3 years ago
- Teaching a humanoid to walk(ish), then displaying in your browser (using tensorflow.js and reinforcement learning) ☆10 · Sep 7, 2020 · Updated 5 years ago
- Develop C++/CUDA extensions with PyTorch like Python scripts ☆10 · Jan 7, 2026 · Updated last month
- The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024. ☆13 · Jun 17, 2024 · Updated last year
- ☆10 · Apr 7, 2024 · Updated last year
- ☆16 · Feb 22, 2025 · Updated last year
- DreamSmooth: Improving Model-Based RL with Reward Smoothing (ICLR 2024) ☆12 · May 6, 2024 · Updated last year
- ☆12 · Jul 4, 2024 · Updated last year
- code for polite ☆11 · Feb 28, 2024 · Updated 2 years ago
- Code repository for scenarios and environment setup as part of ITBench ☆15 · Feb 19, 2026 · Updated last week
- Models for packages and the resources they contain. ☆14 · Mar 10, 2024 · Updated last year
- [AAAI2024] An official pytorch implement of the paper: Vision-Language Pre-training with Object Contrastive Learning for 3D Scene Underst… ☆13 · Dec 8, 2024 · Updated last year
- ☆11 · Jun 16, 2024 · Updated last year
- Code release for "Imagination Mechanism: Mesh Information Propagation for Enhancing Data Efficiency in Reinforcement Learning" ☆13 · Oct 7, 2023 · Updated 2 years ago
- The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models" ☆29 · Feb 23, 2026 · Updated last week
- MPLS VPNs (VPLS, VPWS, L3VPN) on eNSP using Huawei Routers ☆11 · Feb 11, 2020 · Updated 6 years ago
- Original VinVL visual backbone with simplified APIs to easily extract features, boxes, object detections, in a few lines of Python code. ☆11 · Nov 27, 2022 · Updated 3 years ago
- ☆14 · Mar 21, 2024 · Updated last year
- Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?" ☆110 · Oct 11, 2025 · Updated 4 months ago
- ☆47 · Nov 8, 2024 · Updated last year
- Official Repo of SimTeG ☆43 · Mar 29, 2024 · Updated last year