SCoRe: Training Language Models to Self-Correct via Reinforcement Learning
☆16May 14, 2026Updated 3 weeks ago
Alternatives and similar repositories for SCoRe
Users that are interested in SCoRe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Jan 7, 2025Updated last year
- ☆50Jan 6, 2026Updated 5 months ago
- Concise Reasoning via Reinforcement Learning☆13Apr 16, 2025Updated last year
- ☆13May 16, 2025Updated last year
- Official source code of ICDM2023 paper "Hypergraph Contrastive Learning for Drug Trafficking Community Detection".☆11Nov 3, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆14Apr 18, 2020Updated 6 years ago
- 🏛️ Directive · OpenClaw Multi-Agent Orchestration System — 10 AI agents modeled after the U.S. Federal Executive Branch. Dual independ…☆73Mar 8, 2026Updated 3 months ago
- Fully open reproduction of DeepSeek-R1☆11Mar 24, 2025Updated last year
- A simple, elegant web tool that allows you to create custom RSS feeds for arXiv search queries. Stay up-to-date with the latest research …☆36Mar 21, 2026Updated 2 months ago
- JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf☆23May 5, 2025Updated last year
- Pretraining summarization models using a corpus of nonsense☆13Sep 28, 2021Updated 4 years ago
- Unofficial Implementation of Selective Attention Transformer☆20Oct 31, 2024Updated last year
- ☆13Jul 14, 2024Updated last year
- ☆10Dec 26, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- pdf to markdown with Python3☆11Oct 30, 2019Updated 6 years ago
- ☆12Feb 27, 2025Updated last year
- ☆14Oct 12, 2024Updated last year
- Detection of rootkit file hiding activities through analysis of shifts in kernel function execution times.☆29Sep 10, 2025Updated 9 months ago
- MPLS VPNs (VPLS, VPWS, L3VPN) on eNSP using Huawei Routers☆11Feb 11, 2020Updated 6 years ago
- papers about reinforcement learning☆13Jan 4, 2021Updated 5 years ago
- ☆11Jun 16, 2024Updated last year
- A personal AI therapist to help you with your mental health☆26Nov 29, 2025Updated 6 months ago
- Code for the "Long Context Needs Some R&R" paper.☆12Mar 11, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Binding Affinity Prediction using Deep learning models☆12Jun 9, 2021Updated 5 years ago
- ☆22Oct 22, 2024Updated last year
- ☆27May 12, 2026Updated last month
- ☆52Feb 12, 2025Updated last year
- ☆52May 11, 2025Updated last year
- Implementation of AdaCQR(COLING 2025)☆15Dec 30, 2024Updated last year
- This project implements a Reinforcement Learning (RL) enhanced Retrieval-Augmented Generation (RAG) system that optimizes document retrie…☆25Apr 27, 2025Updated last year
- Chapter 15 AlphaZero in book Deep Reinforcement Learning: code example of AlphaZero solving Gomoku game.☆36Feb 18, 2020Updated 6 years ago
- Constrained Decoding Project☆20Nov 10, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This is a PyTorch implementation of a Transformer Decoder based model that plays chess.☆17Mar 15, 2024Updated 2 years ago
- Paper Reproduction Google SCoRE(Training Language Models to Self-Correct via Reinforcement Learning)☆143Sep 21, 2024Updated last year
- Implementation of DiffDock: Diffusion Steps, Twists, and Turns for Molecular Docking☆13Jun 22, 2023Updated 2 years ago
- Code for "What really matters in matrix-whitening optimizers?"☆24Oct 31, 2025Updated 7 months ago
- Simplistic Pytorch Implementation of the Dreamer-RL☆20May 7, 2025Updated last year
- ☆10Mar 6, 2023Updated 3 years ago
- A multi-functional ESP32 keyboard with 5 customizable keys, with alias called "ESP32 Keybrick"☆31Updated this week