SCoRe: Training Language Models to Self-Correct via Reinforcement Learning
☆16Jan 24, 2025Updated last year
Alternatives and similar repositories for SCoRe
Users that are interested in SCoRe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Concise Reasoning via Reinforcement Learning☆13Apr 16, 2025Updated 11 months ago
- [ACL 2025] ⚖️ Temporally-aware MLLM for Biomedical Radiology Analysis and Report Generation. Flexible toolkit with MLLM backbone support,…☆29Mar 18, 2026Updated 3 weeks ago
- Unofficial Implementation of Selective Attention Transformer☆21Oct 31, 2024Updated last year
- ☆13Jul 14, 2024Updated last year
- ☆12Feb 27, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆14Oct 12, 2024Updated last year
- ☆11Jun 16, 2024Updated last year
- ☆14Apr 16, 2024Updated last year
- Binding Affinity Prediction using Deep learning models☆12Jun 9, 2021Updated 4 years ago
- ⚠️ ARCHIVED - All development moved to https://github.com/itbench-hub/ITBench/tree/main/scenarios☆15Feb 24, 2026Updated last month
- Analytical chemistry and epidemiology of street drugs☆25Aug 26, 2025Updated 7 months ago
- ☆22Oct 22, 2024Updated last year
- ☆28Jan 4, 2026Updated 3 months ago
- ☆54Feb 12, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆27Nov 25, 2025Updated 4 months ago
- Learning Protein-Ligand Properties with Atomic Environment Vectors☆10Apr 19, 2024Updated last year
- ☆51May 11, 2025Updated 11 months ago
- ☆11Oct 3, 2021Updated 4 years ago
- Implementation of AdaCQR(COLING 2025)☆15Dec 30, 2024Updated last year
- Example of Langchain-Elasticsearch integrations & RAG.☆12Sep 20, 2024Updated last year
- Dissecting the weight space of neural networks☆18Apr 16, 2021Updated 4 years ago
- Chapter 15 AlphaZero in book Deep Reinforcement Learning: code example of AlphaZero solving Gomoku game.☆36Feb 18, 2020Updated 6 years ago
- Constrained Decoding Project☆20Nov 10, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- This is a PyTorch implementation of a Transformer Decoder based model that plays chess.☆17Mar 15, 2024Updated 2 years ago
- 读图时代,从「花瓣」(huaban.com)上阅读每日更新的数据图☆47Dec 26, 2014Updated 11 years ago
- Implementation of DiffDock: Diffusion Steps, Twists, and Turns for Molecular Docking☆13Jun 22, 2023Updated 2 years ago
- Code and data for the EMNLP 2021 paper "Just Say No: Analyzing the Stance of Neural Dialogue Generation in Offensive Contexts". Coming so…☆17Jul 27, 2023Updated 2 years ago
- Contrastive learning and pre-trained encoder (CLAPE) for protein-small molecules binding (SMB) sites prediction☆19Aug 22, 2024Updated last year
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆40Jun 10, 2024Updated last year
- Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval☆52Oct 31, 2024Updated last year
- 哈工大模式识别与深度学习实验☆15Jun 20, 2022Updated 3 years ago
- INSCIT: Information-Seeking Conversations with Mixed-Initiative Interactions☆16Jan 21, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆16Mar 30, 2023Updated 3 years ago
- This is the repo for remote direct memory introspection.☆24Jun 21, 2023Updated 2 years ago
- ☆16Aug 14, 2019Updated 6 years ago
- GenRM-CoT: Data release for verification rationales☆67Oct 16, 2024Updated last year
- An innovative application designed to help pharmacists and pharmacy students quickly research FDA-approved drugs by retrieving relevant i…☆24Mar 24, 2025Updated last year
- ☆33May 27, 2025Updated 10 months ago
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference☆479Feb 19, 2026Updated last month