An experiment that applies Google Research's `ReasoningBank` technique to Small Language Models. The experiment aims to show that the gains reported in the ReasoningBank paper also apply to much smaller, less capable models.
☆103 · Oct 14, 2025 · Updated 6 months ago
Alternatives and similar repositories for reasoning-bank-slm
Users that are interested in reasoning-bank-slm are comparing it to the libraries listed below.
- ME-GraphAU on Video ☆11 · May 10, 2024 · Updated last year
- (CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation ☆34 · Feb 28, 2026 · Updated last month
- ☆35 · Mar 13, 2026 · Updated last month
- ☆17 · Dec 8, 2023 · Updated 2 years ago
- Benchmark for Biophysical Sequence Optimization Algorithms ☆21 · May 21, 2025 · Updated 10 months ago
- Official PyTorch implementation of "Query-Efficient Black-Box Red Teaming via Bayesian Optimization" (ACL'23) ☆15 · Jul 9, 2023 · Updated 2 years ago
- SCoRe: Training Language Models to Self-Correct via Reinforcement Learning ☆16 · Jan 24, 2025 · Updated last year
- DisTime: Distribution-based Time Representation for Video Large Language Models. ☆20 · Jul 10, 2025 · Updated 9 months ago
- A-MEM: Agentic Memory for LLM Agents ☆319 · Mar 15, 2026 · Updated last month
- ☆97 · Mar 6, 2026 · Updated last month
- [AAAI 2026] Official Code for VQAThinker: Exploring Generalizable and Explainable Video Quality Assessment via Reinforcement Learning ☆27 · Nov 28, 2025 · Updated 4 months ago
- This is the code repo for the paper AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play (NeurIPS 2025 Spotl… ☆25 · Sep 29, 2025 · Updated 6 months ago
- Just a template for quickly creating a python library. ☆10 · Apr 13, 2026 · Updated last week
- WSDM 2021 Tutorial on Advances in Bias-aware Recommendation on the Web ☆11 · Mar 8, 2021 · Updated 5 years ago
- [NeurIPS 2025] Panoptic Captioning: An Equivalence Bridge for Image and Text ☆35 · Jan 31, 2026 · Updated 2 months ago
- ☆18 · Jul 8, 2025 · Updated 9 months ago
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions". ☆17 · Feb 9, 2026 · Updated 2 months ago
- ACL 2026 ☆26 · Nov 19, 2025 · Updated 5 months ago
- Natural language dataset for training a Conversational Recommender System ☆11 · Jul 9, 2019 · Updated 6 years ago
- ☆13 · Jul 14, 2024 · Updated last year
- Multitask NLU architecture for text and token classification tasks. ☆14 · Jan 7, 2023 · Updated 3 years ago
- [CVPR2025] Official implementation of RAM ☆29 · Nov 4, 2025 · Updated 5 months ago
- Code of ICLR 2025 paper "DynaPrompt: Dynamic Test-Time Prompt Tuning" ☆22 · Jan 29, 2025 · Updated last year
- Data for EMNLP 2022 paper "arXivEdits: Understanding the Human Revision Process in Scientific Writing". ☆14 · Sep 30, 2023 · Updated 2 years ago
- Competitive Programming Code Template ☆11 · Nov 6, 2022 · Updated 3 years ago
- Official PyTorch implementation of "Query-Efficient and Scalable Black-Box Adversarial Attacks on Discrete Sequential Data via Bayesian O… ☆26 · Sep 26, 2023 · Updated 2 years ago
- Look Back to Reason Forward: Revisitable Memory for Long-Context LLM Agents ☆31 · Updated this week
- API to extract data from wikiHow ☆17 · Jul 10, 2021 · Updated 4 years ago
- ☆13 · Jan 14, 2026 · Updated 3 months ago
- Container-free RL framework for training software engineering agents ☆53 · Mar 4, 2026 · Updated last month
- Implementation for Phenotype prediction from single-cell RNA-seq data using attention-based neural networks (Bioinformatics 2024). ☆13 · Jul 15, 2024 · Updated last year
- ☆15 · Nov 19, 2021 · Updated 4 years ago
- This repository is the official implementation of our AAAI 2025 accepted paper: "PhysAug: A Physical-guided and Frequency-based Data Aug… ☆22 · May 16, 2025 · Updated 11 months ago
- [WIP] Code for LangToMo ☆20 · Mar 19, 2026 · Updated last month
- Mainly on text documents. Implemented a Mini Search Engine using different algorithms and then summarized documents using lexrank. ☆11 · Jan 19, 2018 · Updated 8 years ago
- ☆11 · Oct 31, 2021 · Updated 4 years ago
- Multi-label Node Classification ☆14 · Jun 3, 2024 · Updated last year
- ☆11 · Oct 11, 2023 · Updated 2 years ago
- R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning ☆38 · Feb 9, 2026 · Updated 2 months ago