An experiment that applies Google Research's `ReasoningBank` technique to Small Language Models. This experiment hopes to show that the same gains from the ReasoningBank paper also applies to much smaller, less capable models.
☆105Oct 14, 2025Updated 7 months ago
Alternatives and similar repositories for reasoning-bank-slm
Users that are interested in reasoning-bank-slm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [AAAI 2026] Multimodal Deepresearcher: Generating Text-Chart Interleaved Reports From Scratch with Agentic Framework☆54Jan 25, 2026Updated 4 months ago
- (CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation☆36Feb 28, 2026Updated 3 months ago
- ☆17Dec 8, 2023Updated 2 years ago
- About Official PyTorch implementation of "Query-Efficient Black-Box Red Teaming via Bayesian Optimization" (ACL'23)☆15Jul 9, 2023Updated 2 years ago
- SCoRe: Training Language Models to Self-Correct via Reinforcement Learning☆16May 14, 2026Updated 2 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for "Multi-Objective GFlowNets"☆20Jul 12, 2023Updated 2 years ago
- DisTime: Distribution-based Time Representation for Video Large Language Models.☆20Jul 10, 2025Updated 10 months ago
- ☆27Jan 12, 2026Updated 4 months ago
- ☆114May 13, 2026Updated 2 weeks ago
- [CVPR 2024] Selective, Interpretable and Motion Consistent Privacy Attribute Obfuscation for Action Recognition☆12Mar 20, 2024Updated 2 years ago
- [NeurIPS 2025] Panoptic Captioning: An Equivalence Bridge for Image and Text☆37Jan 31, 2026Updated 3 months ago
- ☆18Jul 8, 2025Updated 10 months ago
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆17Feb 9, 2026Updated 3 months ago
- Natural language dataset for training a Conversational Recommender System☆11Jul 9, 2019Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆13Jul 14, 2024Updated last year
- Multitask NLU architecture for text and token classification tasks.☆14Jan 7, 2023Updated 3 years ago
- ACL 2026☆27Nov 19, 2025Updated 6 months ago
- Repo for the BBCAVS10k distribution☆10Nov 27, 2024Updated last year
- PyTorch implements `Image Super-Resolution Using Very Deep Residual Channel Attention Networks` paper.☆15Dec 6, 2022Updated 3 years ago
- [CVPR2025] Official implementation of RAM☆29Nov 4, 2025Updated 6 months ago
- Opara is a lightweight and resource-aware DNN Operator parallel scheduling framework to accelerate the execution of DNN inference on GPUs…☆23Dec 19, 2024Updated last year
- Competitive Programming Code Template☆10Nov 6, 2022Updated 3 years ago
- (TMM 2025) Official repository of paper "A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object Detection"☆26Mar 14, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆13Jan 14, 2026Updated 4 months ago
- Code for the ECCV22 paper Demystifying Unsupervised Semantic Correspondence Estimation☆14Oct 18, 2022Updated 3 years ago
- Implementation for Phenotype prediction from single-cell RNA-seq data using attention-based neural networks (Bioinformatics 2024).☆13Jul 15, 2024Updated last year
- ☆15Nov 19, 2021Updated 4 years ago
- Mainly on text documents. Implemented a Mini Search Engine using different algorithms and then summaried documents using lexrank.☆11Jan 19, 2018Updated 8 years ago
- ☆11Oct 31, 2021Updated 4 years ago
- Multi-label Node Classification☆15Jun 3, 2024Updated last year
- ☆11Oct 11, 2023Updated 2 years ago
- Ever wondered how popular your GitHub repo is compared to others?☆17Feb 14, 2026Updated 3 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning☆39Feb 9, 2026Updated 3 months ago
- ☆10Mar 24, 2023Updated 3 years ago
- The guideline for pod.☆10Jun 19, 2020Updated 5 years ago
- List of papers on video-centric robot learning☆23Nov 16, 2024Updated last year
- SGLang Kernel Wheel Index☆22May 22, 2026Updated last week
- Scribble-Supervised Semantic Segmentation by Uncertainty Reduction on Neural Representation and Self-Supervision on Neural Eigenspace, IC…☆23Apr 19, 2023Updated 3 years ago
- ☆12Feb 17, 2023Updated 3 years ago