An experiment that applies Google Research's `ReasoningBank` technique to Small Language Models. It aims to show that the gains reported in the ReasoningBank paper also apply to much smaller, less capable models.
☆104Oct 14, 2025Updated 6 months ago
Alternatives and similar repositories for reasoning-bank-slm
Users that are interested in reasoning-bank-slm are comparing it to the libraries listed below.
- ☆31Feb 8, 2026Updated 3 months ago
- ☆10Aug 22, 2023Updated 2 years ago
- (CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation☆36Feb 28, 2026Updated 2 months ago
- ☆41Apr 28, 2026Updated last week
- [NeurIPS 2025 Spotlight] Implementation of "KLASS: KL-Guided Fast Inference in Masked Diffusion Models"☆31Jan 3, 2026Updated 4 months ago
- Evaluation of voting systems in Python.☆16Oct 26, 2025Updated 6 months ago
- Benchmark for Biophysical Sequence Optimization Algorithms☆22Apr 15, 2026Updated 3 weeks ago
- Lean formalizations for the paper "Fel's conjecture on syzygies of numerical semigroups"☆40Mar 25, 2026Updated last month
- Official PyTorch implementation of "Query-Efficient Black-Box Red Teaming via Bayesian Optimization" (ACL'23)☆15Jul 9, 2023Updated 2 years ago
- SCoRe: Training Language Models to Self-Correct via Reinforcement Learning☆16Jan 24, 2025Updated last year
- Official implementation of NeurIPS'23 paper "Sample-efficient Multi-objective Molecular Optimization with GFlowNets"☆20Dec 24, 2023Updated 2 years ago
- DisTime: Distribution-based Time Representation for Video Large Language Models.☆20Jul 10, 2025Updated 10 months ago
- A-MEM: Agentic Memory for LLM Agents☆334Mar 15, 2026Updated last month
- This is AI implementation (not official) of the DreamGym framework from the paper "Scaling Agent Learning via Experience Synthesis" (arXi…☆40Nov 9, 2025Updated 6 months ago
- [AAAI 2026] Official Code for VQAThinker: Exploring Generalizable and Explainable Video Quality Assessment via Reinforcement Learning☆27Nov 28, 2025Updated 5 months ago
- [NeurIPS 2025] Panoptic Captioning: An Equivalence Bridge for Image and Text☆36Jan 31, 2026Updated 3 months ago
- ☆18Jul 8, 2025Updated 10 months ago
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆17Feb 9, 2026Updated 3 months ago
- Natural language dataset for training a Conversational Recommender System☆11Jul 9, 2019Updated 6 years ago
- ☆13Jul 14, 2024Updated last year
- 🧌 Live2d models for cnblog themes.☆13Apr 3, 2022Updated 4 years ago
- Multitask NLU architecture for text and token classification tasks.☆14Jan 7, 2023Updated 3 years ago
- Repo for the BBCAVS10k distribution☆10Nov 27, 2024Updated last year
- Official PyTorch implementation of "LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging" (ICML 2024)☆31Apr 13, 2026Updated 3 weeks ago
- PyTorch implements `Image Super-Resolution Using Very Deep Residual Channel Attention Networks` paper.☆15Dec 6, 2022Updated 3 years ago
- [CVPR2025] Official implementation of RAM☆29Nov 4, 2025Updated 6 months ago
- Official Code for Local Search GFlowNets (ICLR 2024 Spotlight)☆25Feb 27, 2025Updated last year
- Code of ICLR 2025 paper "DynaPrompt: Dynamic Test-Time Prompt Tuning"☆22Jan 29, 2025Updated last year
- Opara is a lightweight and resource-aware DNN Operator parallel scheduling framework to accelerate the execution of DNN inference on GPUs…☆23Dec 19, 2024Updated last year
- Hexagon-MLIR is a compiler toolchain for compiling and executing AI kernels and models on Qualcomm Hexagon Neural Processing Units (NPUs)…☆133May 1, 2026Updated last week
- (TMM 2025) Official repository of paper "A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object Detection"☆26Mar 14, 2025Updated last year
- ☆13Jan 14, 2026Updated 3 months ago
- Container-free RL framework for training software engineering agents☆56Mar 4, 2026Updated 2 months ago
- Implementation for Phenotype prediction from single-cell RNA-seq data using attention-based neural networks (Bioinformatics 2024).☆13Jul 15, 2024Updated last year
- 💾A moleculer service mixin for minio and S3 💾☆15Sep 16, 2022Updated 3 years ago
- ☆15Nov 19, 2021Updated 4 years ago
- Volcengine Object Storage(TOS) JavaScript SDK☆11Apr 7, 2026Updated last month
- Automated Python bot for securing limited-edition Labubu collectibles from Pop Mart. Features proxy rotation, anti-detection, and schedul…☆19Aug 6, 2025Updated 9 months ago
- [WIP] Code for LangToMo☆21Mar 19, 2026Updated last month