An experiment that applies Google Research's `ReasoningBank` technique to small language models (SLMs). It aims to show that the gains reported in the ReasoningBank paper also apply to much smaller, less capable models.
☆107Oct 14, 2025Updated 8 months ago
Alternatives and similar repositories for reasoning-bank-slm
Users that are interested in reasoning-bank-slm are comparing it to the libraries listed below.
- [AAAI 2026] Multimodal Deepresearcher: Generating Text-Chart Interleaved Reports From Scratch with Agentic Framework☆57Jun 8, 2026Updated last week
- [NeurIPS 2025 Spotlight] Implementation of "KLASS: KL-Guided Fast Inference in Masked Diffusion Models"☆32Jan 3, 2026Updated 5 months ago
- Evaluation of voting systems in Python.☆18Oct 26, 2025Updated 7 months ago
- ☆17Dec 8, 2023Updated 2 years ago
- ☆45Apr 28, 2026Updated last month
- Benchmark for Biophysical Sequence Optimization Algorithms☆22Apr 15, 2026Updated 2 months ago
- Official PyTorch implementation of "Query-Efficient Black-Box Red Teaming via Bayesian Optimization" (ACL'23)☆15Jul 9, 2023Updated 2 years ago
- SCoRe: Training Language Models to Self-Correct via Reinforcement Learning☆16May 14, 2026Updated last month
- Code for "Multi-Objective GFlowNets"☆20Jul 12, 2023Updated 2 years ago
- ☆16Nov 9, 2025Updated 7 months ago
- ☆17Jun 30, 2025Updated 11 months ago
- [AAAI 2026] Official Code for VQAThinker: Exploring Generalizable and Explainable Video Quality Assessment via Reinforcement Learning☆28Nov 28, 2025Updated 6 months ago
- Uncertainty-guided matting (ICML 2023)☆12Aug 3, 2023Updated 2 years ago
- ☆125May 13, 2026Updated last month
- WSDM 2021 Tutorial on Advances in Bias-aware Recommendation on the Web☆11Mar 8, 2021Updated 5 years ago
- Evolving LangChain agent architectures using the Quality-Diversity (QD) algorithm.☆16Aug 29, 2025Updated 9 months ago
- Multitask NLU architecture for text and token classification tasks.☆14Jan 7, 2023Updated 3 years ago
- [ICLR 2023] Soft Neighbors are Positive Supporters in Contrastive Visual Representation Learning☆15Aug 2, 2023Updated 2 years ago
- Some Pwn Challenges from winesap.☆14Aug 15, 2019Updated 6 years ago
- Skills to augment LLM thinking process, integrated with InfraNodus insight generation tool☆95Apr 24, 2026Updated last month
- A guide to avoiding pitfalls in the Xidian University operating systems course project☆10Sep 7, 2020Updated 5 years ago
- ☆22Mar 23, 2026Updated 2 months ago
- Ref-Diff: Zero-shot Referring Image Segmentation with Generative Models☆21May 29, 2025Updated last year
- Opara is a lightweight and resource-aware DNN Operator parallel scheduling framework to accelerate the execution of DNN inference on GPUs…☆23Dec 19, 2024Updated last year
- Competitive Programming Code Template☆10Nov 6, 2022Updated 3 years ago
- Official PyTorch implementation of "Query-Efficient and Scalable Black-Box Adversarial Attacks on Discrete Sequential Data via Bayesian O…☆26Sep 26, 2023Updated 2 years ago
- Persian Word Embedding Using FastText Pre-trained Model☆13May 29, 2026Updated 3 weeks ago
- (TMM 2025) Official repository of paper "A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object Detection"☆26Mar 14, 2025Updated last year
- ☆13Jan 14, 2026Updated 5 months ago
- 💾A moleculer service mixin for minio and S3 💾☆15Sep 16, 2022Updated 3 years ago
- ☆16Nov 19, 2021Updated 4 years ago
- This repository is the official implementation of our AAAI 2025 accepted paper: "PhysAug: A Physical-guided and Frequency-based Data Aug…☆24May 16, 2025Updated last year
- ☆16Aug 5, 2024Updated last year
- ☆11Oct 11, 2023Updated 2 years ago
- A bridge to launch managed applications (.NET) into MS signed exe via dll injection☆13Aug 29, 2020Updated 5 years ago
- ☆16Jan 12, 2023Updated 3 years ago
- Hexagon-MLIR is a compiler toolchain for compiling and executing AI kernels and models on Qualcomm Hexagon Neural Processing Units (NPUs)…☆155Jun 3, 2026Updated 2 weeks ago
- The guideline for pod.☆10Jun 19, 2020Updated 6 years ago
- Look Back to Reason Forward: Revisitable Memory for Long-Context LLM Agents☆41Apr 13, 2026Updated 2 months ago