maple-research-lab/SLOT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/maple-research-lab/SLOT)

maple-research-lab / SLOT

☆112

Alternatives and similar repositories for SLOT

Users that are interested in SLOT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

UbiquantAI / one-shot-em
View on GitHub
One-shot Entropy Minimization
☆190Jun 13, 2025Updated last year
QingyangZhang / EMPO
View on GitHub
[NeurIPS25 Spotlight] EMPO, A Fully Unsupervised RLVR Method
☆103Nov 24, 2025Updated 8 months ago
QingyangZhang / Label-Free-RLVR
View on GitHub
☆311Jul 6, 2025Updated last year
MasterVito / DAC-RL
View on GitHub
Official Repo for DAC-RL: Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability
☆16Feb 26, 2026Updated 5 months ago
pprp / ACBench
View on GitHub
[ICML25] Agentic Compression Benchmark (ACBench)
☆17Jul 2, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ruixin31 / Spurious_Rewards
View on GitHub
☆361Jul 29, 2025Updated last year
PRIME-RL / TTRL
View on GitHub
[NeurIPS 2025] TTRL: Test-Time Reinforcement Learning
☆1,103Apr 15, 2026Updated 3 months ago
TianHongZXY / RLVR-Decomposed
View on GitHub
[NeurIPS 2025] Implementation for the paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning"
☆166Mar 2, 2026Updated 4 months ago
xiwenc1 / DRA-GRPO
View on GitHub
Official code for the paper: DRA-GRPO: Exploring Diversity-Aware Reward Adjustment for R1-Zero-Like Training of Large Language Models
☆24Jan 6, 2026Updated 6 months ago
zwhong714 / PSFT
View on GitHub
[ICLR 2026] PSFT is a trust-region–inspired fine-tuning objective that views SFT as a policy gradient method with constant advantages, co…
☆39Sep 9, 2025Updated 10 months ago
hkust-nlp / model-task-align-rl
View on GitHub
[ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".
☆18Feb 9, 2026Updated 5 months ago
XLearning-SCU / 2025-ICML-VISA
View on GitHub
Official Implementation of Visual Abstraction: A Plug-and-Play Approach for Text-Visual Retrieval
☆26Jul 14, 2025Updated last year
Fhujinwu / TLM
View on GitHub
[ICML2025] Test-Time Learning for Large Language Models
☆57Jan 31, 2026Updated 5 months ago
JUNJIE99 / VISTA_Evaluation_FineTuning
View on GitHub
Evaluation code and datasets for the ACL 2024 paper, VISTA: Visualized Text Embedding for Universal Multi-Modal Retrieval. The original c…
☆48Nov 16, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
liushulinle / UloRL
View on GitHub
An Ultra-Long Output Reinforcement Learning Approach
☆23Jul 31, 2025Updated 11 months ago
GeWu-Lab / MokA
View on GitHub
MokA: Multimodal Low-Rank Adaptation for MLLMs
☆92Dec 30, 2025Updated 6 months ago
princeton-nlp / ELIZA-Transformer
View on GitHub
[NAACL 2025] Representing Rule-based Chatbots with Transformers
☆23Feb 9, 2025Updated last year
WNJXYK / DeCoOp
View on GitHub
☆16Jun 4, 2024Updated 2 years ago
MasterVito / SwS
View on GitHub
Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning
☆42Nov 11, 2025Updated 8 months ago
GAIR-NLP / OctoThinker
View on GitHub
Revisiting Mid-training in the Era of Reinforcement Learning Scaling
☆189Jul 23, 2025Updated last year
DripNowhy / Sherlock
View on GitHub
[NeurIPS 2025] Official Implementation of paper "Sherlock: Self-Correcting Reasoning in Vision-Language Models"
☆31Jun 4, 2026Updated last month
dropbox / low-rank-llama2
View on GitHub
Low-Rank Llama Custom Training
☆23Mar 27, 2024Updated 2 years ago
zxk1212 / OODD
View on GitHub
[CVPR2025] The implementation of the paper "OODD: Test-time Out-of-Distribution Detection with Dynamic Dictionary".
☆19May 9, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
PRIME-RL / Entropy-Mechanism-of-RL
View on GitHub
The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.
☆446Jul 11, 2025Updated last year
RyanLiu112 / GenPRM
View on GitHub
[AAAI 2026] Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".
☆102Nov 8, 2025Updated 8 months ago
AndreHe02 / rewarding-unlikely-release
View on GitHub
☆15Jun 10, 2025Updated last year
zqOuO / GWT
View on GitHub
☆13May 4, 2026Updated 2 months ago
zzh-SJTU / E5-Hierarchical-Table-Analysis
View on GitHub
The official code for NAACL 2024 paper: $E^5$: Zero-shot Hierarchical Table Analysis using Augmented LLMs via Explain, Extract, Execute, …
☆15Jun 23, 2024Updated 2 years ago
hahahawu / Long-to-Short-via-Model-Merging
View on GitHub
Model merging is a highly efficient approach for long-to-short reasoning.
☆103Oct 15, 2025Updated 9 months ago
meituan / MemOCR
View on GitHub
MemOCR: an OCR-driven visual memory agent.
☆33May 17, 2026Updated 2 months ago
ZhangAIPI / YOPO_MLLM_Pruning
View on GitHub
Pruning the VLLMs
☆106Dec 9, 2024Updated last year
TEAM-ARM / arm
View on GitHub
[NeurIPS'25 Spotlight] ARM: Adaptive Reasoning Model
☆68Apr 6, 2026Updated 3 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
Optimization-AI / DisCO
View on GitHub
NeurIPS 2025: Discriminative Constrained Optimization for Reinforcing Large Reasoning Models
☆53Mar 14, 2026Updated 4 months ago
microsoft / data-efficacy
View on GitHub
Data Efficacy for Language Model Training
☆52May 29, 2026Updated 2 months ago
suu990901 / KlearReasoner
View on GitHub
Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization
☆82Dec 25, 2025Updated 7 months ago
L1aoXingyu / llm-infer-bench
View on GitHub
☆12Sep 1, 2023Updated 2 years ago
ypwang61 / One-Shot-RLVR
View on GitHub
[NeurIPS 2025] Reinforcement Learning for Reasoning in Large Language Models with One Training Example
☆444Mar 11, 2026Updated 4 months ago
ScalingIntelligence / CATS
View on GitHub
☆33Nov 11, 2024Updated last year
elaine-sui / TPS
View on GitHub
☆24Dec 15, 2025Updated 7 months ago