SalesforceAIResearch/Elastic-Reasoning

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/SalesforceAIResearch/Elastic-Reasoning)

SalesforceAIResearch / Elastic-Reasoning

Make reasoning models scalable

☆51

Alternatives and similar repositories for Elastic-Reasoning

Users that are interested in Elastic-Reasoning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

SalesforceAIResearch / ThinK
View on GitHub
ThinK: Thinner Key Cache by Query-Driven Pruning
☆30Jun 2, 2026Updated last month
BaohaoLiao / frac-cot
View on GitHub
[COLM 2026] An efficient 3D sampling method for long-CoT LLM.
☆16May 25, 2025Updated last year
StarDewXXX / AdaR1
View on GitHub
The official repository of NeurIPS'25 paper "Ada-R1: From Long-Cot to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization"
☆24May 6, 2026Updated 2 months ago
yuhuixu1993 / WLQ
View on GitHub
caffe implementation of single level quantization
☆19Dec 15, 2018Updated 7 years ago
BaohaoLiao / RSD
View on GitHub
[ICML 2025] Reward-guided Speculative Decoding (RSD) for efficiency and effectiveness.
☆56May 2, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
w-yibo / R1-Compress
View on GitHub
[NeurIPS 2025@FoRLM] R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search
☆17Jan 24, 2026Updated 5 months ago
Lucky-Lance / SPP
View on GitHub
[ICML 2024] SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models
☆22May 28, 2024Updated 2 years ago
UCSC-VLAA / Complex-Edit
View on GitHub
Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark
☆29Apr 22, 2025Updated last year
Red-Hat-AI-Innovation-Team / SQuat
View on GitHub
☆22Jun 5, 2025Updated last year
eric-haibin-lin / verl-data
View on GitHub
☆14May 12, 2025Updated last year
UCSC-VLAA / FedConv
View on GitHub
[TMLR'24] This repository includes the official implementation our paper "FedConv: Enhancing Convolutional Neural Networks for Handling D…
☆25Apr 30, 2024Updated 2 years ago
uservan / speculative_thinking
View on GitHub
☆34Oct 13, 2025Updated 9 months ago
KevinLee1110 / dynamic-batching
View on GitHub
The official repo for the paper "Optimizing LLM Inference Throughput via Memory-aware and SLA-constrained Dynamic Batching"
☆18Mar 17, 2025Updated last year
sail-sg / AnytimeReasoner
View on GitHub
Optimizing Anytime Reasoning via Budget Relative Policy Optimization
☆54Jul 15, 2025Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
BAI-LAB / MoE-CL
View on GitHub
[WWW 2026 Oral] MoE-CL:Self-Evolving LLMs via Continual Instruction Tuning
☆21Dec 1, 2025Updated 7 months ago
MiroMindAI / MiroEval
View on GitHub
MiroEval: A benchmark and evaluation framework for deep research agents — 100 tasks (70 text, 30 multimodal) assessed across synthesis qu…
☆46Jul 6, 2026Updated 2 weeks ago
ZHITENGLI / AdaSVD
View on GitHub
PyTorch code for our paper "AdaSVD: Adaptive Singular Value Decomposition for Large Language Models"
☆15Mar 9, 2025Updated last year
callsys / GMPO
View on GitHub
[ICLR 2026] Geometric-Mean Policy Optimization
☆104Jan 26, 2026Updated 5 months ago
ylsung / rsq
View on GitHub
Code for "RSQ: Learning from Important Tokens Leads to Better Quantized LLMs"
☆23Mar 25, 2026Updated 3 months ago
yuhuixu1993 / qa-lora
View on GitHub
Official PyTorch implementation of QA-LoRA
☆147Mar 13, 2024Updated 2 years ago
MingLiiii / ThinkARM
View on GitHub
Schoenfeld’s Anatomy of Mathematical Reasoning by Language Models
☆27Dec 21, 2025Updated 7 months ago
BradMcDanel / sdgp
View on GitHub
☆10Feb 1, 2022Updated 4 years ago
keven980716 / weak-to-strong-deception
View on GitHub
[ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"
☆15Jun 21, 2024Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
aiming-lab / MIRA
View on GitHub
When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought
☆31Feb 14, 2026Updated 5 months ago
THU-KEG / AdaptThink
View on GitHub
☆186Dec 5, 2025Updated 7 months ago
FFTYYY / RaanA
View on GitHub
Implementation of "RaanA: A Fast, Flexible, and Data-Efficient Post-Training Quantization Algorithm"
☆17Apr 11, 2025Updated last year
zer0int / CLIP-ViT-visualization
View on GitHub
What do CLIP Vision Transformers learn? Feature Visualization can show you!
☆15Aug 29, 2024Updated last year
danczs / NetworkAdjustment
View on GitHub
☆12Jul 7, 2021Updated 5 years ago
VainF / Thinkless
View on GitHub
[NeurIPS 2025] Thinkless: LLM Learns When to Think
☆261Sep 26, 2025Updated 9 months ago
zzwjames / FailureLLMUnlearning
View on GitHub
An official implementation of "Catastrophic Failure of LLM Unlearning via Quantization" (ICLR 2025)
☆39Feb 22, 2025Updated last year
ipsitmantri / DiTASK
View on GitHub
DiTASK: Multi-Task Fine-Tuning with Diffeomorphic Transformations (CVPR 2025)
☆14Jun 1, 2025Updated last year
StarDewXXX / Awesome-Hybrid-CoT-Reasoning
View on GitHub
☆62Jun 7, 2025Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
slm-mux / SLM-MUX
View on GitHub
☆25Mar 26, 2026Updated 3 months ago
MLSysOps / Code-Agent-Survey
View on GitHub
A survey of Code Agents / Foundation Models for improving development productivity. Become 10x SWE, MLE, etc.
☆22Aug 20, 2024Updated last year
ruikangliu / Quantized-Reasoning-Models
View on GitHub
[COLM 2025] Official PyTorch implementation of "Quantization Hurts Reasoning? An Empirical Study on Quantized Reasoning Models"
☆77Jul 8, 2025Updated last year
ScienceOne-AI / AutoThink
View on GitHub
AutoThink is a reinforcement learning framework designed to equip R1-style language models with adaptive reasoning capabilities. Instead …
☆52Oct 14, 2025Updated 9 months ago
smiles724 / MNPO
View on GitHub
The official code of Multi-player Nash Preference Optimization [ICLR 2026]
☆35Feb 4, 2026Updated 5 months ago
yaof20 / DenseMixer
View on GitHub
Official implementation for DenseMixer: Improving MoE Post-Training with Precise Router Gradient
☆68Aug 3, 2025Updated 11 months ago
xlang-ai / OSWorld-G
View on GitHub
[NeurIPS 2025 Spotlight] Scaling Computer-Use Grounding via UI Decomposition and Synthesis
☆172Jun 18, 2026Updated last month