Infini-AI-Lab / GRESOLinks

☆69

Alternatives and similar repositories for GRESO

Users that are interested in GRESO are comparing it to the libraries listed below

Sorting:

Infini-AI-Lab / Kinetics
Kinetics: Rethinking Test-Time Scaling Laws
☆82Updated 4 months ago
Infini-AI-Lab / Multiverse
☆104Updated 2 months ago
sail-sg / SkyLadder
The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling
☆40Updated last month
ryoungj / BoLT
Code for "Reasoning to Learn from Latent Thoughts"
☆122Updated 8 months ago
UCSB-NLP-Chang / ThinkPrune
☆45Updated 2 months ago
bethgelab / sober-reasoning
A Sober Look at Language Model Reasoning
☆89Updated 2 weeks ago
uservan / ThinkPO
☆17Updated 4 months ago
kamanphoebe / Look-into-MoEs
[NAACL 2025] A Closer Look into Mixture-of-Experts in Large Language Models
☆55Updated 9 months ago
Gen-Verse / CURE
[NeurIPS 2025 Spotlight] Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning
☆135Updated 2 months ago
Leooyii / LCEG
Long Context Extension and Generalization in LLMs
☆62Updated last year
PKU-ML / LongPPL
Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?"
☆105Updated last month
LAMDASZ-ML / Self-Backtracking
☆51Updated 9 months ago
sail-sg / scaling-with-vocab
[NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623
☆89Updated last year
sail-sg / VeriFree
Reinforcing General Reasoning without Verifiers
☆92Updated 5 months ago
sail-sg / AnytimeReasoner
Optimizing Anytime Reasoning via Budget Relative Policy Optimization
☆48Updated 4 months ago
Linking-ai / SCOPE
(ACL 2025 oral) SCOPE: Optimizing KV Cache Compression in Long-context Generation
☆33Updated 6 months ago
Infini-AI-Lab / S2FT
☆19Updated 11 months ago
sail-sg / LongSpec
LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification
☆68Updated 4 months ago
holarissun / RewardModelingBeyondBradleyTerry
official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and…
☆69Updated 8 months ago
hamishivi / automated-instruction-selection
Exploration of automated dataset selection approaches at large scales.
☆50Updated 9 months ago
HKUNLP / critic-rl
[ICML 2025] Teaching Language Models to Critique via Reinforcement Learning
☆118Updated 6 months ago
open-compass / GPassK
[ACL 2025] Are Your LLMs Capable of Stable Reasoning?
☆31Updated 4 months ago
OpenSparseLLMs / Linear-MoE
☆120Updated 5 months ago
OpenSparseLLMs / Linearization
☆61Updated 4 months ago
sail-sg / feedback-conditional-policy
Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards"
☆53Updated 2 months ago
alessiodevoto / l2compress
Code for the EMNLP24 paper "A simple and effective L2 norm based method for KV Cache compression."
☆17Updated 11 months ago
TIGER-AI-Lab / AceCoder
The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis" [ACL25]
☆94Updated 7 months ago
sail-sg / Attention-Sink
[ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)
☆142Updated 4 months ago
hkust-nlp / model-task-align-rl
The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".
☆15Updated 3 months ago
Kwai-Klear / RLEP
RL with Experience Replay
☆49Updated 4 months ago