MikaStars39/PeRL

PeRL: Parameter-Efficient Reinforcement Learning

☆82

Alternatives and similar repositories for PeRL

Users who are interested in PeRL are comparing it to the repositories listed below.

Linzwcs / echos
Echos is a headless, API-driven DAW engine. It’s the backend for building AI tools that automate the entire music production lifecycle.
☆55 · Nov 10, 2025 · Updated 8 months ago
Sphere-AI-Lab / PEFT-Arena
Official repository of PEFT-Arena: Understanding Parameter-Efficient Finetuning from a Stability-Plasticity Perspective
☆29 · Jun 13, 2026 · Updated last month
Joluck / MiSS
MiSS is a novel PEFT method that features a low-rank structure but introduces a new update mechanism distinct from LoRA, achieving an exc…
☆35 · Mar 9, 2026 · Updated 4 months ago
MikaStars39 / StableMask
PyTorch implementation of StableMask (ICML'24)
☆15 · Jun 27, 2024 · Updated 2 years ago
yynil / RWKVInside
☆41 · Apr 30, 2025 · Updated last year
ssmisya / PolicyShiftGuard
PolicyShiftGuard: Benchmarking and Improving Policy-Adaptive Image Guardrails
☆22 · Jul 8, 2026 · Updated 2 weeks ago
Triang-jyed-driung / rwkv7mini
RWKV-7 mini
☆12 · Mar 29, 2025 · Updated last year
Sphere-AI-Lab / fda
Implementation of <Model Merging with Functional Dual Anchors>
☆46 · Nov 23, 2025 · Updated 8 months ago
RWKV-Vibe / rwkv-fla
Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton
☆50 · Apr 2, 2026 · Updated 3 months ago
circle-hit / SAPT
Code for ACL 2024 accepted paper titled "SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language …
☆40 · Jan 13, 2025 · Updated last year
ZHITENGLI / AdaSVD
PyTorch code for our paper "AdaSVD: Adaptive Singular Value Decomposition for Large Language Models"
☆15 · Mar 9, 2025 · Updated last year
lcqysl / VideoSSR
[CVPR 2026] Official repo for "VideoSSR: Video Self-Supervised Reinforcement Learning"
☆41 · Nov 11, 2025 · Updated 8 months ago
JanTempus / tokenisation_lp
☆15 · May 20, 2026 · Updated 2 months ago
Linzwcs / AFT
☆13 · Jan 22, 2025 · Updated last year
LINs-lab / LIE
[preprint] Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning
☆19 · Feb 18, 2026 · Updated 5 months ago
sileod / reasoning-core
Procedural data generators for verifiable reasoning, synthetic pretraining, post-training, evaluation, and RL.
☆45 · Updated this week
youngjoey-ai / tracerag
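A RAG project that emphasizes engineering rigor, observability, testability, and extensibility. TraceRAG's goal is not merely to "generate" an answer, but to split document ingestion, chunking, vectorization, retrieval, source-attributed answering, evaluation, and subsequent tracing into independently verifiable stages, gradually evolving into a maintainable, explainable, and reviewable production-grade RAG.
☆15 · Apr 2, 2026 · Updated 3 months ago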
thunlp / JustRL
[ICLR 2026 Blogpost Track Poster] JustRL: Scaling a 1.5B LLM with a Simple RL Recipe
☆291 · Jun 29, 2026 · Updated 3 weeks ago
Sphere-AI-Lab / orbit
Stable and Efficient Reinforcement Learning for Trillion-Parameter LLMs
☆148 · Jun 28, 2026 · Updated 3 weeks ago
Trustworthy-ML-Lab / ThinkEdit
[EMNLP 25] An effective and interpretable weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study un…
☆19 · Dec 17, 2025 · Updated 7 months ago
ssmisya / AdaReasoner
[ICLR 2026] The official repository for the paper "AdaReasoner: Dynamic Tool Orchestration for Iterative Visual Reasoning".
☆83 · Feb 27, 2026 · Updated 4 months ago
HypherX / Evolution-Analysis
☆25 · Dec 13, 2024 · Updated last year
edwardmilsom / function-space-learning-rates-paper
Code for the paper "Function-Space Learning Rates"
☆23 · Jun 3, 2025 · Updated last year
Mia-Cong / SWIFT
Official implementation of "Can Test-Time Scaling Improve World Foundation Model?"
☆15 · Jul 12, 2025 · Updated last year
Red-Hat-AI-Innovation-Team / SQuat
☆22 · Jun 5, 2025 · Updated last year
METR / Measuring-Early-2025-AI-on-Exp-OSS-Devs
Measuring the Impact of Early-2025 AI on Experienced Open-Source Developer Productivity: https://metr.org/blog/2025-07-10-early-2025-ai-e…
☆16 · Feb 23, 2026 · Updated 5 months ago
shangshang-wang / Tina
[ICLR 2026] Tina: Tiny Reasoning Models via LoRA
☆338 · Sep 23, 2025 · Updated 10 months ago
Linzwcs / AutoMusicTheoryQA
☆22 · Nov 21, 2025 · Updated 8 months ago
ulab-uiuc / FusionFactory
[TMLR 2026]: "FusionFactory: Fusing LLM Capabilities with Routing Data", Tao Feng, Haozhen Zhang, Zijie Lei, Pengrui Han, Mostofa Patwary…
☆22 · Dec 30, 2025 · Updated 6 months ago
shangshang-wang / Resa
Resa: Transparent Reasoning Models via SAEs
☆50 · Sep 23, 2025 · Updated 10 months ago
jinhangzhan / RL_Heals_SFT
☆21 · Mar 22, 2026 · Updated 4 months ago
olivkoch / TinyRecursiveModels
☆35 · Nov 11, 2025 · Updated 8 months ago
rishabbala / Steering-Vector-Transfer
☆18 · May 2, 2026 · Updated 2 months ago
Kurt232 / RLKV
☆35 · Jun 8, 2026 · Updated last month
Simplified-Reasoning / SU-01
SU-01: Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling
☆94 · May 27, 2026 · Updated last month
NX-AI / xlstm_scaling_laws
Code and data to explore neural scaling laws of xLSTM and Transformer models.
☆23 · Apr 8, 2026 · Updated 3 months ago
RUCAIBox / Passk_Training
The official repository of the paper "Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models"
☆113 · Aug 15, 2025 · Updated 11 months ago
LuckyyySTA / GOLF
☆18 · Mar 16, 2026 · Updated 4 months ago
lhxcs / DVD-Quant
☆17 · Oct 5, 2025 · Updated 9 months ago