PeRL: Parameter-Efficient Reinforcement Learning
☆73Mar 10, 2026Updated 2 weeks ago
Alternatives and similar repositories for PeRL
Users that are interested in PeRL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Model Merging with Functional Dual Anchors☆47Nov 23, 2025Updated 4 months ago
- ☆34Nov 11, 2025Updated 4 months ago
- Training tiny models to prove hard theorems☆64Mar 5, 2026Updated 2 weeks ago
- Reasoning Activation in LLMs via Small Model Transfer (NeurIPS 2025)☆22Oct 16, 2025Updated 5 months ago
- Meta-Curriculum Learning for Domain Adaptation in Neural Machine Translation (AAAI 2021)☆25Jun 18, 2022Updated 3 years ago
- [NeurIPS 2025] Official code for "Tropical Attention: Neural Algorithmic Reasoning for Combinatorial Algorithms"☆27Oct 23, 2025Updated 5 months ago
- Combining SOAP and MUON☆19Feb 11, 2025Updated last year
- GeoZarr extension for OpenLayers☆12Jun 27, 2024Updated last year
- A framework aiming to bridge fast robot prototyping, predefined motion primitives, heterogeneous teleoperation, data collection, and flex…☆23Mar 2, 2026Updated 3 weeks ago
- RWKV-7 mini☆12Mar 29, 2025Updated 11 months ago
- A large-scale RWKV v7(World, PRWKV, Hybrid-RWKV) inference. Capable of inference by combining multiple states(Pseudo MoE). Easy to deploy…☆48Oct 21, 2025Updated 5 months ago
- A collection of various llm pruning implementations, training code for GPUs & TPUs, and evaluation script.☆62Feb 18, 2026Updated last month
- Source code for SWIFT, an efficient reward model.☆19Jan 13, 2026Updated 2 months ago
- A single-line modification to any (dualizer-based) optimizer that allows the optimizer to adapt to the scale of the gradients as they cha…☆19Jan 11, 2025Updated last year
- ☆24Dec 11, 2024Updated last year
- Internal utility libraries for Pkl☆16Mar 10, 2026Updated last week
- ☆25Mar 14, 2026Updated last week
- Advantage Alignment Algorithms (ICLR 2025 oral)☆17Apr 7, 2025Updated 11 months ago
- [CVPR2024] LeGO: Leveraging a Surface Deformation Network for Animatable Stylized Face Generation with One Example☆13Jun 3, 2024Updated last year
- ☆30Jan 15, 2026Updated 2 months ago
- STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function Calling Models☆40Updated this week
- Generative Modeling via Drifting in MLX☆42Feb 6, 2026Updated last month
- ☆32Mar 13, 2026Updated last week
- ☆507Feb 27, 2026Updated 3 weeks ago
- Official codebase for our paper "Do Language Models Use Their Depth Efficiently?"☆29Jun 25, 2025Updated 8 months ago
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards☆37Oct 3, 2025Updated 5 months ago
- TPU support for the fastai library☆13Apr 15, 2021Updated 4 years ago
- Rethinking the Trust Region in LLM Reinforcement Learning☆50Mar 2, 2026Updated 3 weeks ago
- ☆40Aug 6, 2025Updated 7 months ago
- The official repository of the first version of ACE-Brain foundation model.☆65Mar 13, 2026Updated last week
- The training codes of Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 4 months ago
- ☆32Oct 23, 2025Updated 5 months ago
- Crosslingual Reasoning through Test-Time Scaling☆19May 13, 2025Updated 10 months ago
- P1: Mastering Physics Olympiads with Reinforcement Learning☆79Dec 29, 2025Updated 2 months ago
- Yuan3.0: Mixture-of-Experts (MoE) Language Model☆175Feb 27, 2026Updated 3 weeks ago
- Implementation of the Adaptive Resonance Theory (ART) architectures - Fuzzy ART and Fuzzy ARTMAP - for pattern recognition☆11Jan 6, 2019Updated 7 years ago
- ☆51Oct 28, 2024Updated last year
- The elegant integration of huggingface/nlp and fastai2 and handy transforms using pure huggingface/nlp☆19Oct 6, 2020Updated 5 years ago
- Repository for some of the experiments presented in the paper "Deep Learning Alternatives of the Kolmogorov Superposition Theorem", accep…☆22Mar 28, 2025Updated 11 months ago