PeRL: Parameter-Efficient Reinforcement Learning
☆74Apr 6, 2026Updated last week
Alternatives and similar repositories for PeRL
Users that are interested in PeRL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Reasoning Activation in LLMs via Small Model Transfer (NeurIPS 2025)☆22Oct 16, 2025Updated 5 months ago
- Code for paper Almost-Orthogonal Layers for Efficient General-Purpose Lipschitz Networks☆13Aug 9, 2022Updated 3 years ago
- [NeurIPS 2025] Official code for "Tropical Attention: Neural Algorithmic Reasoning for Combinatorial Algorithms"☆27Oct 23, 2025Updated 5 months ago
- Combining SOAP and MUON☆19Feb 11, 2025Updated last year
- A framework aiming to bridge fast robot prototyping, predefined motion primitives, heterogeneous teleoperation, data collection, and flex…☆26Apr 4, 2026Updated last week
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- A large-scale RWKV v7(World, PRWKV, Hybrid-RWKV) inference. Capable of inference by combining multiple states(Pseudo MoE). Easy to deploy…☆48Oct 21, 2025Updated 5 months ago
- Source code for SWIFT, an efficient reward model.☆20Jan 13, 2026Updated 3 months ago
- A single-line modification to any (dualizer-based) optimizer that allows the optimizer to adapt to the scale of the gradients as they cha…☆19Jan 11, 2025Updated last year
- A collection of various llm pruning implementations, training code for GPUs & TPUs, and evaluation script.☆63Feb 18, 2026Updated last month
- Dynaseal is a dynamic API key management system designed to secure communications and identity verification for large model services. It …☆12Oct 30, 2024Updated last year
- ☆26Mar 30, 2026Updated 2 weeks ago
- Advantage Alignment Algorithms (ICLR 2025 oral)☆18Apr 7, 2025Updated last year
- STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function Calling Models☆42Mar 23, 2026Updated 3 weeks ago
- Generative Modeling via Drifting in MLX☆42Feb 6, 2026Updated 2 months ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging. Arxiv, 2024.☆16Oct 28, 2024Updated last year
- Official codebase for our paper "Do Language Models Use Their Depth Efficiently?"☆29Jun 25, 2025Updated 9 months ago
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards☆37Oct 3, 2025Updated 6 months ago
- ☆532Mar 30, 2026Updated last week
- Code for the paper "Function-Space Learning Rates"☆25Jun 3, 2025Updated 10 months ago
- Rethinking the Trust Region in LLM Reinforcement Learning☆51Mar 2, 2026Updated last month
- ☆33Nov 11, 2025Updated 5 months ago
- ☆21Dec 3, 2025Updated 4 months ago
- A 20M RWKV v6 can do nonogram☆14Oct 18, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆41Updated this week
- ☆35Mar 13, 2026Updated last month
- Using PyTorch autograd to compute Hessian of Perplexity for Large Language Models☆29Apr 17, 2025Updated 11 months ago
- The training codes of Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 4 months ago
- ☆33Oct 23, 2025Updated 5 months ago
- ShowUI-π: Flow-based Generative Models as GUI Dexterous Hands☆108Mar 27, 2026Updated 2 weeks ago
- A curated list of research papers and resources on Cultural LLM.☆52Sep 26, 2024Updated last year
- P1: Mastering Physics Olympiads with Reinforcement Learning☆82Dec 29, 2025Updated 3 months ago
- Implementation of the Adaptive Resonance Theory (ART) architectures - Fuzzy ART and Fuzzy ARTMAP - for pattern recognition☆11Jan 6, 2019Updated 7 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [CVPR 2026 (Findings) 🔥🔥] Self Evolving Large Multimodal Models with Continuous Rewards☆21Mar 5, 2026Updated last month
- ☆51Oct 28, 2024Updated last year
- ROSA+: RWKV's ROSA implementation with fallback statistical predictor☆33Oct 13, 2025Updated 6 months ago
- Cross-lingual Language Model (XLM) pretraining and Model-Agnostic Meta-Learning (MAML) for fast adaptation of deep networks☆20Mar 26, 2021Updated 5 years ago
- ☆60Mar 2, 2026Updated last month
- The official repository of the first version of ACE-Brain foundation model.☆72Mar 13, 2026Updated last month
- ☆18Aug 14, 2024Updated last year