PeRL: Parameter-Efficient Reinforcement Learning
☆75Apr 21, 2026Updated last week
Alternatives and similar repositories for PeRL
Users that are interested in PeRL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of <Model Merging with Functional Dual Anchors>☆47Nov 23, 2025Updated 5 months ago
- ☆35Nov 11, 2025Updated 5 months ago
- Reasoning Activation in LLMs via Small Model Transfer (NeurIPS 2025)☆22Oct 16, 2025Updated 6 months ago
- Training tiny models to prove hard theorems☆77Mar 5, 2026Updated last month
- Code for paper Almost-Orthogonal Layers for Efficient General-Purpose Lipschitz Networks☆13Aug 9, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [NeurIPS 2025] Official code for "Tropical Attention: Neural Algorithmic Reasoning for Combinatorial Algorithms"☆29Oct 23, 2025Updated 6 months ago
- GeoZarr extension for OpenLayers☆12Jun 27, 2024Updated last year
- A framework aiming to bridge fast robot prototyping, predefined motion primitives, heterogeneous teleoperation, data collection, and flex…☆26Apr 4, 2026Updated 3 weeks ago
- RWKV-7 mini☆12Mar 29, 2025Updated last year
- A large-scale RWKV v7(World, PRWKV, Hybrid-RWKV) inference. Capable of inference by combining multiple states(Pseudo MoE). Easy to deploy…☆49Oct 21, 2025Updated 6 months ago
- Source code for SWIFT, an efficient reward model.☆21Jan 13, 2026Updated 3 months ago
- A single-line modification to any (dualizer-based) optimizer that allows the optimizer to adapt to the scale of the gradients as they cha…☆19Jan 11, 2025Updated last year
- A collection of various llm pruning implementations, training code for GPUs & TPUs, and evaluation script.☆63Apr 20, 2026Updated last week
- Dynaseal is a dynamic API key management system designed to secure communications and identity verification for large model services. It …☆12Oct 30, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆28Mar 30, 2026Updated last month
- Internal utility libraries for Pkl☆16Apr 24, 2026Updated last week
- STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function Calling Models☆45Apr 23, 2026Updated last week
- Generative Modeling via Drifting in MLX☆42Feb 6, 2026Updated 2 months ago
- Multilingual and Multiculture Benchmark and LLM☆33Apr 23, 2026Updated last week
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards☆36Oct 3, 2025Updated 7 months ago
- Code for the paper "Function-Space Learning Rates"☆24Jun 3, 2025Updated 11 months ago
- Implementation of the paper "In-context Time Series Predictor" (ICLR 2025)☆15Feb 11, 2025Updated last year
- ☆548Mar 30, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆35Nov 11, 2025Updated 5 months ago
- Rethinking the Trust Region in LLM Reinforcement Learning☆53Mar 2, 2026Updated 2 months ago
- A 20M RWKV v6 can do nonogram☆13Oct 18, 2024Updated last year
- ☆37Apr 21, 2026Updated last week
- GEMS: Agent-Native Multimodal Generation with Memory and Skills☆123Apr 1, 2026Updated last month
- Using PyTorch autograd to compute Hessian of Perplexity for Large Language Models☆29Apr 17, 2025Updated last year
- Multitask NLU architecture for text and token classification tasks.☆14Jan 7, 2023Updated 3 years ago
- The training codes of Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 5 months ago
- Our solutions to Putnam 2025.☆96Jan 9, 2026Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Convert MathML to Latex for OneNote to Markdown☆13Mar 17, 2026Updated last month
- A model for unsupervised morphological analysis that integrates orthographic and semantic views of words.☆13Oct 10, 2023Updated 2 years ago
- Yuan3.0: Mixture-of-Experts (MoE) Language Model☆184Apr 7, 2026Updated 3 weeks ago
- Implementation of the Adaptive Resonance Theory (ART) architectures - Fuzzy ART and Fuzzy ARTMAP - for pattern recognition☆11Jan 6, 2019Updated 7 years ago
- P1: Mastering Physics Olympiads with Reinforcement Learning☆84Dec 29, 2025Updated 4 months ago
- The elegant integration of huggingface/nlp and fastai2 and handy transforms using pure huggingface/nlp☆19Oct 6, 2020Updated 5 years ago
- ☆51Oct 28, 2024Updated last year