PeRL: Parameter-Efficient Reinforcement Learning
☆79 · May 20, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for PeRL
Users who are interested in PeRL are comparing it to the libraries listed below.
- Reasoning Activation in LLMs via Small Model Transfer (NeurIPS 2025) ☆22 · Oct 16, 2025 · Updated 7 months ago
- Github Repository for the HOI4 ULTRA Project. ☆11 · Jun 6, 2026 · Updated last week
- Training tiny models to prove hard theorems ☆77 · Mar 5, 2026 · Updated 3 months ago
- Meta-Curriculum Learning for Domain Adaptation in Neural Machine Translation (AAAI 2021) ☆25 · Jun 18, 2022 · Updated 3 years ago
- Code for paper Almost-Orthogonal Layers for Efficient General-Purpose Lipschitz Networks ☆13 · Aug 9, 2022 · Updated 3 years ago
- [NeurIPS 2025] Official code for "Tropical Attention: Neural Algorithmic Reasoning for Combinatorial Algorithms" ☆30 · May 21, 2026 · Updated 3 weeks ago
- Combining SOAP and MUON ☆22 · Feb 11, 2025 · Updated last year
- A large-scale RWKV v7(World, PRWKV, Hybrid-RWKV) inference. Capable of inference by combining multiple states(Pseudo MoE). Easy to deploy… ☆49 · Oct 21, 2025 · Updated 7 months ago
- Twisted client library for AMQP (tested against RabbitMQ). This is a mirror and fork of the launchpad project: https://launchpad.net/txam… ☆18 · May 23, 2012 · Updated 14 years ago
- A collection of various llm pruning implementations, training code for GPUs & TPUs, and evaluation script. ☆63 · Apr 20, 2026 · Updated last month
- Dynaseal is a dynamic API key management system designed to secure communications and identity verification for large model services. It … ☆12 · Oct 30, 2024 · Updated last year
- ☆31 · Mar 30, 2026 · Updated 2 months ago
- Internal utility libraries for Pkl ☆16 · Jun 4, 2026 · Updated last week
- A framework aiming to bridge fast robot prototyping, predefined motion primitives, heterogeneous teleoperation, data collection, and flex… ☆27 · Apr 4, 2026 · Updated 2 months ago
- [CVPR2024] LeGO: Leveraging a Surface Deformation Network for Animatable Stylized Face Generation with One Example ☆13 · Jun 3, 2024 · Updated 2 years ago
- Generative Modeling via Drifting in MLX ☆43 · Feb 6, 2026 · Updated 4 months ago
- Official codebase for our paper "Do Language Models Use Their Depth Efficiently?" ☆29 · Jun 25, 2025 · Updated 11 months ago
- STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function Calling Models ☆49 · Apr 23, 2026 · Updated last month
- Advantage Alignment Algorithms (ICLR 2025 oral) ☆20 · Apr 7, 2025 · Updated last year
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards ☆36 · Oct 3, 2025 · Updated 8 months ago
- TPU support for the fastai library ☆14 · Apr 15, 2021 · Updated 5 years ago
- Code for the paper "Function-Space Learning Rates" ☆24 · Jun 3, 2025 · Updated last year
- ☆585 · May 24, 2026 · Updated 3 weeks ago
- A 20M RWKV v6 can do nonogram ☆13 · Oct 18, 2024 · Updated last year
- ☆42 · Apr 8, 2026 · Updated 2 months ago
- Multitask NLU architecture for text and token classification tasks. ☆14 · Jan 7, 2023 · Updated 3 years ago
- The training codes of Jasper-Token-Compression-600M ☆19 · Nov 19, 2025 · Updated 6 months ago
- ☆44 · Apr 28, 2026 · Updated last month
- Crosslingual Reasoning through Test-Time Scaling ☆20 · May 13, 2025 · Updated last year
- ☆35 · Oct 23, 2025 · Updated 7 months ago
- Tokenizing Chinese corpora with Sentencepiece ☆13 · Nov 30, 2023 · Updated 2 years ago
- Implementation of the Adaptive Resonance Theory (ART) architectures - Fuzzy ART and Fuzzy ARTMAP - for pattern recognition ☆11 · Jan 6, 2019 · Updated 7 years ago
- ☆50 · Oct 28, 2024 · Updated last year
- P1: Mastering Physics Olympiads with Reinforcement Learning ☆86 · Dec 29, 2025 · Updated 5 months ago
- Self Evolving Large Multimodal Models with Continuous Rewards ☆24 · Updated this week
- ROSA+: RWKV's ROSA implementation with fallback statistical predictor ☆36 · Oct 13, 2025 · Updated 8 months ago
- Yuan3.0: Mixture-of-Experts (MoE) Language Model ☆187 · Apr 7, 2026 · Updated 2 months ago
- Repository for some of the experiments presented in the paper "Deep Learning Alternatives of the Kolmogorov Superposition Theorem", Spotl… ☆22 · Mar 28, 2025 · Updated last year
- ☆22 · Dec 28, 2024 · Updated last year