PeRL: Parameter-Efficient Reinforcement Learning
☆71Feb 23, 2026Updated last week
Alternatives and similar repositories for PeRL
Users that are interested in PeRL are comparing it to the libraries listed below
Sorting:
- Model Merging with Functional Dual Anchors☆45Nov 23, 2025Updated 3 months ago
- ☆34Nov 11, 2025Updated 3 months ago
- Reasoning Activation in LLMs via Small Model Transfer (NeurIPS 2025)☆21Oct 16, 2025Updated 4 months ago
- A collection of various llm pruning implementations, training code for GPUs & TPUs, and evaluation script.☆61Feb 18, 2026Updated last week
- Github Repository for the HOI4 ULTRA Project.☆11Updated this week
- Yuan3.0: Mixture-of-Experts (MoE) Language Model☆88Jan 9, 2026Updated last month
- ☆468Feb 22, 2026Updated last week
- Meta-Curriculum Learning for Domain Adaptation in Neural Machine Translation (AAAI 2021)☆25Jun 18, 2022Updated 3 years ago
- ThinkGen: Generalized Thinking for Visual Generation☆51Dec 30, 2025Updated 2 months ago
- ☆69Nov 5, 2025Updated 3 months ago
- Open Source Replication of Anthropic's Alignment Faking Paper☆54Apr 4, 2025Updated 10 months ago
- Kate is Multimodal Live Assistant that ignites your browsing experience☆11Feb 15, 2025Updated last year
- Our solution to Putnam 2025.☆77Jan 9, 2026Updated last month
- ☆10Apr 26, 2023Updated 2 years ago
- Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton☆48Aug 22, 2025Updated 6 months ago
- Minimalist RL for Diffusion LLMs with SOTA reasoning performance (89.1% GSM8K). Official implementation of "The Flexibility Trap".☆126Jan 24, 2026Updated last month
- P1: Mastering Physics Olympiads with Reinforcement Learning☆73Dec 29, 2025Updated 2 months ago
- Awesome Entity Alignment is a collection of EA techniques, including papers, codes, and datasets.☆10Oct 27, 2022Updated 3 years ago
- my first ever browser game☆10Jun 21, 2025Updated 8 months ago
- Communication Relay by creating a WiFi Mesh Network using ROS, and using that network for Data Telemetry, with Telemetry radios ( Ubiquit…☆11Dec 18, 2018Updated 7 years ago
- Official implementation of the paper "LTrack: Generalizing Multiple Object Tracking to Unseen Domains by Introducing Natural Language Rep…☆12Jul 26, 2023Updated 2 years ago
- Project focused on enhancing the quality of low-fidelity endoscopy images using Generative Adversarial Networks (GANs) implemented in PyT…☆17Jun 5, 2025Updated 8 months ago
- [CVPR 2021] FMO Deblurring Benchmark☆13Jan 12, 2022Updated 4 years ago
- [NeurIPS 2025] Official code for "Tropical Attention: Neural Algorithmic Reasoning for Combinatorial Algorithms"☆23Oct 23, 2025Updated 4 months ago
- STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function Calling Models☆26Feb 12, 2026Updated 2 weeks ago
- ☆29Jan 15, 2026Updated last month
- ☆138Feb 13, 2026Updated 2 weeks ago
- Internal utility libraries for Pkl☆15Updated this week
- Image Tokenizer Needs Post-Training☆24Oct 4, 2025Updated 4 months ago
- FamilyTool benchmark☆12Sep 10, 2025Updated 5 months ago
- Introduction to Java Programming, Comprehensive Version, 10th Edition Sample Code☆15Feb 4, 2022Updated 4 years ago
- Companion repository to "Prompt Compression and Contrastive Conditioning for Controllability and Toxicity Reduction in Language Models"☆14May 31, 2023Updated 2 years ago
- ⚙️ Lightweight & smart Bun & Browser configuration loader.☆15Updated this week
- [CVPR'26 Findings] Source code for "RADSeg Unleashing Parameter and Compute Efficient Zero-Shot Open-Vocabulary Segmentation Using Agglom…☆31Feb 24, 2026Updated last week
- [CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition"☆12Feb 27, 2024Updated 2 years ago
- Code for paper Almost-Orthogonal Layers for Efficient General-Purpose Lipschitz Networks☆12Aug 9, 2022Updated 3 years ago
- ☆17Dec 2, 2025Updated 3 months ago
- [ICCV' 23] MRM: Masked Relation Modeling for Medical Image Pre-Training with Genetics☆10Oct 28, 2024Updated last year
- ☆11May 27, 2022Updated 3 years ago