plm-team / PLM
PLM: Efficient Peripheral Language Models Hardware-Co-Designed for Ubiquitous Computing
☆18 Updated last month
Alternatives and similar repositories for PLM
Users who are interested in PLM are comparing it to the repositories listed below.
- ☆47 Updated 5 months ago
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization ☆36 Updated 2 months ago
- ☆15 Updated 7 months ago
- ☆45 Updated 3 months ago
- ☆22 Updated 10 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ☆97 Updated 6 months ago
- Code for paper: Long cOntext aliGnment via efficient preference Optimization ☆13 Updated 3 months ago
- rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking ☆38 Updated 4 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper ☆32 Updated 11 months ago
- ☆38 Updated 4 months ago
- ☆63 Updated last week
- ☆36 Updated last month
- ☆64 Updated last month
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction ☆69 Updated last month
- Codebase for Instruction Following without Instruction Tuning ☆34 Updated 7 months ago
- Official Repository of Are Your LLMs Capable of Stable Reasoning? ☆25 Updated last month
- Repo for "Z1: Efficient Test-time Scaling with Code" ☆59 Updated last month
- This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems ☆76 Updated last month
- ☆27 Updated 2 months ago
- [NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejection ☆44 Updated 6 months ago
- [ICLR 2025] Benchmarking Agentic Workflow Generation ☆92 Updated 2 months ago
- ☆24 Updated last month
- [preprint] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA. ☆44 Updated last week
- official implementation of paper "Process Reward Model with Q-value Rankings" ☆57 Updated 3 months ago
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent ☆48 Updated this week
- PGRAG ☆48 Updated 10 months ago
- FastCuRL: Curriculum Reinforcement Learning with Progressive Context Extension for Efficient Training R1-like Reasoning Models ☆43 Updated last month
- SCOPE: Optimizing KV Cache Compression in Long-context Generation ☆23 Updated last week
- ☆12 Updated this week
- ☆17 Updated 4 months ago