PKU-Baichuan-MLSystemLab / PAS
☆51Updated 7 months ago
Alternatives and similar repositories for PAS:
Users who are interested in PAS are comparing it to the libraries listed below
- Llama-3-SynE: A Significantly Enhanced Version of Llama-3 with Advanced Scientific Reasoning and Chinese Language Capabilities | Continued pre-training to improve …☆32Updated 4 months ago
- ☆46Updated 10 months ago
- ☆102Updated 5 months ago
- ☆115Updated 2 weeks ago
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".☆119Updated 6 months ago
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning☆135Updated 4 months ago
- xVerify: Efficient Answer Verifier for Reasoning Model Evaluations☆90Updated 3 weeks ago
- ☆132Updated 3 months ago
- On Memorization of Large Language Models in Logical Reasoning☆65Updated last month
- Official github repo for AutoDetect, an automated weakness detection framework for LLMs.☆42Updated 10 months ago
- ☆143Updated 10 months ago
- The demo, code and data of FollowRAG☆72Updated 2 weeks ago
- Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆72Updated 2 weeks ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆150Updated 8 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32Updated 11 months ago
- This is the reading list for the survey "A Survey on the Optimization of LLM-based Agents". We will keep adding papers and improving the…☆92Updated 3 weeks ago
- ☆149Updated last week
- We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.☆62Updated 6 months ago
- The code for the arXiv paper: "CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis"☆24Updated 4 months ago
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆65Updated 5 months ago
- ☆54Updated 2 months ago
- a-m-team's exploration in large language modeling☆60Updated this week
- A Toolkit for Table-based Question Answering☆112Updated last year
- Code implementation of synthetic continued pretraining☆107Updated 4 months ago
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆52Updated 5 months ago
- This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems☆75Updated last month
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆103Updated last month
- A research repo for experiments about Reinforcement Finetuning☆46Updated last month
- Unleashing the Power of Cognitive Dynamics on Large Language Models☆61Updated 7 months ago
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆244Updated 3 weeks ago