JungHoyoun / PromptCompressorLinks
β12Updated last year
Alternatives and similar repositories for PromptCompressor
Users that are interested in PromptCompressor are comparing it to the libraries listed below
Sorting:
- [EMNLP 2025] Verification Engineering for RL in Instruction Followingβ44Updated 2 months ago
- Official Code Repository for [AutoScaleπ: Scale-Aware Data Mixing for Pre-Training LLMs] Published as a conference paper at **COLM 2025*β¦β12Updated 4 months ago
- Offical code repository for PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation, EMNLP 2023β12Updated 2 years ago
- β22Updated last year
- β13Updated last year
- Source code of paper: Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learningβ44Updated 5 months ago
- Methods and evaluation for aligning language models temporallyβ30Updated last year
- The implementation for CIKM 2024: Towards Completeness-Oriented Tool Retrieval for Large Language Models.β23Updated last year
- self-adaptive in-context learningβ45Updated 2 years ago
- [NAACL 2024] A Synthetic, Scalable and Systematic Evaluation Suite for Large Language Modelsβ33Updated last year
- [ICLR'24 spotlight] Tool-Augmented Reward Modelingβ51Updated 6 months ago
- [NAACL 2024] Making Language Models Better Tool Learners with Execution Feedbackβ42Updated last year
- [EMNLP 2025] WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learningβ63Updated last month
- [ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models (LLMs + MCTS + Self-Improvement)β49Updated 2 years ago
- [NeurIPS'24] Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Modelsβ64Updated last year
- The code and data for the paper JiuZhang3.0β49Updated last year
- [ICML'2024] Can AI Assistants Know What They Don't Know?β85Updated last year
- A comrephensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward modelβ¦β60Updated 6 months ago
- β18Updated last year
- β30Updated 11 months ago
- [EMNLP 2023] Knowledge Rumination for Pre-trained Language Modelsβ17Updated 2 years ago
- β38Updated 4 months ago
- β87Updated last year
- [NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$β50Updated last year
- Repo for outstanding paper@ACL 2023 "Do PLMs Know and Understand Ontological Knowledge?"β33Updated 2 years ago
- The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"β13Updated 7 months ago
- [NAACL 2025] The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language Mβ¦β29Updated last year
- β108Updated 5 months ago
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"β38Updated last year
- The source code of "Merging Experts into One: Improving Computational Efficiency of Mixture of Experts (EMNLP 2023)":β44Updated last year