Ellenzzn / PersLLM
☆10 Updated 5 months ago
Alternatives and similar repositories for PersLLM
Users that are interested in PersLLM are comparing it to the libraries listed below
- Implementation of AdaCQR (COLING 2025) ☆10 Updated 5 months ago
- Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting ☆32 Updated last year
- 🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts ☆39 Updated 8 months ago
- ☆12 Updated 5 months ago
- LightThinker: Thinking Step-by-Step Compression ☆59 Updated 2 months ago
- ☆38 Updated 2 months ago
- A scalable automated alignment method for large language models. Resources for "Aligning Large Language Models via Self-Steering Optimiza… ☆18 Updated 7 months ago
- RACE is a multi-dimensional benchmark for code generation that focuses on Readability, mAintainability, Correctness, and Efficiency. ☆10 Updated 8 months ago
- Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment" ☆73 Updated last month
- [ACL 2024] ANAH & [NeurIPS 2024] ANAH-v2 & [ICLR 2025] Mask-DPO ☆50 Updated last month
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning? ☆25 Updated 3 months ago
- Code for ACL 2024 paper "Soft Self-Consistency Improves Language Model Agents" ☆21 Updated 9 months ago
- ☆40 Updated 2 weeks ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling ☆50 Updated 3 weeks ago
- Official completion of “Training on the Benchmark Is Not All You Need”. ☆34 Updated 5 months ago
- [EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs ☆27 Updated last month
- Long Context Extension and Generalization in LLMs ☆57 Updated 9 months ago
- ☆116 Updated last month
- ☆35 Updated 3 months ago
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning ☆99 Updated last month
- Code for paper "W-RAG: Weakly Supervised Dense Retrieval in RAG for Open-domain Question Answering" ☆13 Updated 2 months ago
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization ☆37 Updated 4 months ago
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L… ☆50 Updated last year
- ☆15 Updated 2 months ago
- Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025. ☆22 Updated 4 months ago
- The rule-based evaluation subset and code implementation of Omni-MATH ☆22 Updated 6 months ago
- This is the official project of paper: Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conver… ☆19 Updated 7 months ago
- [ACL 2025] An inference-time decoding strategy with adaptive foresight sampling ☆93 Updated last month
- A project for tri-modal LLM benchmarking and instruction tuning. ☆38 Updated 3 months ago
- [ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios ☆58 Updated last year