Ellenzzn / PersLLM
☆10 Updated 2 months ago
Alternatives and similar repositories for PersLLM:
Users interested in PersLLM are comparing it to the libraries listed below:
- Implementation of AdaCQR (COLING 2025) ☆10 Updated 3 months ago
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment" ☆73 Updated 9 months ago
- [ACL 2024] Code for "MoPS: Modular Story Premise Synthesis for Open-Ended Automatic Story Generation" ☆35 Updated 8 months ago
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages ☆44 Updated 3 months ago
- [NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI ☆97 Updated 3 weeks ago
- ☆43 Updated 5 months ago
- Code for Paper: Teaching Language Models to Critique via Reinforcement Learning ☆84 Updated last month
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems. ☆56 Updated 8 months ago
- This is the official project of paper: Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conver… ☆18 Updated 4 months ago
- The rule-based evaluation subset and code implementation of Omni-MATH ☆18 Updated 3 months ago
- Evaluate the Quality of Critique ☆34 Updated 10 months ago
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L… ☆46 Updated 9 months ago
- Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue (ACL 2024) ☆23 Updated 7 months ago
- 🌐 WebThinker: Empowering Large Reasoning Models with Deep Research Capability ☆88 Updated this week
- ☆26 Updated 10 months ago
- Improving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073 ☆28 Updated 8 months ago
- Code for ACL 2024 paper "Soft Self-Consistency Improves Language Model Agents" ☆18 Updated 6 months ago
- 🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts ☆38 Updated 6 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling ☆46 Updated 3 months ago
- ☆20 Updated 8 months ago
- [EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing". ☆74 Updated 2 months ago
- RACE is a multi-dimensional benchmark for code generation that focuses on Readability, mAintainability, Correctness, and Efficiency. ☆10 Updated 5 months ago
- ☆59 Updated 7 months ago
- We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs. ☆60 Updated 5 months ago
- [NeurIPS 2024] A comprehensive benchmark for evaluating critique ability of LLMs ☆39 Updated 4 months ago
- [ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?" ☆70 Updated 4 months ago
- [Preprint] An inference-time decoding strategy with adaptive foresight sampling ☆86 Updated last week
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024) ☆61 Updated 10 months ago
- [ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios ☆53 Updated last year
- This is for EMNLP 2024 Paper: AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction ☆11 Updated 4 months ago