chengzl18 / LEGENT-dev
☆10Updated this week
Related projects: ⓘ
- Paper List for a new paradigm of NLP: Interactive NLP (https://arxiv.org/abs/2305.13246)☆207Updated last year
- ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models☆148Updated 6 months ago
- Achieving Efficient Alignment through Learned Correction☆103Updated 3 months ago
- ☆25Updated last year
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨☆72Updated 4 months ago
- Awesome papers for role-playing with language models☆88Updated last month
- Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)☆146Updated this week
- ☆158Updated 3 months ago
- Machine Theory of Mind Reading List. Built upon EMNLP Findings 2023 Paper: Towards A Holistic Landscape of Situated Theory of Mind in Lar…☆96Updated 7 months ago
- This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.☆101Updated last month
- Paper collections of the continuous effort start from World Models.☆127Updated 2 months ago
- ☆71Updated 8 months ago
- Code and data for the paper "Can Large Language Models Understand Real-World Complex Instructions?"(AAAI2024)☆42Updated 5 months ago
- ☆246Updated 9 months ago
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models☆208Updated last week
- [ICML 2024] Selecting High-Quality Data for Training Language Models☆134Updated 3 months ago
- Collection of papers for scalable automated alignment.☆49Updated 2 weeks ago
- ☆22Updated 2 months ago
- LongAlign: A Recipe for Long Context Alignment Encompassing Data, Training, and Evaluation☆194Updated 4 months ago
- Feeling confused about super alignment? Here is a reading list☆42Updated 8 months ago
- [ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following☆104Updated 2 months ago
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning☆144Updated 7 months ago
- Reformatted Alignment☆111Updated 4 months ago
- ☆15Updated last week
- ☆94Updated 11 months ago
- A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.☆101Updated this week
- [arxiv:2406.17419]Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA☆62Updated last month
- Do Large Language Models Know What They Don’t Know?☆84Updated 9 months ago
- ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆63Updated 5 months ago
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit…☆61Updated 2 months ago