thunlp / ToolLearningPapers
☆900Updated 9 months ago
Alternatives and similar repositories for ToolLearningPapers:
Users that are interested in ToolLearningPapers are comparing it to the libraries listed below
- papers related to LLM-agent that published on top conferences☆314Updated last week
- [ACL 2024] A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future☆441Updated 3 months ago
- LongBench v2 and LongBench (ACL 2024)☆845Updated 3 months ago
- Paper collection on building and evaluating language model agents via executable language grounding☆352Updated 11 months ago
- Secrets of RLHF in Large Language Models Part I: PPO☆1,357Updated last year
- [ACL 2023] Reasoning with Language Model Prompting: A Survey☆952Updated 2 weeks ago
- AgentTuning: Enabling Generalized Agent Abilities for LLMs☆1,425Updated last year
- Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large …☆1,010Updated 5 months ago
- [NIPS2023] RRHF & Wombat☆806Updated last year
- ☆913Updated 11 months ago
- Open Academic Research on Improving LLaMA to SOTA LLM☆1,618Updated last year
- A collection of phenomenons observed during the scaling of big foundation models, which may be developed into consensus, principles, or l…☆279Updated last year
- Papers and Datasets on Instruction Tuning and Following. ✨✨✨☆490Updated last year
- Awesome-LLM-Eval: a curated list of tools, datasets/benchmark, demos, leaderboard, papers, docs and models, mainly for Evaluation on LLMs…☆514Updated 6 months ago
- Paper List for In-context Learning 🌷☆853Updated 6 months ago
- Aligning Large Language Models with Human: A Survey☆727Updated last year
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)☆616Updated 3 months ago
- We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tunin…☆2,734Updated last year
- This is the repository for the Tool Learning survey.☆359Updated last month
- ☆518Updated 3 months ago
- Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback☆1,449Updated 10 months ago
- ☆459Updated 10 months ago
- GPT-Fathom is an open-source and reproducible LLM evaluation suite, benchmarking 10+ leading open-source and closed-source LLMs as well a…☆347Updated last year
- Collaborative Training of Large Language Models in an Efficient Way☆415Updated 7 months ago
- The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".☆1,516Updated 3 weeks ago
- Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models"☆1,154Updated last year
- A trend starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".☆2,033Updated last year
- Large Language Models Are Reasoning Teachers (ACL 2023)☆332Updated last month
- An Awesome Collection for LLM Survey☆338Updated 2 weeks ago
- ☆749Updated 10 months ago