shizhl / Confucius
Official code for AAAI2023 paper`Confucius: Iterative Tool Learning from Introspection Feedback by Easy-to-Difficult Curriculum`
☆17Updated last month
Alternatives and similar repositories for Confucius:
Users that are interested in Confucius are comparing it to the libraries listed below
- [ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios☆54Updated last year
- ☆33Updated last month
- A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.☆135Updated 3 weeks ago
- Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference)☆56Updated 5 months ago
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆65Updated 3 months ago
- ☆21Updated last month
- A Survey on the Honesty of Large Language Models☆56Updated 3 months ago
- The awesome agents in the era of large language models☆59Updated last year
- ☆48Updated last month
- ☆22Updated this week
- The code and data of DPA-RAG☆58Updated 2 months ago
- The repo for In-context Autoencoder☆117Updated 10 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆110Updated 6 months ago
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning☆161Updated last year
- ☆43Updated 5 months ago
- A Survey on Efficient Reasoning for LLMs☆116Updated this week
- [SIGIR'24] The official implementation code of MOELoRA.☆153Updated 8 months ago
- Evaluating the Ripple Effects of Knowledge Editing in Language Models☆54Updated 11 months ago
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…☆153Updated this week
- ☆131Updated 8 months ago
- [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"☆37Updated 3 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆55Updated last month
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct☆165Updated 2 months ago
- The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language Models as Agen…☆24Updated last year
- [ACL 2024] This is the code repo for our ACL‘24 paper "MARVEL: Unlocking the Multi-Modal Capability of Dense Retrieval via Visual Module …☆36Updated 8 months ago
- TokenSkip: Controllable Chain-of-Thought Compression in LLMs☆98Updated 2 weeks ago
- ☆24Updated last year
- Code for ACL 2024 accepted paper titled "SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language …☆33Updated 2 months ago
- 测试 https://huggingface.co/OFA-Sys/gsm8k-rft-llama7b-u13b 的 GSM8K 分数☆15Updated last year
- ☆80Updated last year