shizhl / Confucius
Official code for AAAI2023 paper`Confucius: Iterative Tool Learning from Introspection Feedback by Easy-to-Difficult Curriculum`
☆17Updated 2 months ago
Alternatives and similar repositories for Confucius:
Users that are interested in Confucius are comparing it to the libraries listed below
- [ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios☆53Updated last year
- ☆26Updated last month
- [NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$☆42Updated 5 months ago
- A Survey on the Honesty of Large Language Models☆57Updated 4 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆111Updated 6 months ago
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆65Updated 4 months ago
- A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.☆140Updated this week
- Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference)☆57Updated 6 months ago
- The awesome agents in the era of large language models☆62Updated last year
- 测试 https://huggingface.co/OFA-Sys/gsm8k-rft-llama7b-u13b 的 GSM8K 分数☆15Updated last year
- [ICML 2024] Selecting High-Quality Data for Training Language Models☆166Updated 10 months ago
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆52Updated 4 months ago
- The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language Models as Agen…☆26Updated last year
- ☆54Updated 6 months ago
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct☆169Updated 3 months ago
- [ICLR'25] DataGen: Unified Synthetic Dataset Generation via Large Language Models☆46Updated last month
- Code for ACL 2024 accepted paper titled "SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language …☆34Updated 3 months ago
- ☆25Updated last year
- [ACL 2024] ANAH & [NeurIPS 2024] ANAH-v2 & [ICLR 2025] Mask-DPO☆43Updated 2 weeks ago
- ☆91Updated last month
- ☆38Updated 2 months ago
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆38Updated last year
- [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"☆42Updated 4 months ago
- The code and data of DPA-RAG☆58Updated 2 months ago
- [NAACL 2024] Making Language Models Better Tool Learners with Execution Feedback☆41Updated last year
- ☆29Updated 2 months ago
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't…☆110Updated 9 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆114Updated 3 weeks ago
- Implementation of "ACL'24: When Do LLMs Need Retrieval Augmentation? Mitigating LLMs’ Overconfidence Helps Retrieval Augmentation"☆22Updated 9 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆65Updated 2 months ago