Official code for AAAI2023 paper`Confucius: Iterative Tool Learning from Introspection Feedback by Easy-to-Difficult Curriculum`
☆46Feb 9, 2025Updated last year
Alternatives and similar repositories for Confucius
Users that are interested in Confucius are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Companion code to https://arxiv.org/abs/2402.15491☆22Sep 18, 2025Updated 7 months ago
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆73May 13, 2025Updated 11 months ago
- ☆24Feb 27, 2026Updated last month
- [NeurIPS 2025 Spotlight] A Token is Worth over 1,000 Tokens: Efficient Knowledge Distillation through Low-Rank Clone.☆45Oct 29, 2025Updated 5 months ago
- NoMIRACL: A multilingual hallucination evaluation dataset to evaluate LLM robustness in RAG against first-stage retrieval errors on 18 la…☆27Nov 29, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [EMNLP 2024] RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning☆15May 13, 2025Updated 11 months ago
- The awesome agents in the era of large language models☆72Nov 18, 2023Updated 2 years ago
- ☆28Jan 30, 2026Updated 2 months ago
- [ICLR'24] MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use☆112Mar 21, 2024Updated 2 years ago
- Automatic support for (Claude) Skills for any coding agent that supports AGENTS.md☆31Feb 7, 2026Updated 2 months ago
- ☆12Apr 1, 2025Updated last year
- [SIGIR'24] Generative Retrieval as Multi-Vector Dense Retrieval☆36Oct 18, 2024Updated last year
- [ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios☆71Aug 5, 2025Updated 8 months ago
- A list of awesome papers on LLM tool learning.☆28Jul 24, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Compare AI model pricing and performance in a simple interactive web app.☆18Apr 10, 2026Updated last week
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Oct 11, 2025Updated 6 months ago
- Ultimate playbook for unmoderated UX testing☆13Jan 27, 2025Updated last year
- Answering Ambiguous Questions via Iterative Prompting☆14May 25, 2024Updated last year
- ☆13Jun 5, 2023Updated 2 years ago
- ☆11Jun 11, 2024Updated last year
- 基于Roformer的文本相似度☆12Aug 2, 2021Updated 4 years ago
- ☆14Aug 21, 2025Updated 7 months ago
- Chinese Generation Evaluation☆13Aug 14, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Build a level 1 coding agent.☆17Jan 28, 2025Updated last year
- Official repository for ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use☆30Nov 4, 2025Updated 5 months ago
- ☆30Jun 5, 2025Updated 10 months ago
- APIBench is a benchmark for evaluating the performance of API recommendation approaches released in the paper "Revisiting, Benchmarking a…☆66Apr 3, 2023Updated 3 years ago
- An MCP server for Raindrop.io (bookmarking service)☆20Apr 10, 2025Updated last year
- Repository for the paper 'Medical diffusion on a budget: textual inversion for medical image generation'☆12Dec 11, 2024Updated last year
- Code4Bench: A Mutildimensional Benchmark of Codeforces Data for Different Program Analysis Techniques☆17Apr 12, 2019Updated 7 years ago
- HEtero-Assists Distillation for Heterogeneous Object Detectors☆10Jul 3, 2023Updated 2 years ago
- ☆18Mar 19, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [SIGIR24] Pre-training with Bag-of-Word Prediction for Dense Passage Retrieval☆18Feb 29, 2024Updated 2 years ago
- Repository for the Exposing Outlier Exposure paper☆12Aug 20, 2024Updated last year
- insight data engineering fellow project☆16Nov 14, 2016Updated 9 years ago
- ALBench Leaderboard for active learning in object detection☆15Jan 13, 2023Updated 3 years ago
- Effective training of convolutional neural networks for age estimation based on knowledge distillation☆18Jun 7, 2021Updated 4 years ago
- Web-grounded natural language instructions☆18Nov 25, 2024Updated last year
- Evaluating tool-augmented LLMs in conversation settings☆89May 31, 2024Updated last year