LAMDASZ-ML / ChinaTravelLinks
ChinaTravel: A Real-World Benchmark for Language Agents in Chinese Travel Planning
☆37Updated last week
Alternatives and similar repositories for ChinaTravel
Users that are interested in ChinaTravel are comparing it to the libraries listed below
Sorting:
- ☆161Updated 2 weeks ago
- ☆410Updated 6 months ago
- Latest Advances on Long Chain-of-Thought Reasoning☆470Updated 3 weeks ago
- A Framework of Continual Learning☆117Updated last month
- ✨✨Latest Advances on Neuro-Symbolic Learning in the era of Large Language Models☆124Updated last month
- [ICML'25 Oral] Multi-agent Architecture Search via Agentic Supernet☆147Updated 2 months ago
- Awesome RL-based LLM Reasoning☆592Updated 3 weeks ago
- A paper list of our recent survey on continual learning, and other useful resources in this field.☆85Updated last year
- Official github repo for SafeDialBench, a comprehensive multi-turn dialogue benchmark to evaluate LLMs' safety.☆34Updated 2 months ago
- Generative AI Act II: Test Time Scaling Drives Cognition Engineering☆202Updated 3 months ago
- Monitor Google Scholar author citation counts and track changes automatically without opening tabs.☆65Updated this week
- 🔥🔥🔥 ICLR 2025 Oral. Automating Agentic Workflow Generation.☆210Updated 2 weeks ago
- A comprehensive framework for benchmarking single and multi-agent systems across a wide range of tasks—evaluating performance, accuracy, …☆31Updated this week
- AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper)☆293Updated last month
- Awesome RL Reasoning Recipes ("Triple R")☆768Updated last month
- ☆67Updated 6 months ago
- A curated list of awesome papers on dataset reduction, including dataset distillation (dataset condensation) and dataset pruning (coreset…☆59Updated 6 months ago
- Based on the learnware paradigm, the learnware package supports the entire process including the submission, usability testing, organizat…☆101Updated 2 months ago
- ICLR 2025 Agent-Related Papers☆71Updated 8 months ago
- Awesome Agent Training☆208Updated this week
- ☆310Updated 2 months ago
- CycleResearcher: Improving Automated Research via Automated Review☆220Updated last month
- ☆78Updated 11 months ago
- ☆255Updated last month
- LLM-Based Human-Agent Collaboration and Interaction Systems: A Survey | Awesome Human-Agent Collaboration | Human-AI Collaboration☆107Updated last week
- Beimingwu is the first systematic open-source implementation of the learnware dock system, providing a preliminary research platform for …☆117Updated last year
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"☆133Updated 3 weeks ago
- [ICML 2025] "From Passive to Active Reasoning: Can Large Language Models Ask the Right Questions under Incomplete Information?"☆31Updated last month
- Latest Advances on System-2 Reasoning☆1,214Updated 2 months ago
- llm & rl☆182Updated last week