LAMDASZ-ML / ChinaTravel
ChinaTravel: A Real-World Benchmark for Language Agents in Chinese Travel Planning
☆23Updated last week
Alternatives and similar repositories for ChinaTravel
Users who are interested in ChinaTravel are comparing it to the repositories listed below.
- OptiBench and ReSocratic Synthesis Method☆24Updated 3 months ago
- The LLMOPT project offers a comprehensive set of resources, including the model, dataset, training framework, and inference code, enablin…☆68Updated 2 months ago
- ZhiJian: A Unifying and Rapidly Deployable Toolbox for Pre-trained Model Reuse☆51Updated last year
- Official GitHub repo for SafeDialBench, a comprehensive multi-turn dialogue benchmark to evaluate LLMs' safety.☆32Updated last month
- The code repository for "OmniEvalKit: A Modular, Lightweight Toolbox for Evaluating Large Language Model and its Omni-Extensions"☆14Updated 4 months ago
- Survey on Robust Weakly Supervised Learning☆13Updated 3 years ago
- [NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?"☆34Updated 5 months ago
- Beimingwu is the first systematic open-source implementation of the learnware dock system, providing a preliminary research platform for …☆116Updated 11 months ago
- Code tasks for NJU TSA (Time Series Analysis) course☆13Updated last year
- A Framework of Continual Learning☆112Updated 2 weeks ago
- A minimal example of Abductive Learning☆13Updated last year
- Principled Data Selection for Alignment: The Hidden Risks of Difficult Examples☆37Updated 2 months ago
- ✨✨Latest Advances on Neuro-Symbolic Learning in the era of Large Language Models☆113Updated last week
- Based on the learnware paradigm, the learnware package supports the entire process including the submission, usability testing, organizat…☆100Updated last month
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization☆80Updated 10 months ago
- Welcome to the Awesome Feature Learning in Deep Learning Theory Reading Group! This repository serves as a collaborative platform for sch…☆184Updated 5 months ago
- Code for NeurIPS 2024 paper "Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs"☆37Updated 4 months ago
- DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agents☆24Updated 3 months ago
- ☆14Updated 2 weeks ago
- Evolution of Heuristics☆169Updated 4 months ago
- Improving Math reasoning through Direct Preference Optimization with Verifiable Pairs☆13Updated 3 months ago
- Official Repository of "Learning what reinforcement learning can't"☆32Updated last week
- Course projects and notes of undergraduate courses in NJUAI☆44Updated 6 months ago
- An index of algorithms for reinforcement learning from human feedback (RLHF)☆92Updated last year
- Coursework for the Fall 2021 Reinforcement Learning course at Nanjing University☆9Updated 3 years ago
- Welcome to the 'In Context Learning Theory' Reading Group☆28Updated 7 months ago
- A Python Package for Non-stationary Online Learning (PyNOL)☆31Updated last year
- A paper list of our recent survey on continual learning, and other useful resources in this field.☆83Updated last year
- Official code for "Decoding-Time Language Model Alignment with Multiple Objectives".☆24Updated 7 months ago
- A collection on the recent reproduction papers and projects on DeepSeek-R1☆31Updated 4 months ago