Berkeley Function Calling Leaderboard (BFCL) with Chinese-Language Evaluation
☆23Apr 6, 2025Updated 11 months ago
Alternatives and similar repositories for BFCL-CN
Users that are interested in BFCL-CN are comparing it to the libraries listed below.
- [COLING 2025] NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models☆18Jan 18, 2025Updated last year
- ☆48Feb 10, 2025Updated last year
- Math24o: High School Olympiad Mathematics Chinese Benchmark☆11Mar 27, 2025Updated 11 months ago
- A Terraform module to create and manage a Google Kubernetes Engine (GKE) Autopilot Cluster on Google Cloud Platform (GCP) https://cloud.g…☆10Nov 14, 2022Updated 3 years ago
- Open-source Human Feedback Library☆11Oct 25, 2023Updated 2 years ago
- ☆25Aug 2, 2025Updated 7 months ago
- Common Bilibili API calls.☆17Aug 5, 2023Updated 2 years ago
- ☆13Dec 3, 2023Updated 2 years ago
- Official Implementation of "Probing Language Models for Pre-training Data Detection"☆20Dec 4, 2024Updated last year
- An Unreal Engine plugin to create AVG games☆13Nov 28, 2021Updated 4 years ago
- Ant Financial natural language processing competition.☆10Sep 3, 2018Updated 7 years ago
- 🎮 A toolkit for Relation Extraction and more...☆24May 8, 2025Updated 10 months ago
- This is the official code repository for the paper "Decoding Global Preferences: Temporal and Cooperative Dependency Modeling in Multi-Ag…☆11Feb 6, 2025Updated last year
- ☆14Jul 21, 2025Updated 8 months ago
- [NIPS 2025 DB Spotlight] AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios☆29Dec 1, 2025Updated 3 months ago
- ☆459Oct 16, 2025Updated 5 months ago
- TME: Structured memory engine for LLM agents to plan, rollback, and reason across multi-step tasks.☆35Jul 31, 2025Updated 7 months ago
- A spring-boot starter component built on Alibaba Cloud Log Service, wrapping aliyun-log-java-producer☆13Jan 18, 2024Updated 2 years ago
- Action Proposals generated by deep models☆29Mar 19, 2017Updated 9 years ago
- CFBench: A Comprehensive Constraints-Following Benchmark for LLMs☆50Aug 26, 2024Updated last year
- Code for AAAI 2019 Network Interpretability workshop paper☆16Jul 5, 2021Updated 4 years ago
- Listwise Reward Estimation for Offline Preference-based Reinforcement Learning (ICML 2024)☆17Jun 18, 2024Updated last year
- Software Engineering and Computing II☆11Dec 29, 2020Updated 5 years ago
- TeLL: Log Level Suggestions via Modeling Multi-Level Code Block Information, ISSTA'22☆14Jul 14, 2022Updated 3 years ago
- Source code and checkpoints for legal pre-trained language models.☆15May 9, 2021Updated 4 years ago
- Notes of Computer Organization and Architecture @ Software Institute, Nanjing University☆10Jan 26, 2021Updated 5 years ago
- ☆13Oct 13, 2025Updated 5 months ago
- The code and data for the paper "Lost-in-the-Middle in Long-Text Generation: Synthetic Dataset, Evaluation Framework, and Mitigation"☆13Oct 8, 2025Updated 5 months ago
- Advanced C++ Programming☆15Jan 2, 2021Updated 5 years ago
- [ICSE'25] Aligning the Objective of LLM-based Program Repair☆23Mar 8, 2025Updated last year
- ☆46Mar 2, 2026Updated 3 weeks ago
- ☆15Nov 6, 2024Updated last year
- A magic mirror based on Raspberry Pi and PyGame☆18Aug 5, 2022Updated 3 years ago
- 🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts☆41Sep 29, 2024Updated last year
- ☆15Nov 23, 2020Updated 5 years ago
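- A rental-listing mini program that helps operators build an advantage with their own mini program and promptly update and verify rental information to keep it accurate, for example: approximate address, exact rent, conditions, pros and cons. Photos taken on site reduce the time users spend on in-person viewings and lower labor costs, a double benefit. On-site promotion through the mini program also lets nearby customers get to know you and interact, so they need not actively search for information every day, …☆13Jul 8, 2024Updated last year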
- A package for creating hierarchical k-means/k-means tree/vocabulary tree.☆16Dec 26, 2016Updated 9 years ago
- Stanford Sentiment Treebank machine learning & sentiment analysis library☆40Sep 23, 2013Updated 12 years ago
- Egocentric Temporal Motifs Miner☆12Nov 9, 2021Updated 4 years ago