Berkeley Function Calling Leaderboard (BFCL) with Chinese-Language Evaluation
☆24Apr 6, 2025Updated last year
Alternatives and similar repositories for BFCL-CN
Users that are interested in BFCL-CN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [COLING 2025] NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models☆18Jan 18, 2025Updated last year
- ☆48Feb 10, 2025Updated last year
- Math24o: 高中奥林匹克数学竞赛测评集 High School Olympiad Mathematics Chinese Benchmark☆11Mar 27, 2025Updated last year
- A Terraform module to create and manage a Google Kubernetes Engine (GKE) Autopilot Cluster on Google Cloud Platform (GCP) https://cloud.g…☆10Nov 14, 2022Updated 3 years ago
- Open-source Human Feedback Library☆11Oct 25, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆26Aug 2, 2025Updated 8 months ago
- 哔哩哔哩常用API调用。☆17Aug 5, 2023Updated 2 years ago
- 🎮 A toolkit for Relation Extraction and more...☆24May 8, 2025Updated 11 months ago
- This is the official code repository for the paper "Decoding Global Preferences: Temporal and Cooperative Dependency Modeling in Multi-Ag…☆12Updated this week
- [NIPS 2025 DB Spotlight] AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios☆30Dec 1, 2025Updated 4 months ago
- NJU-IT侠聊天机器人☆10Dec 13, 2021Updated 4 years ago
- TLAi+ Benchmarks☆31Feb 6, 2026Updated 2 months ago
- ☆475Oct 16, 2025Updated 5 months ago
- 基于阿里云日志服务,aliyun-log-java-producer 封装的 spring-boot starter 组件支持☆13Jan 18, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- CFBench: A Comprehensive Constraints-Following Benchmark for LLMs☆51Aug 26, 2024Updated last year
- Action Proposals generated by deep models☆29Mar 19, 2017Updated 9 years ago
- ☆14Jul 21, 2025Updated 8 months ago
- ☆17Feb 26, 2024Updated 2 years ago
- Code for AAAI 2019 Network Interpretability workshop paper☆16Jul 5, 2021Updated 4 years ago
- Listwise Reward Estimation for Offline Preference-based Reinforcement Learning (ICML 2024)☆17Jun 18, 2024Updated last year
- TeLL: Log Level Suggestions via Modeling Multi-Level Code Block Information, ISSTA'22☆14Jul 14, 2022Updated 3 years ago
- Notes of Computer Organization and Architecture @ Software Institute, Nanjing University☆10Jan 26, 2021Updated 5 years ago
- ☆14Oct 13, 2025Updated 6 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- The code and data for the paper "Lost-in-the-Middle in Long-Text Generation: Synthetic Dataset, Evaluation Framework, and Mitigation"☆13Oct 8, 2025Updated 6 months ago
- NLPCC 2020 MAMS 多属性多情感分析任务 第一名解决方案☆12Jul 6, 2023Updated 2 years ago
- ATEC 蚂蚁金服 交易风险预测 Final_score:0.7494☆12Oct 19, 2018Updated 7 years ago
- ☆50Mar 2, 2026Updated last month
- ☆17Oct 15, 2023Updated 2 years ago
- PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms☆21Mar 24, 2025Updated last year
- HeAssembly——赛博语言,但是汇编☆18Aug 19, 2022Updated 3 years ago
- ☆15Nov 6, 2024Updated last year
- 🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts☆41Sep 29, 2024Updated last year
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Heterogenous, Task- and Domain-Specific Benchmark for Unsupervised Sentence Embeddings used in the TSDAE paper: https://arxiv.org/abs/210…☆29Jan 4, 2022Updated 4 years ago
- ☆15Nov 23, 2020Updated 5 years ago
- 租房小程序方便运营者打造自己的小程序优势,及时更新核实租赁信息,保障信息的准确性,例如:大概的地址、租金具体多少、条件、优点和缺点; 有拍摄的照片,减少用户实地看房时间,又能降低人力成本,一举双得。同时通过小程序实地推广,让周边顾客了解你,形成互动,不用每天主动找寻信息,转…☆13Jul 8, 2024Updated last year
- An package for creating hierarchical k-means/k-means tree/vocabulary tree.☆16Dec 26, 2016Updated 9 years ago
- ☆19Jul 2, 2022Updated 3 years ago
- Egocentric Temporal Motifs Miner☆12Nov 9, 2021Updated 4 years ago
- A repository to introduce the algorithmic information theory. You could learn what is Kolmogorov complexity and why it is important here.☆13Jul 23, 2025Updated 8 months ago