A dataset for training and evaluating LLMs on decision making about "when (not) to call" functions
☆60Apr 29, 2025Updated last year
Alternatives and similar repositories for When2Call
Users that are interested in When2Call are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆68May 13, 2025Updated 11 months ago
- ☆35May 24, 2025Updated 11 months ago
- [EMNLP 2025] Code for paper "Table-R1: Inference-Time Scaling for Table Reasoning"☆29Jun 3, 2025Updated 10 months ago
- The first large scale formally verified reasoning dataset for Verilog☆21May 16, 2025Updated 11 months ago
- FamilyTool benchmark☆13Sep 10, 2025Updated 7 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆26May 28, 2025Updated 11 months ago
- ☆10Dec 18, 2023Updated 2 years ago
- This is a repo consisting of papers about LLMs' perception of their knowledge boundaries; Uncertainty Quantification; Honesty Alignment; …☆25Nov 25, 2025Updated 5 months ago
- A curated list of cutting-edge research papers and resources on Long Chain-of-Thought (CoT) Reasoning with Tools.☆47Dec 17, 2025Updated 4 months ago
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"☆13Jun 22, 2025Updated 10 months ago
- 武大信图抢座程序 支持后台持续监测,抢靠窗、有电脑的座位 以及抢座成功后自动关机☆15Dec 8, 2022Updated 3 years ago
- VehicleWorld is the first comprehensive multi-device environment for intelligent vehicle interaction that accurately models the complex, …☆22Sep 16, 2025Updated 7 months ago
- An automatic workflow to search for topological materials in 1651 magnetic space groups. Ref: J. Gao, et al. "Magnetic band representatio…☆19Jul 16, 2025Updated 9 months ago
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆371Apr 3, 2026Updated 3 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 部署在树莓派上实现移动物体的实时监控。(开源版本)☆18Jan 11, 2023Updated 3 years ago
- ☆13Feb 8, 2025Updated last year
- The Artifacts for ICSE 2023 paper: Bad Snakes: Understanding and Improving Python Package Index Malware Scanning☆13Feb 8, 2026Updated 2 months ago
- Code for the paper "Trust the PRoC3S: Solving Long-Horizon Robotics Problems with LLMs and Constraint Satisfaction" presented at CoRL 202…☆31Nov 18, 2024Updated last year
- DataSciBench: An LLM Agent Benchmark for Data Science☆57Jan 21, 2026Updated 3 months ago
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆38Oct 16, 2025Updated 6 months ago
- Recursive Abstractive Processing for Tree-Organized Retrieval☆10May 30, 2024Updated last year
- XmodelLM☆38Nov 19, 2024Updated last year
- An extended version of Brainfuck with parallel processing and locking support☆25Aug 15, 2025Updated 8 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code & Data for Comparative Opinion Summarization via Collaborative Decoding (Iso et al; Findings of ACL 2022)☆23Mar 3, 2025Updated last year
- ☆343Aug 12, 2025Updated 8 months ago
- Pretrained models, codes and guidances to pretrain official ALBERT(https://github.com/google-research/albert) on Japanese Wikipedia☆13Sep 26, 2023Updated 2 years ago
- ☆11Jun 11, 2024Updated last year
- 🚀 [ICLR '25] RocketEval: Efficient Automated LLM Evaluation via Grading Checklist☆16Aug 21, 2025Updated 8 months ago
- ☆14May 12, 2025Updated 11 months ago
- Cheng-Hao Tu, Cheng-En Wu and Chu-Song Chen, "Extending Conditional Convolution Structures for Enhancing Multitasking Continual Learning,…☆14Jan 8, 2021Updated 5 years ago
- ☆11Nov 28, 2022Updated 3 years ago
- Minimalist implementation of a GPT2 with Language Model Head with PyTorch Lightning, Transformers and PyTorch-NLP.☆24Jun 12, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This is the official implementation for "AUTOPR: LET'S AUTOMATE YOUR ACADEMIC PROMOTION!".☆104Oct 16, 2025Updated 6 months ago
- GlotEval: a unified evaluation toolkit designed to benchmark multilingual Large Language Models (LLMs) in a language-specific way☆18Nov 4, 2025Updated 5 months ago
- Official repository of the paper MPMQA: Multimodal Question Answering on Product Manuals (AAAI 2023)☆19Nov 28, 2022Updated 3 years ago
- Articulated Pose Estimator algorithm for human hands and body☆13Sep 21, 2018Updated 7 years ago
- ☆22Jan 13, 2025Updated last year
- ☆15Jun 23, 2019Updated 6 years ago
- ☆29Apr 8, 2025Updated last year