A dataset for training and evaluating LLMs on decision making about "when (not) to call" functions
☆64Apr 29, 2025Updated last year
Alternatives and similar repositories for When2Call
Users that are interested in When2Call are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆70May 13, 2025Updated last year
- [EMNLP 2025] Code for paper "Table-R1: Inference-Time Scaling for Table Reasoning"☆31Jun 3, 2025Updated last year
- The first large scale formally verified reasoning dataset for Verilog☆21May 16, 2025Updated last year
- ☆27May 28, 2025Updated last year
- This is a repo consisting of papers about LLMs' perception of their knowledge boundaries; Uncertainty Quantification; Honesty Alignment; …☆25Nov 25, 2025Updated 6 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A curated list of cutting-edge research papers and resources on Long Chain-of-Thought (CoT) Reasoning with Tools.☆47Dec 17, 2025Updated 5 months ago
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"☆13Jun 22, 2025Updated 11 months ago
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆394Apr 3, 2026Updated 2 months ago
- Code for the paper "Trust the PRoC3S: Solving Long-Horizon Robotics Problems with LLMs and Constraint Satisfaction" presented at CoRL 202…☆32Nov 18, 2024Updated last year
- 🍎Wende Chinese QA system (experimental)☆10Jun 1, 2021Updated 5 years ago
- An automatic workflow to search for topological materials in 1651 magnetic space groups. Ref: J. Gao, et al. "Magnetic band representatio…☆21Jul 16, 2025Updated 10 months ago
- Generate Python docstrings automatically with LLM and syntax trees☆20Jun 13, 2025Updated last year
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆38Oct 16, 2025Updated 7 months ago
- (EMBC 2020) Camera-based Hand Tracking using a Mirror-based Multi-view Setup☆15Dec 11, 2021Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- DataSciBench: An LLM Agent Benchmark for Data Science (Findings of ACL 2026)☆59Jan 21, 2026Updated 4 months ago
- Recursive Abstractive Processing for Tree-Organized Retrieval☆10May 30, 2024Updated 2 years ago
- XmodelLM☆38Nov 19, 2024Updated last year
- 本项目是July的《程序员编程艺术》的电子书版本☆10Jan 9, 2014Updated 12 years ago
- A framework for in context learning for code optimization☆54Mar 14, 2026Updated 2 months ago
- (ICLR 2025) AgentRefine: Enhancing Agent Generalization through Refinement Tuning☆19Nov 22, 2025Updated 6 months ago
- Code & Data for Comparative Opinion Summarization via Collaborative Decoding (Iso et al; Findings of ACL 2022)☆23Mar 3, 2025Updated last year
- ☆11Jun 11, 2024Updated 2 years ago
- APAR: LLMs Can Do Auto-Parallel Auto-Regressive Decoding☆14Jul 22, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- NeurIPS 2024: SciFIBench: Benchmarking Large Multimodal Models for Scientific Figure Interpretation☆13May 24, 2025Updated last year
- ☆15May 12, 2025Updated last year
- Minimalist implementation of a GPT2 with Language Model Head with PyTorch Lightning, Transformers and PyTorch-NLP.☆24Jun 12, 2023Updated 3 years ago
- This is the official implementation for "AUTOPR: LET'S AUTOMATE YOUR ACADEMIC PROMOTION!".☆102Oct 16, 2025Updated 7 months ago
- GlotEval: a unified evaluation toolkit designed to benchmark multilingual Large Language Models (LLMs) in a language-specific way☆18Nov 4, 2025Updated 7 months ago
- Official repository of the paper MPMQA: Multimodal Question Answering on Product Manuals (AAAI 2023)☆19Nov 28, 2022Updated 3 years ago
- ☆22Jan 13, 2025Updated last year
- The official implemention of "Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration" (ICML 2026)☆24Feb 4, 2026Updated 4 months ago
- Code for ACL 2023 paper "A Close Look into the Calibration of Pre-trained Language Models"☆11May 9, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆29Apr 8, 2025Updated last year
- The corresponding code from our paper "Social Commonsense Reasoning with Multi-Head Knowledge Attention (EMNLP 2020)". Do not hesitate to…☆11Jun 12, 2022Updated 4 years ago
- 2019 Baidu Machine Reading Comprehension Competition!☆10Jun 3, 2019Updated 7 years ago
- AlignX-Family is an open-source research suite for advancing personalization in large language models-spanning data, code, models, and be…☆20Jan 12, 2026Updated 5 months ago
- AgentQL's integrations with workflow automation tools and AI agent frameworks let you extract structured data from web pages using querie…☆27Jun 3, 2026Updated last week
- [ICLR2026] The official repository for the CodeGym project: "Generalizable End-to-End Tool-Use RL with Synthetic CodeGym"☆31Oct 14, 2025Updated 7 months ago
- Resources for the Enigmata Project.☆81Aug 13, 2025Updated 9 months ago