[ACL 2025] DICE-BENCH: Evaluating the Tool-Use Capabilities of Large Language Models in Multi-Round, Multi-Party Dialogues
☆26Jul 10, 2025Updated 7 months ago
Alternatives and similar repositories for DICE-Bench
Users that are interested in DICE-Bench are comparing it to the libraries listed below
Sorting:
- The list of NLP paper and news I've checked. There might be short description of them (abstract) in Korean.☆37Updated this week
- MARU-Lang is an open-source RAG chatbot engine.☆27Feb 2, 2026Updated last month
- 한국어 벤치마크 평가 코드 통합본(?)☆20Nov 15, 2024Updated last year
- This repo investigates LLMs' tendency to exhibit acquiescence bias in sequential QA interactions. Includes evaluation methods, datasets, …☆49Sep 23, 2025Updated 5 months ago
- A lightweight adjustment tool for smoothing token probabilities in the Qwen models to encourage balanced multilingual generation.☆104Jul 9, 2025Updated 7 months ago
- huggingface transformers tutorial, code, resources☆26Apr 7, 2024Updated last year
- Official repository for KoMT-Bench built by LG AI Research☆71Aug 8, 2024Updated last year
- ACL22 paper: Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little Cost☆42Nov 15, 2023Updated 2 years ago
- The official implementation of "ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering"☆56Jun 21, 2025Updated 8 months ago
- Forked repo from https://github.com/EleutherAI/lm-evaluation-harness/commit/1f66adc☆82Feb 28, 2024Updated 2 years ago
- Autonomous-driving delivery robot project : Selly☆10Jul 11, 2020Updated 5 years ago
- 어린이를 위한 동화 제작 서비스, My AI Fairy-Tale☆11Apr 7, 2023Updated 2 years ago
- This is code for the EMNLP 2022 Paper "UniRPG: Unified Discrete Reasoning over Table and Text as Program Generation".☆10Apr 30, 2023Updated 2 years ago
- It summerizes the algorithms of Machine Learning.☆11Oct 26, 2025Updated 4 months ago
- ☆10May 19, 2024Updated last year
- 🗂️ Project tempfiles backend server!!☆10Apr 29, 2024Updated last year
- ☆10Sep 13, 2024Updated last year
- ☆10Oct 6, 2021Updated 4 years ago
- Reasoning-based Evaluation and Ranking of Translations.☆19Jul 18, 2025Updated 7 months ago
- OCI의 혜자 무료 리소스를 극한으로 뽑아 완전관리형 쿠버네티스 클러스터를 만들어 주는 IaC 코드☆21Dec 19, 2024Updated last year
- Temporal Lifting (TLift), a model-free temporal cooccurrence based score weighting method proposed in "Interpretable and Generalizable Pe…☆10Jul 24, 2020Updated 5 years ago
- AutoRAG example about benchmarking Korean embeddings.☆43Oct 2, 2024Updated last year
- 🏡Java 언어로 배우는 디자인 패턴 입문☆14Dec 8, 2020Updated 5 years ago
- An open feature provider for the LaunchDarkly node SDK.☆13Sep 11, 2025Updated 5 months ago
- personal website, blog, proj showcase☆15Feb 18, 2026Updated 2 weeks ago
- Run GEPA on your favorite non-python libraries.☆33Jan 22, 2026Updated last month
- An experimental web framework for creating user interfaces☆12Jan 30, 2024Updated 2 years ago
- ☆11Aug 3, 2024Updated last year
- 《GPT-4, ChatGPT, 라마인덱스, 랭체인을 활용한 인공지능 프로그래밍》 예제 코드☆10Jan 16, 2024Updated 2 years ago
- The implementation of the NeurIPS2020 paper: The Dilemma of TriHard Loss and an Element-Weighted TriHard Loss for Person Re-Identificatio…☆10Oct 22, 2020Updated 5 years ago
- Experimental tl;dr summaries for datasets on the Hugging Face Hub!☆10Apr 4, 2024Updated last year
- ☆12Jun 27, 2025Updated 8 months ago
- PreRanker: reranking tools before tool-use☆21Apr 9, 2025Updated 10 months ago
- ☆12Jan 2, 2024Updated 2 years ago
- ☆16Jul 17, 2025Updated 7 months ago
- Naver Boostcamp AI Tech Stage 3 : MRC (Machine Reading Comprehension)☆10Jun 10, 2021Updated 4 years ago
- final-project-level3-nlp-02 created by GitHub Classroom☆11Dec 31, 2021Updated 4 years ago
- 📢 Send a message directly to a Slack channel from a React app☆12Apr 12, 2024Updated last year
- "자연어처리 알고리즘을 활용한 느린학습자 교육 컨텐츠 제작" 프로젝트 "애움길" 팀입니다. 데이터 수집(크롤링)/EDA/Preprocessing, 쉬운말 생성요약 AI 모델링(NLP - KoBERT, KoBART), 프로토타입 제작을 진행했습니다…☆13Mar 24, 2022Updated 3 years ago