DatasetResearch: Benchmarking Agent Systems for Demand-Driven Dataset Discovery
☆20Sep 24, 2025Updated 5 months ago
Alternatives and similar repositories for DatasetResearch
Users who are interested in DatasetResearch are comparing it to the libraries listed below.
- ICME2022 Special Session “Beyond Accuracy: Responsible, Responsive, and Robust Multimedia Retrieval”☆12Jun 3, 2024Updated last year
- A publishing website of a table collecting meta-learning-related papers in the area of human language processing.☆17Aug 2, 2021Updated 4 years ago
- ☆17Aug 7, 2024Updated last year
- ☆31Aug 7, 2025Updated 6 months ago
- CX-Mind: A Pioneering Multimodal Large Language Model for Interleaved Reasoning in Chest X-ray via Curriculum-Guided Reinforcement Lear…☆128Dec 1, 2025Updated 3 months ago
- ☆40Jan 14, 2025Updated last year
- A holistic benchmark for LLM abstention☆71Aug 27, 2025Updated 6 months ago
- Detect-Then-Explain Framework for Text-to-SQL task☆10Dec 6, 2023Updated 2 years ago
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi…☆23Oct 1, 2025Updated 5 months ago
- PatientSim: A Persona-Driven Simulator for Realistic Doctor-Patient Interactions (NeurIPS 2025 D&B track, Spotlight)☆23Feb 11, 2026Updated 2 weeks ago
- Teaching Categories to Human Learners with Visual Explanations - CVPR 2018☆11Jun 21, 2022Updated 3 years ago
- A PyTorch image classifier for recognising letters from the notMNIST dataset☆11Jan 4, 2019Updated 7 years ago
- ☆11May 18, 2022Updated 3 years ago
- A simple repository that showcases a few LLM evaluation strategies and leverages W&B Sweeps to optimize the LLM system.☆12Jul 11, 2023Updated 2 years ago
- ☆20Sep 11, 2025Updated 5 months ago
- ☆15Mar 12, 2024Updated last year
- ☆10Nov 7, 2022Updated 3 years ago
- Repo for our work "Systematic Evaluation of Large Vision-Language Models for Surgical Artificial Intelligence"☆19Jun 2, 2025Updated 9 months ago
- ☆11Feb 28, 2024Updated 2 years ago
- [ICLR 2026] Efficient Agent Training for Computer Use☆138Sep 5, 2025Updated 5 months ago
- ☆10Oct 17, 2021Updated 4 years ago
- ☆10Feb 19, 2019Updated 7 years ago
- The implementation of FedMix☆11Aug 18, 2022Updated 3 years ago
- Tool set for the VisSat project☆12Jun 21, 2022Updated 3 years ago
- ☆12Oct 3, 2023Updated 2 years ago
- T22_034_han_shi_hao_CRDDC_2022_SourceCode☆11Dec 29, 2023Updated 2 years ago
- EMNLP'2022: BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation☆41Oct 19, 2022Updated 3 years ago
- Solve ciphers with python☆10Oct 24, 2018Updated 7 years ago
- Real(ish)-time position based fluid simulation☆10Apr 30, 2015Updated 10 years ago
- Code for the AACL 2022 Paper "This Patient Looks Like That Patient: Prototypical Networks for Interpretable Diagnosis Prediction from Cli…☆12Nov 18, 2022Updated 3 years ago
- The official codes for our paper at COLING 2022: Semantic-Preserving Adversarial Code Comprehension☆12Oct 23, 2022Updated 3 years ago
- ☆10Sep 27, 2021Updated 4 years ago
- ☆11Jun 21, 2025Updated 8 months ago
- C++ code to help assign papers to reviewers, area chairs, etc. in conferences like NIPS.☆14Jun 18, 2018Updated 7 years ago
- Code used to run experiments for the ICLR 2023 paper "Computational Language Acquisition with Theory of Mind".☆15Apr 27, 2023Updated 2 years ago
- Implements DP/TP/PP using torch.distributed☆13Dec 28, 2023Updated 2 years ago
- Official repository for "DEnsity: Open-domain Dialogue Evaluation Metric using Density Estimation (ACL2023 Findings)"☆11May 23, 2023Updated 2 years ago
- A std::execution style runtime context and High Performance RPC Transport for using OpenUCX. Including CUDA/ROCM/... devices with RDMA.☆29Feb 22, 2026Updated last week
- Novel Visual Category Discovery with Dual Ranking Statistics and Mutual Knowledge Distillation. Bingchen Zhao and Kai Han. (NeurIPS 2021)☆12Aug 20, 2023Updated 2 years ago