☆69Nov 23, 2025Updated 3 months ago
Alternatives and similar repositories for core-bench
Users that are interested in core-bench are comparing it to the libraries listed below
Sorting:
- ☆134Oct 16, 2025Updated 5 months ago
- NeurIPS 2024: SciFIBench: Benchmarking Large Multimodal Models for Scientific Figure Interpretation☆13May 24, 2025Updated 9 months ago
- ☆49Apr 4, 2025Updated 11 months ago
- [𝐍𝐚𝐭𝐮𝐫𝐞 𝐂𝐨𝐦𝐦𝐮𝐧𝐢𝐜𝐚𝐭𝐢𝐨𝐧𝐬] 🤖💡 LiveIdeaBench: Evaluating LLMs' Scientific Creativity and Idea Generation with Minimal C…☆23Mar 8, 2026Updated 2 weeks ago
- mReasoner is a unified computational implementation of the model theory of thinking and reasoning☆13Aug 17, 2023Updated 2 years ago
- [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery☆132Mar 5, 2026Updated 2 weeks ago
- ☆20May 22, 2025Updated 10 months ago
- ☆12Mar 13, 2025Updated last year
- [ICLR 2025] DSBench: How Far are Data Science Agents from Becoming Data Science Experts?☆111Aug 17, 2025Updated 7 months ago
- ☆13Sep 26, 2024Updated last year
- [VLDB'2025] LEAP: LLM-powered End-to-end Automatic Library for Processing Social Science Queries on Unstructured Data☆19Nov 3, 2025Updated 4 months ago
- ☆36Nov 7, 2025Updated 4 months ago
- ☆10Mar 5, 2024Updated 2 years ago
- Bayesian Inverse Graphics for Few-Shot Concept Learning☆12Mar 16, 2025Updated last year
- Train text generation model with JavaScript.☆15Jul 14, 2024Updated last year
- Fine-tuning GPT-2 to generate research paper abstracts☆12Apr 28, 2021Updated 4 years ago
- Bencharking pipeline for evaluating Transcriptomic representations for perturbation tasks☆12Nov 5, 2024Updated last year
- A minimal TPU compatible Jax implementation of NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis.☆13Apr 21, 2022Updated 3 years ago
- ☆15Sep 24, 2022Updated 3 years ago
- Joint estimation of sentiment and topics in textual data☆14Aug 9, 2023Updated 2 years ago
- 📟 Logging utilities for spaCy☆12Nov 3, 2023Updated 2 years ago
- ☆53Jul 31, 2025Updated 7 months ago
- A python package to efficiently extract linguistic features for text/NLP datasets☆28Mar 3, 2026Updated 2 weeks ago
- Helpers For Meta-Analysis☆16Sep 26, 2024Updated last year
- Fuzzing Automatic Differentiation in Deep-Learning Libraries (ICSE'23)☆27Mar 2, 2024Updated 2 years ago
- Agentless Lite: RAG-based SWE-Bench software engineering scaffold☆45Apr 15, 2025Updated 11 months ago
- Materials for the 2022 GESIS Training workshop "Automatic Sampling and Analysis of YouTube Comments"☆10Feb 22, 2022Updated 4 years ago
- Materials for the 2021 GESIS Summer School in Survey Methodology course "Introduction to R for Data Analysis"☆16Aug 9, 2021Updated 4 years ago
- Longformer Encoder Decoder model for the legal domain, trained for long document abstractive summarization task.☆10Feb 26, 2021Updated 5 years ago
- ☆38Jun 14, 2025Updated 9 months ago
- A curated list of papers on LLMs and agents for scientific research and development☆86Dec 11, 2024Updated last year
- SysX☆36Mar 27, 2024Updated last year
- Generate, Prune, Select: A Pipeline for Counterspeech Generation against Online Hate Speech (ACL-IJCNLP 2021 Findings)☆13Jun 22, 2022Updated 3 years ago
- ☆13Jan 1, 2018Updated 8 years ago
- [ICLR 2025] On Evluating the Durability of Safegurads for Open-Weight LLMs☆13Jun 20, 2025Updated 9 months ago
- ☆13Feb 21, 2024Updated 2 years ago
- ☆17Mar 4, 2025Updated last year
- The official repository of ICCV 2025 paper "CATP-LLM: Empowering Large Language Models for Cost-Aware Tool Planning".☆18Nov 26, 2025Updated 3 months ago
- ☆11Oct 3, 2021Updated 4 years ago