☆69Nov 23, 2025Updated 4 months ago
Alternatives and similar repositories for core-bench
Users that are interested in core-bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆135Oct 16, 2025Updated 5 months ago
- NeurIPS 2024: SciFIBench: Benchmarking Large Multimodal Models for Scientific Figure Interpretation☆13May 24, 2025Updated 10 months ago
- ☆49Apr 4, 2025Updated last year
- [𝐍𝐚𝐭𝐮𝐫𝐞 𝐂𝐨𝐦𝐦𝐮𝐧𝐢𝐜𝐚𝐭𝐢𝐨𝐧𝐬] 🤖💡 LiveIdeaBench: Evaluating LLMs' Scientific Creativity and Idea Generation with Minimal C…☆24Mar 8, 2026Updated last month
- Source code and data for ADEPT: A DEbiasing PrompT Framework (AAAI-23).☆15Dec 13, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆12Mar 13, 2025Updated last year
- [ICLR 2025] DSBench: How Far are Data Science Agents from Becoming Data Science Experts?☆111Aug 17, 2025Updated 7 months ago
- ☆13Sep 26, 2024Updated last year
- Probe how GPT-n performs on statutory reasoning☆10Sep 17, 2024Updated last year
- Bayesian Inverse Graphics for Few-Shot Concept Learning☆12Mar 16, 2025Updated last year
- Econometrics on the GPU (and CPU) via JAX☆16Jul 12, 2025Updated 9 months ago
- Fine-tuning GPT-2 to generate research paper abstracts☆12Apr 28, 2021Updated 4 years ago
- Harness used to benchmark aider against SWE Bench benchmarks☆80Jun 27, 2024Updated last year
- Workshop "Analyzing Social Media Data" at the Big Data and Development Conference☆11Sep 11, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Introduction to Econometrics at the University of Oregon (EC421) during Spring quarter, 2020. Taught by Ed Rubin☆14Jan 27, 2022Updated 4 years ago
- POSIX: A Prompt Sensitivity Index for Language Models☆13Nov 13, 2024Updated last year
- Joint estimation of sentiment and topics in textual data☆14Aug 9, 2023Updated 2 years ago
- AI Assistance for Writing Scientific Alt Text☆14Feb 7, 2024Updated 2 years ago
- 📟 Logging utilities for spaCy☆12Nov 3, 2023Updated 2 years ago
- MCP examples☆10May 20, 2025Updated 10 months ago
- Fortran bindings to the FLANN library for performing fast approximate nearest neighbor searches in high dimensional spaces.☆15Jun 29, 2025Updated 9 months ago
- Reward Evolution with Large Language Models using Human Feedback☆18Nov 14, 2025Updated 5 months ago
- Cross Atlas Remapping via Optimal Transport☆12Dec 14, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Fuzzing Automatic Differentiation in Deep-Learning Libraries (ICSE'23)☆27Mar 2, 2024Updated 2 years ago
- a lightweight Python binding of the CLASS CMB Boltzmann code☆11Aug 8, 2025Updated 8 months ago
- Agentless Lite: RAG-based SWE-Bench software engineering scaffold☆45Apr 15, 2025Updated 11 months ago
- A multilingual DeBERTa model fine-tuned on political communication to classify discrete emotions☆16Nov 10, 2023Updated 2 years ago
- Understand what physics/algorithms do transformers learn internally when trained on planetary motion☆39Feb 9, 2026Updated 2 months ago
- Code for moving DM in Nbody sims and painting baryons to Nbody☆20Apr 3, 2026Updated last week
- ☆20Jun 12, 2024Updated last year
- ☆16Jul 7, 2025Updated 9 months ago
- ☆17May 16, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆18Dec 2, 2025Updated 4 months ago
- ☆11Sep 17, 2024Updated last year
- Generate, Prune, Select: A Pipeline for Counterspeech Generation against Online Hate Speech (ACL-IJCNLP 2021 Findings)☆13Jun 22, 2022Updated 3 years ago
- Model-to-observable projection code for galaxy thermodynamics☆10Nov 25, 2022Updated 3 years ago
- A LaTeX template for formal theory papers in political science☆13Sep 13, 2022Updated 3 years ago
- [ICLR 2025] On Evluating the Durability of Safegurads for Open-Weight LLMs☆13Jun 20, 2025Updated 9 months ago
- The official repository of ICCV 2025 paper "CATP-LLM: Empowering Large Language Models for Cost-Aware Tool Planning".☆18Nov 26, 2025Updated 4 months ago