☆76Nov 23, 2025Updated 6 months ago
Alternatives and similar repositories for core-bench
Users that are interested in core-bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆293Updated this week
- ☆53Apr 4, 2025Updated last year
- This project showcases a comprehensive analysis of CO2 emissions in a fictitious cheese manufacturing supply chain using both graph datab…☆11Sep 18, 2024Updated last year
- [𝐍𝐚𝐭𝐮𝐫𝐞 𝐂𝐨𝐦𝐦𝐮𝐧𝐢𝐜𝐚𝐭𝐢𝐨𝐧𝐬] 🤖💡 LiveIdeaBench: Evaluating LLMs' Scientific Creativity and Idea Generation with Minimal C…☆27Apr 21, 2026Updated last month
- Source code and data for ADEPT: A DEbiasing PrompT Framework (AAAI-23).☆15Dec 13, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery☆140Apr 29, 2026Updated last month
- ☆20May 22, 2025Updated last year
- [ICLR 2025] DSBench: How Far are Data Science Agents from Becoming Data Science Experts?☆117Aug 17, 2025Updated 9 months ago
- ☆13Sep 26, 2024Updated last year
- Train text generation model with JavaScript.☆15Jul 14, 2024Updated last year
- [VLDB'2025] LEAP: LLM-powered End-to-end Automatic Library for Processing Social Science Queries on Unstructured Data☆20Nov 3, 2025Updated 7 months ago
- ☆10Mar 5, 2024Updated 2 years ago
- Bayesian Inverse Graphics for Few-Shot Concept Learning☆12Mar 16, 2025Updated last year
- Fine-tuning GPT-2 to generate research paper abstracts☆12Apr 28, 2021Updated 5 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Bencharking pipeline for evaluating Transcriptomic representations for perturbation tasks☆13Nov 5, 2024Updated last year
- A minimal TPU compatible Jax implementation of NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis.☆13Apr 21, 2022Updated 4 years ago
- Harness used to benchmark aider against SWE Bench benchmarks☆84Jun 27, 2024Updated last year
- ☆15Jun 30, 2025Updated 11 months ago
- AI Assistance for Writing Scientific Alt Text☆14Feb 7, 2024Updated 2 years ago
- Fast Kolmogorov-Arnold Network in JAX, initial experiments☆16May 20, 2024Updated 2 years ago
- 📟 Logging utilities for spaCy☆12Nov 3, 2023Updated 2 years ago
- ☆57Jul 31, 2025Updated 10 months ago
- 基于Django的京东商品比价系统+基于request京东爬虫☆12Jun 19, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Fuzzing Automatic Differentiation in Deep-Learning Libraries (ICSE'23)☆28Mar 2, 2024Updated 2 years ago
- Optimisation on Diffeomorphisms☆12Feb 17, 2025Updated last year
- a simple DBMS for DB course in ZJU with go☆12Aug 21, 2022Updated 3 years ago
- 浙江大学2025-2026学年秋冬学期 计算机网络 课程实验文档☆14May 17, 2026Updated 3 weeks ago
- ☆21Jun 12, 2024Updated last year
- ☆11Mar 27, 2023Updated 3 years ago
- My code solutions to exercises of Bayesian Reasoning and Machine Learning☆19Sep 2, 2021Updated 4 years ago
- Free Lunch for Testing: Fuzzing Deep-Learning Libraries from Open Source (ICSE'22)☆81Nov 2, 2022Updated 3 years ago
- Agentless Lite: RAG-based SWE-Bench software engineering scaffold☆48Apr 15, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆19Dec 2, 2025Updated 6 months ago
- Evaluating methods for estimating aperiodic activity in electrophysiological data.☆17Sep 24, 2024Updated last year
- ☆15Jun 30, 2023Updated 2 years ago
- A curated list of papers on LLMs and agents for scientific research and development☆91Dec 11, 2024Updated last year
- Generate, Prune, Select: A Pipeline for Counterspeech Generation against Online Hate Speech (ACL-IJCNLP 2021 Findings)☆13Jun 22, 2022Updated 3 years ago
- Reward Evolution with Large Language Models using Human Feedback☆20Nov 14, 2025Updated 6 months ago
- [ICLR 2025] On Evluating the Durability of Safegurads for Open-Weight LLMs☆13Jun 20, 2025Updated 11 months ago