scicode-bench / SciCode
A benchmark that challenges language models to code solutions for scientific problems
☆109Updated this week
Alternatives and similar repositories for SciCode:
Users that are interested in SciCode are comparing it to the libraries listed below
- Benchmarking LLMs with Challenging Tasks from Real Users☆217Updated 4 months ago
- [NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898☆210Updated 10 months ago
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap…☆153Updated 3 months ago
- Repository for the paper Stream of Search: Learning to Search in Language☆138Updated last month
- ☆69Updated last month
- Can Language Models Solve Olympiad Programming?