All-in-one benchmarking platform for evaluating LLM.
☆15Nov 12, 2025Updated 5 months ago
Alternatives and similar repositories for evalhub
Users that are interested in evalhub are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆23Oct 10, 2025Updated 6 months ago
- Offical implementation of our paper "Exploring the Potential of Diffusion Large Language Models in Code Generation".☆20Oct 29, 2025Updated 5 months ago
- 2022 USTC 011705 (OSH) Course Project of Runikraft Group☆13Jul 22, 2022Updated 3 years ago
- Based on the R1-Zero method, using rule-based rewards and GRPO on the Code Contests dataset.☆18Apr 22, 2025Updated 11 months ago
- ☆12Apr 6, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- 💻 Terminal-Agent with Human-in-the-Loop Learning☆39Jan 16, 2026Updated 2 months ago
- ustc自动选课和换班脚本☆19Mar 8, 2025Updated last year
- [ICML25] CODESYNC: Synchronizing Large Language Models with Dynamic Code Evolution at Scale☆25Jul 31, 2025Updated 8 months ago
- ☆52Mar 9, 2026Updated last month
- possibly useful materials for learning RWKV language model.☆26Jun 8, 2023Updated 2 years ago
- FlexAttention w/ FlashAttention3 Support☆27Oct 5, 2024Updated last year
- A lightweight Inference Engine built for block diffusion models☆43Updated this week
- ☆11Jan 3, 2024Updated 2 years ago
- Labs of 2019 Web Information Processing and Application in USTC.☆11Jan 15, 2020Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Code for the paper "Molecule Design by Latent Space Energy-based Modeling and Gradual Distribution Shifting" in UAI 2023☆15Nov 15, 2023Updated 2 years ago
- 中科大计算机学院部分课程的试卷☆95Jul 25, 2025Updated 8 months ago
- ☆15Feb 26, 2025Updated last year
- ☆16Feb 6, 2024Updated 2 years ago
- Muon fsdp 2☆56Aug 8, 2025Updated 8 months ago
- Experiments on Multi-Head Latent Attention☆101Aug 19, 2024Updated last year
- My LaTeX slides packages.☆51Dec 6, 2016Updated 9 years ago
- ☆22May 2, 2025Updated 11 months ago
- The rule-based evaluation subset and code implementation of Omni-MATH☆27Dec 23, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Gender prediction of chinese name based on LSTM☆14Mar 16, 2023Updated 3 years ago
- The official implementation of dLLM-Var☆32Nov 6, 2025Updated 5 months ago
- Toolkit for Universal Retrieval, such as text retrieval, item recommendation, image retrieval, etc.☆17Sep 15, 2025Updated 6 months ago
- Reproducing R1 for Code with Reliable Rewards☆302May 5, 2025Updated 11 months ago
- ☆24Mar 12, 2025Updated last year
- R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning☆36Feb 9, 2026Updated 2 months ago
- An algorithm for computing the Fourier shell correlation from a single measurement☆15Nov 11, 2023Updated 2 years ago
- ☆115Jun 27, 2019Updated 6 years ago
- Serializing molecule 3D structures☆14Nov 27, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆25Apr 7, 2026Updated last week
- USTC-Computer Science-Resources☆44Nov 29, 2021Updated 4 years ago
- Must-read papers on Repository-level Code Generation & Issue Resolution 🔥☆274Apr 7, 2026Updated last week
- Code for MLSys 2024 Paper "SiDA-MoE: Sparsity-Inspired Data-Aware Serving for Efficient and Scalable Large Mixture-of-Experts Models"☆22Apr 13, 2024Updated 2 years ago
- [IPMI 2025 Oral] 4DRGS: 4D Radiative Gaussian Splatting☆31Jan 23, 2026Updated 2 months ago
- ☆21Mar 2, 2022Updated 4 years ago
- ☆69Jul 8, 2025Updated 9 months ago