Various LLM Benchmarks
☆26Feb 20, 2026Updated 3 months ago
Alternatives and similar repositories for llmbenchmark
Users who are interested in llmbenchmark are comparing it to the libraries listed below.
- ☆17Nov 23, 2023Updated 2 years ago
- Measuring Thinking Efficiency in Reasoning Models - Research Repository☆40Dec 2, 2025Updated 6 months ago
- Attend - to what matters.☆17Feb 22, 2025Updated last year
- ☆13Mar 23, 2025Updated last year
- Clipboard Regex Replace is a lightweight GoLang application that allows you to automatically apply regex-based replacements to your clipb…☆10Jan 20, 2026Updated 4 months ago
- ☆15Mar 18, 2026Updated 2 months ago
- Lightweight Python Wrapper for OpenVINO, enabling LLM inference on NPUs☆29Dec 17, 2024Updated last year
- ☆17Mar 28, 2025Updated last year
- 🍨 Gelato — From Data Curation to Reinforcement Learning: Building a Strong Grounding Model for Computer-Use Agents☆46Dec 22, 2025Updated 5 months ago
- Docker in 600 lines of bash using proot☆17Oct 9, 2020Updated 5 years ago
- This repo consists of the code as discussed in the Medium blog.☆17Sep 10, 2023Updated 2 years ago
- Clue inspired puzzles for testing LLM deduction abilities☆47Mar 19, 2026Updated 2 months ago
- Public Goods Game (PGG) Benchmark: Contribute & Punish is a multi-agent benchmark that tests cooperative and self-interested strategies a…☆41Apr 10, 2025Updated last year
- fork of litellm that is open source☆26Apr 29, 2026Updated last month
- GenAI Playground☆23Nov 6, 2024Updated last year
- Lego for GRPO☆30May 27, 2025Updated last year
- Train your own SOTA deductive reasoning model☆112Mar 6, 2025Updated last year
- Lottery Ticket Adaptation☆40Nov 20, 2024Updated last year
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆26Mar 28, 2025Updated last year
- ☆28Oct 14, 2024Updated last year
- The DPAB-α Benchmark☆32Jan 15, 2025Updated last year
- OpenPipe Reinforcement Learning Experiments☆32Mar 14, 2025Updated last year
- ☆58May 31, 2026Updated last week
- ☆16Dec 17, 2022Updated 3 years ago
- Try out HallOumi, a state-of-the-art claim verification model in a simple UI!☆41Apr 2, 2025Updated last year
- Generate Your Own Private Morning Radio for Commute☆32Feb 5, 2025Updated last year
- Code implementation for paper AbsenceBench: Language Models Can't Tell What's Missing☆19Oct 23, 2025Updated 7 months ago
- A simple tutorial for Package C Library for luajit.☆12Dec 28, 2020Updated 5 years ago
- ☆20Sep 27, 2025Updated 8 months ago
- Analyze Reddit posts☆31Updated this week
- An LLM Client for the PS Vita☆13Jun 23, 2025Updated 11 months ago
- Exploring how optimizations for GEMMs work☆36Feb 28, 2026Updated 3 months ago
- A collection of Basalt (Bash) packages.☆14Feb 6, 2026Updated 4 months ago
- Prior Sampling for high dimension data with domain knowledge.☆10Jan 11, 2022Updated 4 years ago
- An expiring key/value cache with a Redis interface☆13Sep 10, 2015Updated 10 years ago
- Experimental Marimo extension for Agentic Notebooks -- integrating AI Agents into the Notebook workflow☆15Oct 11, 2025Updated 8 months ago
- [ACL 2025] NeuSym-RAG: Hybrid Neural Symbolic Retrieval with Multiview Structuring for PDF Question Answering☆28Jul 29, 2025Updated 10 months ago
- Fill up the `model_list` field in your LiteLLM proxy configuration file☆10Sep 7, 2024Updated last year
- A new repo to demonstrate tutorials for using HuggingFace on Graphcore IPUs.☆12May 3, 2023Updated 3 years ago