This project benchmarks 41 open-source large language models across 19 evaluation tasks using the lm-evaluation-harness library.
☆101Sep 5, 2025Updated 9 months ago
Alternatives and similar repositories for 41-llms-evaluated-on-19-benchmarks
Users that are interested in 41-llms-evaluated-on-19-benchmarks are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- llm-eval-simple is a simple LLM evaluation framework with intermediate actions and prompt pattern selection☆69Feb 28, 2026Updated 3 months ago
- ☆19Jan 17, 2021Updated 5 years ago
- ☆29Mar 2, 2026Updated 3 months ago
- world's stupidest moe llm in 103M parameters☆20Jul 18, 2025Updated 11 months ago
- ☆13Jun 2, 2024Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Quadratic formula implemented in real life.☆17Aug 12, 2025Updated 10 months ago
- DSPY Experiments☆15May 2, 2024Updated 2 years ago
- Use Amazon Comprehend Medical to extract medical insight from notes inside the OMOP Common Data Model☆14Feb 28, 2019Updated 7 years ago
- ☆13Jan 8, 2024Updated 2 years ago
- Python library for analyzing data quality and its impact on model performance across classification and object-detection tasks.☆17Updated this week
- Aplicação em Python para Optical Character Recognition (OCR), uma técnica para extrair textos em imagens. Adicionalmente, o programa tent…☆12Aug 13, 2021Updated 4 years ago
- a transformer implemented primarily using einops and trained on the tinystories dataset☆13Jun 21, 2024Updated last year
- Aulas de conceitos básicos de Processamento de Linguagem Natural oferecida no Discord aberto no Turing USP☆10Jul 30, 2021Updated 4 years ago
- the rent a hal project for AI☆23Apr 11, 2026Updated 2 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- An extension to use Kokoro TTS in text generation webui☆22May 5, 2025Updated last year
- Python scripts to read a Portuguese Wikipedia XML dump file, parse it and generate plain text files.☆14Mar 12, 2014Updated 12 years ago
- Teaching a humanoid to walk(ish), then displaying in your browser (using tensorflow.js and reinforcement learning)☆10Sep 7, 2020Updated 5 years ago
- Try out HallOumi, a state-of-the-art claim verification model in a simple UI!☆41Apr 2, 2025Updated last year
- ☆71May 19, 2025Updated last year
- Zero-Shot Learning in Named Entity Recognition with Common Sense Knowledge☆17Nov 16, 2021Updated 4 years ago
- Coding with ChatGPT and other LLMs, published by Packt☆16Dec 9, 2024Updated last year
- Provide LLMs hosted, clean markdown documentation of libraries and frameworks☆36Apr 22, 2025Updated last year
- OpenPipe Reinforcement Learning Experiments☆33Mar 14, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- This repository contains code that was used to train and evaluate deep learning models, as described in the article "Improving breast can…☆16Aug 13, 2022Updated 3 years ago
- Brazilian Tertiary Care Dataset☆17Dec 14, 2022Updated 3 years ago
- Leveraging☆13Dec 7, 2023Updated 2 years ago
- Generate Large Language Model text in a container.☆20Mar 24, 2023Updated 3 years ago
- Repository for CSCI E/S-108 Data Mining and Exploration Course☆24Jun 5, 2026Updated last week
- Natural Language Understanding with Python published by Packt Publishing☆43Sep 20, 2023Updated 2 years ago
- ☆51Oct 1, 2025Updated 8 months ago
- ☆10Jul 30, 2019Updated 6 years ago
- Charlson Comorbidity Index Regression using Clinical Notes☆10Jul 26, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ROCm Install Utilities: rocminstall.py script to install a specific ROCm release version/revision.☆13Jun 20, 2025Updated 11 months ago
- Komoran 3 in Python☆11Dec 10, 2018Updated 7 years ago
- ☆11Oct 3, 2021Updated 4 years ago
- ☆25May 19, 2026Updated 3 weeks ago
- falcon game server☆25Feb 13, 2019Updated 7 years ago
- CLI Tool for 8k context ai models☆114Apr 28, 2026Updated last month
- A Multi-Agent System☆34Apr 28, 2025Updated last year