This project benchmarks 41 open-source large language models across 19 evaluation tasks using the lm-evaluation-harness library.
☆99Sep 5, 2025Updated 8 months ago
Alternatives and similar repositories for 41-llms-evaluated-on-19-benchmarks
Users that are interested in 41-llms-evaluated-on-19-benchmarks are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- llm-eval-simple is a simple LLM evaluation framework with intermediate actions and prompt pattern selection☆66Feb 28, 2026Updated 2 months ago
- Natural language control for Python CLI tools using locally-trained SLMs (CPU inference)☆30Apr 10, 2026Updated 3 weeks ago
- An eternal dialogue between AI models across versions. Started by Claude Opus 4 with 50 minutes to create a legacy.☆13Jun 2, 2025Updated 11 months ago
- ☆19Jan 17, 2021Updated 5 years ago
- MCP Proxy Server. Streaming. Node/Python. OAuth w/ DCR.☆14Apr 25, 2026Updated 2 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- world's stupidest moe llm in 103M parameters☆20Jul 18, 2025Updated 9 months ago
- ☆13Jun 2, 2024Updated last year
- Open Source + Multilingual MLLM + Fine-tuning + Distillation + More efficient models and learning + ?☆18Jan 31, 2025Updated last year
- This hands-on walks you through fine-tuning an open source LLM on Azure and serving the fine-tuned model on Azure. It is intended for Dat…☆12Jun 23, 2024Updated last year
- ☆13Jan 8, 2024Updated 2 years ago
- Graphlit Platform☆31Feb 20, 2024Updated 2 years ago
- ☆24Nov 6, 2025Updated 6 months ago
- Aplicação em Python para Optical Character Recognition (OCR), uma técnica para extrair textos em imagens. Adicionalmente, o programa tent…☆12Aug 13, 2021Updated 4 years ago
- a transformer implemented primarily using einops and trained on the tinystories dataset☆13Jun 21, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆15Sep 13, 2024Updated last year
- the rent a hal project for AI☆21Apr 11, 2026Updated 3 weeks ago
- ☆17Feb 8, 2025Updated last year
- Teaching a humanoid to walk(ish), then displaying in your browser (using tensorflow.js and reinforcement learning)☆10Sep 7, 2020Updated 5 years ago
- A Very Simple Demo of Fine Tuning Sentence Transformers☆15Jun 15, 2023Updated 2 years ago
- Coding with ChatGPT and other LLMs, published by Packt☆16Dec 9, 2024Updated last year
- ☆49Apr 29, 2025Updated last year
- AI-powered Python library that converts any document (PDF, Word, Excel, PowerPoint, HTML) to clean Markdown while preserving complex tabl…☆48Mar 18, 2026Updated last month
- OpenPipe Reinforcement Learning Experiments☆32Mar 14, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- MCP server for searching and surfacing Claude Code conversation history☆65Feb 27, 2026Updated 2 months ago
- Provide LLMs hosted, clean markdown documentation of libraries and frameworks☆35Apr 22, 2025Updated last year
- This course is published by Packt Publishing☆23Aug 2, 2023Updated 2 years ago
- Instant redline with AI summary☆38Dec 7, 2025Updated 5 months ago
- This repository contains code that was used to train and evaluate deep learning models, as described in the article "Improving breast can…☆16Aug 13, 2022Updated 3 years ago
- Leveraging☆13Dec 7, 2023Updated 2 years ago
- BookFusion Calibre Plugin☆22Apr 28, 2023Updated 3 years ago
- ☆51Oct 1, 2025Updated 7 months ago
- This project develops compact transformer models tailored for clinical text analysis, balancing efficiency and performance for healthcare…☆18Mar 26, 2024Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- open source assistant hybrid using small models (2b - 5b) and gemini , with image and agentic tool capabilities and integration of RAG…☆230Sep 30, 2025Updated 7 months ago
- ☆11Oct 3, 2021Updated 4 years ago
- ☆19Dec 2, 2024Updated last year
- ☆20Oct 8, 2024Updated last year
- CLI tool to pull short descriptions of all currently running docker containers☆26Aug 4, 2025Updated 9 months ago
- A Multi-Agent System☆32Apr 28, 2025Updated last year
- Monolithic (Single) Docker Container for Obico Server☆12Apr 24, 2026Updated 2 weeks ago