jayminban / 41-llms-evaluated-on-19-benchmarks
This project benchmarks 41 open-source large language models across 19 evaluation tasks using the lm-evaluation-harness library.
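Evaluations with lm-evaluation-harness are typically run through its `lm_eval` command-line tool. A minimal sketch of such an invocation is shown below; the model name and task list are illustrative assumptions, not taken from this repository:

```shell
# Install the harness, then evaluate one model on a couple of tasks.
# The model and tasks below are placeholders, not the repo's actual configuration.
pip install lm-eval
lm_eval --model hf \
    --model_args pretrained=meta-llama/Llama-3.1-8B-Instruct \
    --tasks mmlu,hellaswag \
    --batch_size 8 \
    --output_path results/
```

Running this for each of the 41 models across the 19 benchmark tasks would reproduce the kind of results grid the project reports.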
99 · Sep 5, 2025 · Updated 6 months ago
