This project benchmarks 41 open-source large language models across 19 evaluation tasks using the lm-evaluation-harness library.
☆98Sep 5, 2025Updated 7 months ago
Alternatives and similar repositories for 41-llms-evaluated-on-19-benchmarks
Users that are interested in 41-llms-evaluated-on-19-benchmarks are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- “A locally hosted, memory-aware AI microservice—designed for cultural continuity, decentralized intelligence, and ethical autonomy.”☆28May 1, 2025Updated 11 months ago
- llm-eval-simple is a simple LLM evaluation framework with intermediate actions and prompt pattern selection☆65Feb 28, 2026Updated last month
- An eternal dialogue between AI models across versions. Started by Claude Opus 4 with 50 minutes to create a legacy.☆13Jun 2, 2025Updated 10 months ago
- Quadratic formula implemented in real life.☆16Aug 12, 2025Updated 8 months ago
- Open Source + Multilingual MLLM + Fine-tuning + Distillation + More efficient models and learning + ?☆18Jan 31, 2025Updated last year
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A selection of prompts designed for use with Open Web UI (rather than conventional prompts, they're more useful for steering existing con…☆39Feb 22, 2025Updated last year
- DSPY Experiments☆15May 2, 2024Updated last year
- This hands-on walks you through fine-tuning an open source LLM on Azure and serving the fine-tuned model on Azure. It is intended for Dat…☆12Jun 23, 2024Updated last year
- ☆23Feb 23, 2026Updated last month
- ☆40Feb 25, 2026Updated last month
- "Microsoft Power BI Performance Best Practices - Second Edition, published by Packt"☆12Mar 2, 2026Updated last month
- Simple vLLM container deployment for Qwen3-Omni-30B-A3B-Instruct with up-to-date CUDA☆34Sep 28, 2025Updated 6 months ago
- ☆15Sep 13, 2024Updated last year
- the rent a hal project for AI☆21Apr 11, 2026Updated last week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Python scripts to read a Portuguese Wikipedia XML dump file, parse it and generate plain text files.☆14Mar 12, 2014Updated 12 years ago
- A GrowtopiaBot can run on Linux, Credit = DrOreo002 and GrowtopiaNoobs☆17May 8, 2022Updated 3 years ago
- Zero-Shot Learning in Named Entity Recognition with Common Sense Knowledge☆17Nov 16, 2021Updated 4 years ago
- 🤖 AI Assistant fine-tuned to provide support for coding and design questions based on the latest trends in the industry.☆17Jan 14, 2024Updated 2 years ago
- AI-powered Python library that converts any document (PDF, Word, Excel, PowerPoint, HTML) to clean Markdown while preserving complex tabl…☆48Mar 18, 2026Updated last month
- ☆48Apr 29, 2025Updated 11 months ago
- Code for using ESPhome with esp32s3 board.☆15Apr 9, 2025Updated last year
- OpenPipe Reinforcement Learning Experiments☆32Mar 14, 2025Updated last year
- Provide LLMs hosted, clean markdown documentation of libraries and frameworks☆35Apr 22, 2025Updated 11 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Instant redline with AI summary☆38Dec 7, 2025Updated 4 months ago
- ☆11Oct 3, 2021Updated 4 years ago
- ☆11Nov 27, 2025Updated 4 months ago
- ☆19Dec 2, 2024Updated last year
- ☆15May 14, 2024Updated last year
- Monolithic (Single) Docker Container for Obico Server☆12Updated this week
- A Multi-Agent System☆32Apr 28, 2025Updated 11 months ago
- Python script to scan for BLE devices and print out their attributes☆12Apr 25, 2021Updated 4 years ago
- A vuejs component to print javascript objects☆12Jun 6, 2020Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Code for Dissecting Generation Modes for Abstractive Summarization Models via Ablation and Attribution (ACL2021)☆13Jun 2, 2021Updated 4 years ago
- synthetic dataset generation workflow using local file resources for finetuning llms.☆82Oct 9, 2025Updated 6 months ago
- Clue inspired puzzles for testing LLM deduction abilities☆47Mar 19, 2026Updated last month
- Script Execution service☆12Nov 21, 2016Updated 9 years ago
- ☆30Mar 10, 2024Updated 2 years ago
- This is project for korean auto spacing☆12Aug 3, 2020Updated 5 years ago
- ☆21Mar 19, 2024Updated 2 years ago