dsdanielpark/open-llm-leaderboard-report

dsdanielpark / open-llm-leaderboard-report

Weekly visualization report of Open LLM model performance based on 4 metrics.

☆86

Alternatives and similar repositories for open-llm-leaderboard-report

Users who are interested in open-llm-leaderboard-report are comparing it to the repositories listed below.

dsdanielpark / open-interview
Open Interview automates technical Q&A generation from resumes, offers document and audio outputs, and customizable settings for efficien…
☆19 · May 8, 2024 · Updated 2 years ago
dsdanielpark / ExceptNotifier
The Python package ExceptNotifier enhances the try-except statement, allowing you to receive detailed error messages via email or messeng…
☆31 · Apr 15, 2024 · Updated 2 years ago
dsdanielpark / arxiv2text
Converting PDF files to text, mainly with a focus on arXiv papers.
☆25 · Feb 19, 2024 · Updated 2 years ago
hollobit / Awesome-GenAITech
Awesome-GenAITech: a curated list of Generative AI Techniques
☆11 · Jul 11, 2023 · Updated 3 years ago
openfeedback / superhf
Open-source Human Feedback Library
☆11 · Oct 25, 2023 · Updated 2 years ago
myeonghak / mlflow-rankfm
An example of MLflow Tracking and Models Using Factorization Machine Recommender model library, rankfm.
☆10 · Sep 9, 2021 · Updated 4 years ago
WisdomShell / FreeEval
☆19 · Aug 3, 2024 · Updated last year
mrconter1 / BenchmarkAggregator
Comprehensive LLM evaluation framework: GPQA Diamond to Chatbot Arena. Tests all major models equally, easily extensible.
☆17 · Aug 22, 2024 · Updated last year
LAION-AI / math_problems-step-by-step_solutions
Here we provide and collect many functions to generate math problems and step-by-step solutions for LLM training
☆19 · Jun 21, 2023 · Updated 3 years ago
dsdanielpark / co-coder
Co-Coder is a Python package that streamlines error debugging from OpenAI ChatGPT and Google Bard by providing hints, example code, and…
☆45 · May 22, 2023 · Updated 3 years ago
dsdanielpark / hf-transllm
LLMtranslator translates and generates text in multiple languages.
☆45 · May 10, 2024 · Updated 2 years ago
centerforaisafety / simple-evals
Simple evaluation scripts for AI benchmarks with minimal dependencies.
☆20 · Updated this week
dsdanielpark / open-llm-datasets
Repository for organizing datasets and papers used in Open LLM.
☆100 · Jul 6, 2023 · Updated 3 years ago
IST-DASLab / SparseFinetuning
Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry
☆43 · Jan 15, 2024 · Updated 2 years ago
vicksEmmanuel / latent-gemma
☆27 · Jan 14, 2025 · Updated last year
choosewhatulike / case2code
☆17 · Apr 7, 2025 · Updated last year
anomic1911 / GNNExplainer-Experiments
This repository contains the experiments performed on the GNNExplainer Code
☆16 · Oct 12, 2021 · Updated 4 years ago
Upaya07 / NeurIPS-llm-efficiency-challenge
Code for NeurIPS LLM Efficiency Challenge
☆62 · Apr 9, 2024 · Updated 2 years ago
aifrenz / membership
☆24 · Jul 23, 2024 · Updated 2 years ago
SALT-NLP / demonstrated-feedback
☆131 · Oct 1, 2024 · Updated last year
LuoXiaoHeics / Continual-Tune
☆10 · Feb 6, 2025 · Updated last year
mlabonne / tinytuner
🐜🔧 A minimalistic tool to fine-tune your LLMs
☆19 · Aug 17, 2023 · Updated 2 years ago
PrasannS / rlhf-length-biases
☆27 · Mar 13, 2024 · Updated 2 years ago
dmahan93 / lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
☆16 · Aug 23, 2023 · Updated 2 years ago
MrBananaHuman / CounselGPT
Korean psychological counseling dataset
☆81 · Jun 20, 2023 · Updated 3 years ago
gauss5930 / iDUS
An unofficial implementation of the SOLAR-10.7B model and the newly proposed interlocked-DUS (iDUS), with implementation and experiment details.
☆14 · Mar 20, 2024 · Updated 2 years ago
InflectionAI / Inflection-Benchmarks
Public Inflection Benchmarks
☆67 · Mar 6, 2024 · Updated 2 years ago
armbues / SiLLM-examples
Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon
☆16 · May 8, 2025 · Updated last year
LarkspurCA / androidweardocs
Android wear developer docs
☆14 · Aug 25, 2018 · Updated 7 years ago
zarakiquemparte / zaraki-tools
☆28 · Aug 30, 2023 · Updated 2 years ago
RUCAIBox / ChainLM
☆31 · Mar 23, 2024 · Updated 2 years ago
emrgnt-cmplxty / zero-shot-replication
☆75 · Sep 5, 2023 · Updated 2 years ago
ad-si / adriansieber-com
My website & blog with articles about coding, tech, functional programming, …
☆10 · Jun 12, 2026 · Updated last month
abvijaykumar / rag-llamaindex-blog
Source code used in the blog
☆12 · Feb 6, 2024 · Updated 2 years ago
samchaineau / llm_slerp_generation
Repo hosting code and materials related to speeding up LLM inference using token merging.
☆37 · Oct 9, 2025 · Updated 9 months ago
Nicolas-Yax / PhyloLM
Genetics for Language Models
☆18 · Jul 1, 2024 · Updated 2 years ago
thoppe / The-Pile-PhilPapers
Download, parse, and filter data from Phil Papers. Data-ready for The-Pile.
☆20 · Aug 28, 2023 · Updated 2 years ago
lvwerra / deep-math
Implementation of "Analysing Mathematical Reasoning Abilities of Neural Models"
☆30 · Mar 25, 2023 · Updated 3 years ago
Genius1237 / TyDiP
TyDiP Multilingual Politeness dataset and code
☆12 · Oct 15, 2023 · Updated 2 years ago