Understanding the correlation between different LLM benchmarks
β29Jan 11, 2024Updated 2 years ago
Alternatives and similar repositories for understanding_llm_benchmarks
Users that are interested in understanding_llm_benchmarks are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ππ§ A minimalistic tool to fine-tune your LLMsβ18Aug 17, 2023Updated 2 years ago
- Simple Model Similarities Analysisβ21Feb 3, 2024Updated 2 years ago
- Distill thinking dataset more compactly and accurately!β38Jun 6, 2025Updated 11 months ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"β17Feb 22, 2024Updated 2 years ago
- A Flutter plugin for integrating Liquid AI's LEAP SDK, enabling on-device deployment of small language models in Flutter applications.β23Sep 3, 2025Updated 8 months ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Using multiple LLMs for ensemble Forecastingβ16Jan 17, 2024Updated 2 years ago
- A framework for few-shot evaluation of autoregressive language models.β16Aug 23, 2023Updated 2 years ago
- Adversarially Robust Generalization Just Requires More Unlabeled Dataβ11Aug 8, 2019Updated 6 years ago
- β14Jan 10, 2024Updated 2 years ago
- β28Aug 30, 2023Updated 2 years ago
- Set of scripts to finetune LLMsβ38Mar 30, 2024Updated 2 years ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.β37Oct 9, 2025Updated 6 months ago
- This repository is associated with the research paper titled ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Largeβ¦β15Jun 4, 2025Updated 11 months ago
- torch.compile artifacts for common deep learning models, can be used as a learning resource for torch.compileβ19Dec 22, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- An NLP research and data collection platform.β17Mar 13, 2024Updated 2 years ago
- β32Jan 1, 2024Updated 2 years ago
- Modular task agnostic training pipeline using LFM2 from Liquid AI with unsloth.β16Sep 13, 2025Updated 7 months ago
- Example code on how to generate viseme jsonβ14Feb 23, 2023Updated 3 years ago
- [CHIL 2024] Interpretation of Intracardiac Electrograms Through Textual Representationsβ12Sep 4, 2024Updated last year
- DAM Data Acquisition for ML Benchmark, as part of the DataPerf benchmark suite, https://dataperf.org/β25May 25, 2023Updated 2 years ago
- a set of scripts to easily convert all training data from huggingface into alpaca instruct or sharegpt format, which should allow for easβ¦β19Mar 14, 2025Updated last year
- Experimental wasm32-unknown-wasi runtime for Python code executionβ40Nov 28, 2024Updated last year
- A benchmark for emotional intelligence in large language modelsβ424Jul 26, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A minimalist Docker project to help people getting started with Node, WizardCoder, CTransformers, Python, Express and TypeScript. Ready tβ¦β14Jun 23, 2023Updated 2 years ago
- Self-Supervised Alignment with Mutual Informationβ20May 24, 2024Updated last year
- Structured output benchmarks comparing DSPy and BAML with different LLMsβ28Dec 23, 2025Updated 4 months ago
- β33Jul 8, 2024Updated last year
- β56Nov 6, 2024Updated last year
- A JupyterLite deployment to try JupyterLab, Jupyter Notebook and IPython in the browserβ13Jan 14, 2026Updated 3 months ago
- Multipack distributed sampler for fast padding-free training of LLMsβ208Aug 10, 2024Updated last year
- REST: Retrieval-Based Speculative Decoding, NAACL 2024β218Mar 5, 2026Updated 2 months ago
- [CoRL22] Motion Style Transfer: Modular Low-Rank Adaptation for Deep Motion Forecastingβ22Dec 6, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Just some nice dice in Pythonβ21Jan 6, 2026Updated 4 months ago
- β72May 22, 2023Updated 2 years ago
- β167Aug 8, 2025Updated 8 months ago
- A fork of sqlite-utils with CLI etc removedβ17Apr 28, 2026Updated last week
- Structured Generation Evalsβ14Sep 25, 2024Updated last year
- β128May 19, 2024Updated last year
- A lightweight DeepPotentialMD with JAX backend, and more than that! Built for both performance and flexibility in pure Python.β38Apr 29, 2026Updated last week