Understanding the correlation between different LLM benchmarks
☆29 · Jan 11, 2024 · Updated 2 years ago
Alternatives and similar repositories for understanding_llm_benchmarks
Users that are interested in understanding_llm_benchmarks are comparing it to the libraries listed below.
- A minimalistic tool to fine-tune your LLMs ☆18 · Aug 17, 2023 · Updated 2 years ago
- An unofficial implementation of the SOLAR-10.7B model and the newly proposed interlocked-DUS (iDUS), with implementation and experiment details. ☆14 · Mar 20, 2024 · Updated 2 years ago
- Simple Model Similarities Analysis ☆21 · Feb 3, 2024 · Updated 2 years ago
- Distill thinking dataset more compactly and accurately! ☆38 · Jun 6, 2025 · Updated last year
- This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data" ☆17 · Feb 22, 2024 · Updated 2 years ago
- ☆67 · Mar 4, 2024 · Updated 2 years ago
- Layout Analysis Dataset with Segmonto (LADaS) ☆25 · May 29, 2026 · Updated 2 weeks ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging. ☆37 · Oct 9, 2025 · Updated 8 months ago
- This repository is associated with the research paper titled ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large… ☆15 · Jun 4, 2025 · Updated last year
- ☆46 · Jan 24, 2024 · Updated 2 years ago
- torch.compile artifacts for common deep learning models, can be used as a learning resource for torch.compile ☆19 · Dec 22, 2023 · Updated 2 years ago
- ☆32 · Jan 1, 2024 · Updated 2 years ago
- [CHIL 2024] Interpretation of Intracardiac Electrograms Through Textual Representations ☆12 · Sep 4, 2024 · Updated last year
- Official repository Flash Local Linear Attention ☆36 · May 28, 2026 · Updated 2 weeks ago
- DAM Data Acquisition for ML Benchmark, as part of the DataPerf benchmark suite, https://dataperf.org/ ☆25 · May 25, 2023 · Updated 3 years ago
- A benchmark for emotional intelligence in large language models ☆430 · Jul 26, 2024 · Updated last year
- A minimalist Docker project to help people getting started with Node, WizardCoder, CTransformers, Python, Express and TypeScript. Ready t… ☆14 · Jun 23, 2023 · Updated 2 years ago
- Self-Supervised Alignment with Mutual Information ☆20 · May 24, 2024 · Updated 2 years ago
- Structured output benchmarks comparing DSPy and BAML with different LLMs ☆28 · Dec 23, 2025 · Updated 5 months ago
- ☆33 · Jul 8, 2024 · Updated last year
- Genetics for Language Models ☆17 · Jul 1, 2024 · Updated last year
- ☆56 · Nov 6, 2024 · Updated last year
- A comprehensive deep dive into the world of tokens ☆229 · Jun 24, 2024 · Updated last year
- Multipack distributed sampler for fast padding-free training of LLMs ☆207 · Aug 10, 2024 · Updated last year
- [CoRL22] Motion Style Transfer: Modular Low-Rank Adaptation for Deep Motion Forecasting ☆22 · Dec 6, 2022 · Updated 3 years ago
- Just some nice dice in Python ☆22 · Updated this week
- ☆72 · May 22, 2023 · Updated 3 years ago
- Structured Generation Evals ☆14 · Sep 25, 2024 · Updated last year
- A fun PGM experience ☆15 · May 19, 2025 · Updated last year
- Local emulator for Hugging Face Inference Endpoints customer handlers ☆27 · May 28, 2026 · Updated 2 weeks ago
- ☆128 · May 19, 2024 · Updated 2 years ago
- The git repository of Modular Prompted Chatbot paper ☆35 · May 24, 2023 · Updated 3 years ago
- Code and Data Repo for the CoNLL Paper -- Future Lens: Anticipating Subsequent Tokens from a Single Hidden State ☆21 · Oct 24, 2025 · Updated 7 months ago
- Code for the NeurIPS 2020 paper "Improved analysis of clipping algorithms for non-convex optimization", including various clipping algori… ☆10 · Feb 17, 2021 · Updated 5 years ago
- Markdown + GitHub -> Blog ☆12 · Dec 16, 2023 · Updated 2 years ago
- The repository of CLEME (EMNLP 2023) and CLEME2.0 (ACL 2025) ☆12 · May 17, 2025 · Updated last year
- Sotopia-RL: Reward Design for Social Intelligence ☆50 · Apr 1, 2026 · Updated 2 months ago
- gpt-3.5-turbo-instruct, prompted with PGN, vs Stockfish Level 4 on LiChess ☆15 · Sep 19, 2023 · Updated 2 years ago
- ☆28 · Feb 24, 2024 · Updated 2 years ago