Understanding the correlation between different LLM benchmarks
☆29Jan 11, 2024Updated 2 years ago
Alternatives and similar repositories for understanding_llm_benchmarks
Users that are interested in understanding_llm_benchmarks are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.☆14Mar 20, 2024Updated 2 years ago
- Simple Model Similarities Analysis☆21Feb 3, 2024Updated 2 years ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated 2 years ago
- Using multiple LLMs for ensemble Forecasting☆16Jan 17, 2024Updated 2 years ago
- A framework for few-shot evaluation of autoregressive language models.☆16Aug 23, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆14Jan 10, 2024Updated 2 years ago
- ☆28Aug 30, 2023Updated 2 years ago
- Set of scripts to finetune LLMs☆38Mar 30, 2024Updated 2 years ago
- Layout Analysis Dataset with Segmonto (LADaS)☆24Jul 12, 2025Updated 9 months ago
- ☆45Jan 24, 2024Updated 2 years ago
- torch.compile artifacts for common deep learning models, can be used as a learning resource for torch.compile☆19Dec 22, 2023Updated 2 years ago
- An NLP research and data collection platform.☆17Mar 13, 2024Updated 2 years ago
- Example code on how to generate viseme json☆14Feb 23, 2023Updated 3 years ago
- [CHIL 2024] Interpretation of Intracardiac Electrograms Through Textual Representations☆12Sep 4, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi…☆23Oct 1, 2025Updated 6 months ago
- An unnecessarily tiny and minimal implementation of GPT-2 in NumPy.☆11Feb 12, 2023Updated 3 years ago
- A benchmark for emotional intelligence in large language models☆422Jul 26, 2024Updated last year
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆37Jul 6, 2023Updated 2 years ago
- Self-Supervised Alignment with Mutual Information☆20May 24, 2024Updated last year
- Structured output benchmarks comparing DSPy and BAML with different LLMs☆28Dec 23, 2025Updated 3 months ago
- Prototype your Jupyter Widget in the browser with anywidget and JupyterLite 💡☆17Apr 7, 2025Updated last year
- ☆33Jul 8, 2024Updated last year
- Genetics for Language Models☆17Jul 1, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆56Nov 6, 2024Updated last year
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆61Apr 8, 2024Updated 2 years ago
- [CoRL22] Motion Style Transfer: Modular Low-Rank Adaptation for Deep Motion Forecasting☆22Dec 6, 2022Updated 3 years ago
- Just some nice dice in Python☆21Jan 6, 2026Updated 3 months ago
- ☆72May 22, 2023Updated 2 years ago
- A fork of sqlite-utils with CLI etc removed☆17Apr 6, 2026Updated last week
- Structured Generation Evals☆14Sep 25, 2024Updated last year
- A fun PGM experience☆15May 19, 2025Updated 10 months ago
- Local emulator for Hugging Face Inference Endpoints customer handlers☆27Apr 3, 2026Updated last week
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The git repository of Modular Prompted Chatbot paper☆35May 24, 2023Updated 2 years ago
- Code and Data Repo for the CoNLL Paper -- Future Lens: Anticipating Subsequent Tokens from a Single Hidden State☆21Oct 24, 2025Updated 5 months ago
- Code for the NeurIPS 2020 paper "Improved analysis of clippind algorithms for non-convex optimization", including various clipping algori…☆10Feb 17, 2021Updated 5 years ago
- Markdown + GitHub -> Blog☆13Dec 16, 2023Updated 2 years ago
- The repository of CLEME (EMNLP 2023) and CLEME2.0 (ACL 2025)☆12May 17, 2025Updated 10 months ago
- gpt-3.5-turbo-instruct, prompted with PGN, vs Stockfish Level 4 on LiChess☆15Sep 19, 2023Updated 2 years ago
- Sotopia-RL: Reward Design for Social Intelligence☆49Apr 1, 2026Updated 2 weeks ago