Understanding the correlation between different LLM benchmarks
☆29 Jan 11, 2024 Updated 2 years ago
Alternatives and similar repositories for understanding_llm_benchmarks
Users who are interested in understanding_llm_benchmarks are comparing it to the libraries listed below
- A minimalistic tool to fine-tune your LLMs ☆18 Aug 17, 2023 Updated 2 years ago
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details. ☆14 Mar 20, 2024 Updated last year
- This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data" ☆17 Feb 22, 2024 Updated 2 years ago
- A Flutter plugin for integrating Liquid AI's LEAP SDK, enabling on-device deployment of small language models in Flutter applications. ☆23 Sep 3, 2025 Updated 6 months ago
- ☆27 Aug 30, 2023 Updated 2 years ago
- Adversarially Robust Generalization Just Requires More Unlabeled Data ☆11 Aug 8, 2019 Updated 6 years ago
- ☆14 Jan 10, 2024 Updated 2 years ago
- ☆67 Mar 4, 2024 Updated 2 years ago
- A framework for few-shot evaluation of autoregressive language models. ☆16 Aug 23, 2023 Updated 2 years ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging. ☆37 Oct 9, 2025 Updated 4 months ago
- Set of scripts to finetune LLMs ☆38 Mar 30, 2024 Updated last year
- Introducing Filtered Direct Preference Optimization (fDPO) that enhances language model alignment with human preferences by discarding lo… ☆16 Nov 27, 2024 Updated last year
- ☆44 Jan 24, 2024 Updated 2 years ago
- Layout Analysis Dataset with Segmonto (LADaS) ☆24 Jul 12, 2025 Updated 7 months ago
- [CoRL22] Motion Style Transfer: Modular Low-Rank Adaptation for Deep Motion Forecasting ☆22 Dec 6, 2022 Updated 3 years ago
- Self-Supervised Alignment with Mutual Information ☆20 May 24, 2024 Updated last year
- Code and Data Repo for the CoNLL Paper -- Future Lens: Anticipating Subsequent Tokens from a Single Hidden State ☆20 Oct 24, 2025 Updated 4 months ago
- ☆32 Jan 1, 2024 Updated 2 years ago
- ☆33 Jul 8, 2024 Updated last year
- Source code of "Reasons to Reject? Aligning Language Models with Judgments" ☆58 Feb 29, 2024 Updated 2 years ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM ☆61 Apr 8, 2024 Updated last year
- Multi-Target Embodied Question Answering ☆26 Jul 17, 2020 Updated 5 years ago
- ☆64 Dec 19, 2025 Updated 2 months ago
- Dataset Reset Policy Optimization ☆31 Apr 12, 2024 Updated last year
- ☆24 Nov 10, 2020 Updated 5 years ago
- ☆31 Mar 23, 2024 Updated last year
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization ☆27 Jul 19, 2023 Updated 2 years ago
- CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025) ☆73 Jun 25, 2024 Updated last year
- A lightweight DeepPotentialMD with JAX backend, and more than that! Built for both performance and flexibility in pure Python. ☆34 Jan 22, 2026 Updated last month
- forked from DongZhouGu/arxiv-daily ☆22 Nov 8, 2022 Updated 3 years ago
- canvas-based talking head model using viseme data ☆32 Sep 4, 2023 Updated 2 years ago
- Big Data Analysis of Tinder done at Universitat Rovira i Virgili and Universitat Politècnica de Catalunya · BarcelonaTech ☆13 Jan 3, 2023 Updated 3 years ago
- Sotopia-RL: Reward Design for Social Intelligence ☆46 Jan 29, 2026 Updated last month
- A tool to paste Excel ranges to Reddit ☆11 Sep 20, 2025 Updated 5 months ago
- ☆29 Dec 28, 2025 Updated 2 months ago
- ☆208 Jan 14, 2026 Updated last month
- Multipack distributed sampler for fast padding-free training of LLMs ☆206 Aug 10, 2024 Updated last year
- REST: Retrieval-Based Speculative Decoding, NAACL 2024 ☆214 Sep 11, 2025 Updated 5 months ago
- ☆72 May 22, 2023 Updated 2 years ago