MiuLab / LLM-Eval
☆12Updated last year
Alternatives and similar repositories for LLM-Eval:
Users that are interested in LLM-Eval are comparing it to the libraries listed below
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆54Updated 5 months ago
- ☆47Updated 5 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆48Updated 7 months ago
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year
- Codebase accompanying the Summary of a Haystack paper.☆74Updated 5 months ago
- ☆45Updated 4 months ago
- ☆48Updated 3 months ago
- Utils for Unsloth☆44Updated this week
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆94Updated last year
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆158Updated this week
- ☆40Updated 9 months ago
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)☆29Updated 11 months ago
- The first dense retrieval model that can be prompted like an LM☆64Updated 5 months ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆35Updated 9 months ago
- Submodule of evalverse forked from [google-research/instruction_following_eval](https://github.com/google-research/google-research/tree/m…☆13Updated 9 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆61Updated 2 months ago
- ☆87Updated last year
- From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients. Ajay Jaiswal, Lu Yin, Zhenyu Zhang, Shiwei Liu,…☆42Updated 7 months ago
- Codebase for Instruction Following without Instruction Tuning☆33Updated 4 months ago
- An NVIDIA AI Workbench Example Project for Finetuning Llama 2☆28Updated 5 months ago
- ☆59Updated 2 weeks ago
- FuseAI Project☆83Updated 3 weeks ago
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…☆53Updated last week
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆52Updated 4 months ago
- Evaluating LLMs with CommonGen-Lite☆88Updated 11 months ago
- ☆83Updated 4 months ago
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L…☆42Updated last year
- ☆37Updated 6 months ago
- Train, tune, and infer Bamba model☆84Updated last month