MiuLab / LLM-EvalLinks
☆15Updated 2 years ago
Alternatives and similar repositories for LLM-Eval
Users that are interested in LLM-Eval are comparing it to the libraries listed below
Sorting:
- Train, tune, and infer Bamba model☆134Updated 4 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated last year
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆96Updated last week
- Advanced Reasoning Benchmark Dataset for LLMs☆46Updated last year
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated 10 months ago
- Data preparation code for Amber 7B LLM☆92Updated last year
- ☆55Updated 11 months ago
- Small and Efficient Mathematical Reasoning LLMs☆72Updated last year
- ☆48Updated last year
- Pre-training code for CrystalCoder 7B LLM☆55Updated last year
- ☆35Updated last month
- Nexusflow function call, tool use, and agent benchmarks.☆29Updated 10 months ago
- ☆97Updated last year
- Verifiers for LLM Reinforcement Learning☆74Updated 6 months ago
- Data preparation code for CrystalCoder 7B LLM☆45Updated last year
- Codebase accompanying the Summary of a Haystack paper.☆79Updated last year
- ☆80Updated this week
- Aioli: A unified optimization framework for language model data mixing☆27Updated 9 months ago
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…☆56Updated this week
- ☆43Updated last year
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆80Updated last year
- Library to facilitate pruning of LLMs based on context☆32Updated last year
- A repository for research on medium sized language models.☆78Updated last year
- Submodule of evalverse forked from [google-research/instruction_following_eval](https://github.com/google-research/google-research/tree/m…☆14Updated last year
- ☆40Updated 4 months ago
- ☆72Updated last year
- This is the official repository for Inheritune.☆115Updated 8 months ago
- Evaluating LLMs with fewer examples☆163Updated last year
- ☆77Updated last month
- ☆50Updated last year