MiuLab / LLM-EvalLinks
☆15Updated 2 years ago
Alternatives and similar repositories for LLM-Eval
Users that are interested in LLM-Eval are comparing it to the libraries listed below
Sorting:
- Small and Efficient Mathematical Reasoning LLMs☆72Updated last year
- ☆25Updated last month
- ☆55Updated last year
- Training hybrid models for dummies.☆29Updated last month
- Advanced Reasoning Benchmark Dataset for LLMs☆47Updated 2 years ago
- Data preparation code for CrystalCoder 7B LLM☆45Updated last year
- Source code for the collaborative reasoner research project at Meta FAIR.☆111Updated 8 months ago
- Pre-training code for CrystalCoder 7B LLM☆55Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated last year
- Verifiers for LLM Reinforcement Learning☆80Updated 8 months ago
- ☆36Updated 4 months ago
- Data preparation code for Amber 7B LLM☆93Updated last year
- Open Implementations of LLM Analyses☆108Updated last year
- ☆75Updated last year
- Systematic evaluation framework that automatically rates overthinking behavior in large language models.☆94Updated 7 months ago
- Aioli: A unified optimization framework for language model data mixing☆31Updated 11 months ago
- ☆80Updated last month
- ☆88Updated last week
- ☆43Updated last year
- ☆41Updated 6 months ago
- MatFormer repo☆66Updated last year
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆123Updated last year
- Jina VDR is a multilingual, multi-domain benchmark for visual document retrieval☆37Updated 4 months ago
- accompanying material for sleep-time compute paper☆118Updated 7 months ago
- Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"☆33Updated 2 years ago
- ☆68Updated 6 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆241Updated last week
- Repository for the paper Stream of Search: Learning to Search in Language☆152Updated 10 months ago
- Train, tune, and infer Bamba model☆137Updated 6 months ago
- A Practitioner's Guide to M(eow)ti Turn Agentic ReinfOrcement learning☆66Updated 3 weeks ago