MiuLab / LLM-Eval
☆14Updated last year
Alternatives and similar repositories for LLM-Eval:
Users that are interested in LLM-Eval are comparing it to the libraries listed below
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆55Updated 7 months ago
- Implementation of "SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models"☆27Updated last month
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆35Updated 11 months ago
- ☆40Updated 10 months ago
- A repository for research on medium sized language models.☆76Updated 10 months ago
- From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients. Ajay Jaiswal, Lu Yin, Zhenyu Zhang, Shiwei Liu,…☆44Updated 8 months ago
- Train, tune, and infer Bamba model☆87Updated 2 months ago
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…☆54Updated last week
- Pytorch implementation for "Compressed Context Memory For Online Language Model Interaction" (ICLR'24)☆55Updated 11 months ago
- ☆74Updated 7 months ago
- ☆47Updated 7 months ago
- ☆19Updated 4 months ago
- Exploring Model Kinship for Merging Large Language Models☆23Updated last month
- ☆15Updated 5 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 8 months ago
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)☆29Updated last year
- ☆24Updated 6 months ago
- Evaluating LLMs with fewer examples☆148Updated 11 months ago
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"☆41Updated last year
- ☆125Updated last year
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆61Updated 8 months ago
- Codebase accompanying the Summary of a Haystack paper.☆76Updated 6 months ago
- (WACV 2025 - Oral) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, H…☆84Updated last month
- ☆54Updated 5 months ago
- Codebase for Instruction Following without Instruction Tuning☆33Updated 6 months ago
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning☆48Updated last month
- Data preparation code for CrystalCoder 7B LLM☆44Updated 10 months ago
- ☆60Updated 11 months ago
- ReBase: Training Task Experts through Retrieval Based Distillation☆28Updated last month
- ☆48Updated 4 months ago