huggingface / huggingface-llama-recipes
β622Updated 3 months ago
Alternatives and similar repositories for huggingface-llama-recipes:
Users that are interested in huggingface-llama-recipes are comparing it to the libraries listed below
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard aβ¦β1,084Updated 2 months ago
- A reading list on LLM based Synthetic Data Generation π₯β1,211Updated last month
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backendsβ1,313Updated this week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifiβ¦β2,568Updated last week
- Recipes for shrinking, optimizing, customizing cutting edge vision models. πβ1,316Updated last week
- An Open Source Toolkit For LLM Distillationβ544Updated 2 months ago
- β502Updated 4 months ago
- awesome synthetic (text) datasetsβ265Updated 4 months ago
- Evaluate your LLM's response with Prometheus and GPT4 π―β885Updated last week
- Automatic evals for LLMsβ340Updated this week
- Automatically evaluate your LLMs in Google Colabβ603Updated 10 months ago
- Stanford NLP Python library for Representation Finetuning (ReFT)β1,445Updated last month
- β1,100Updated 3 weeks ago
- Bringing BERT into modernity via both architecture changes and scalingβ1,283Updated this week
- β1,567Updated last week
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.β1,345Updated this week
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. π¨π»βπ³β261Updated 3 months ago
- β694Updated this week
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.β2,312Updated this week
- Recipes to scale inference-time compute of open modelsβ1,044Updated 3 weeks ago
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.β753Updated last month
- β1,011Updated 3 months ago
- Synthetic data curation for post-training and structured data extractionβ1,049Updated this week
- MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineeringβ652Updated 2 months ago
- A comprehensive repository of reasoning tasks for LLMs (and beyond)β423Updated 5 months ago
- Official repository for ORPOβ445Updated 9 months ago
- Automated Evaluation of RAG Systemsβ562Updated 4 months ago
- Let's build better datasets, together!β257Updated 3 months ago
- [ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuningβ349Updated 6 months ago
- Easily embed, cluster and semantically label text datasetsβ516Updated 11 months ago