huggingface / huggingface-llama-recipesLinks
β668Updated last month
Alternatives and similar repositories for huggingface-llama-recipes
Users that are interested in huggingface-llama-recipes are comparing it to the libraries listed below
Sorting:
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backendsβ1,629Updated this week
- A reading list on LLM based Synthetic Data Generation π₯β1,306Updated 2 weeks ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifiβ¦β2,757Updated last week
- Recipes for shrinking, optimizing, customizing cutting edge vision models. πβ1,495Updated 2 weeks ago
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard aβ¦β1,434Updated 5 months ago
- Build datasets using natural languageβ492Updated last month
- β520Updated 7 months ago
- Tool for generating high quality Synthetic datasetsβ948Updated last week
- π€ Benchmark Large Language Models Reliably On Your Dataβ329Updated this week
- An Open Source Toolkit For LLM Distillationβ651Updated 3 weeks ago
- Bringing BERT into modernity via both architecture changes and scalingβ1,410Updated this week
- Automatic evals for LLMsβ429Updated 2 weeks ago
- Official inference library for pre-processing of Mistral modelsβ742Updated this week
- Stanford NLP Python library for Representation Finetuning (ReFT)β1,490Updated 4 months ago
- Minimalistic large language model 3D-parallelism trainingβ1,926Updated last week
- Evaluate your LLM's response with Prometheus and GPT4 π―β954Updated last month
- Automatically evaluate your LLMs in Google Colabβ641Updated last year
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.β2,411Updated 2 weeks ago
- Scalable data pre processing and curation toolkit for LLMsβ949Updated this week
- β1,025Updated 6 months ago
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.β797Updated 4 months ago
- Synthetic data curation for post-training and structured data extractionβ1,404Updated this week
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. π¨π»βπ³β313Updated 2 weeks ago
- Easily embed, cluster and semantically label text datasetsβ549Updated last year
- Late Interaction Models Training & Retrievalβ444Updated last week
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modelingβ881Updated last month
- This project showcases an LLMOps pipeline that fine-tunes a small-size LLM model to prepare for the outage of the service LLM.β306Updated 2 months ago
- Let's build better datasets, together!β259Updated 6 months ago
- β900Updated 9 months ago
- Recipes to scale inference-time compute of open modelsβ1,095Updated last month