huggingface / doc-builderLinks
The package used to build the documentation of our Hugging Face repos
☆126Updated this week
Alternatives and similar repositories for doc-builder
Users that are interested in doc-builder are comparing it to the libraries listed below
Sorting:
- ☆124Updated 10 months ago
- ☆171Updated 6 months ago
- 🤗 Disaggregators: Curated data labelers for in-depth analysis.☆66Updated 2 years ago
- Google TPU optimizations for transformers models☆120Updated 7 months ago
- [WIP] A 🔥 interface for running code in the cloud☆85Updated 2 years ago
- Let's build better datasets, together!☆262Updated 8 months ago
- ☆199Updated last year
- experiments with inference on llama☆104Updated last year
- Manage scalable open LLM inference endpoints in Slurm clusters☆270Updated last year
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆83Updated this week
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆197Updated last year
- **ARCHIVED** Filesystem interface to 🤗 Hub☆58Updated 2 years ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆143Updated last month
- ☆98Updated 2 months ago
- Developing tools to automatically analyze datasets☆74Updated 10 months ago
- Common Python utilities and GitHub Actions in Lightning Ecosystem☆57Updated last week
- Command Line Interface for Hugging Face Inference Endpoints☆66Updated last year
- ☆19Updated 2 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆93Updated 2 years ago
- ML/DL Math and Method notes☆63Updated last year
- minimal pytorch implementation of bm25 (with sparse tensors)☆104Updated last year
- Pipeline for pulling and processing online language model pretraining data from the web☆177Updated 2 years ago
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.☆159Updated last year
- ☆67Updated 3 years ago
- 🦄 Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data …☆206Updated this week
- Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)☆193Updated last week
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆78Updated 10 months ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆186Updated last month
- An open collection of implementation tips, tricks and resources for training large language models☆478Updated 2 years ago
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆137Updated last year