huggingface / doc-builder
The package used to build the documentation of our Hugging Face repos
â111Updated this week
Alternatives and similar repositories for doc-builder:
Users that are interested in doc-builder are comparing it to the libraries listed below
- â123Updated 6 months ago
- [WIP] A đĨ interface for running code in the cloudâ85Updated 2 years ago
- Google TPU optimizations for transformers modelsâ109Updated 3 months ago
- â199Updated last year
- Manage scalable open LLM inference endpoints in Slurm clustersâ254Updated 9 months ago
- experiments with inference on llamaâ104Updated 10 months ago
- Pipeline for pulling and processing online language model pretraining data from the webâ177Updated last year
- â169Updated 2 months ago
- đšī¸ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.â136Updated 9 months ago
- đ¤ Disaggregators: Curated data labelers for in-depth analysis.â65Updated 2 years ago
- minimal pytorch implementation of bm25 (with sparse tensors)â101Updated last year
- Let's build better datasets, together!â259Updated 4 months ago
- **ARCHIVED** Filesystem interface to đ¤ Hubâ58Updated 2 years ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.â82Updated last year
- â67Updated 2 years ago
- Common Python utilities and GitHub Actions in Lightning Ecosystemâ56Updated last week
- Datasets collection and preprocessings framework for NLP extreme multitask learningâ180Updated 4 months ago
- â115Updated 3 weeks ago
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.â157Updated last year
- Command Line Interface for Hugging Face Inference Endpointsâ66Updated last year
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 linesâ198Updated 11 months ago
- Load compute kernels from the Hubâ115Updated last week
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.â93Updated 2 years ago
- Fast, Modern, Memory Efficient, and Low Precision PyTorch Optimizersâ92Updated 9 months ago
- â49Updated 2 months ago
- A library for squeakily cleaning and filtering language datasets.â47Updated last year
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching oâĻâ131Updated 4 months ago
- git extension for {collaborative, communal, continual} model developmentâ211Updated 5 months ago
- Python API for https://vespa.ai, the open big data serving engineâ121Updated this week
- Check for data drift between two OpenAI multi-turn chat jsonl files.â37Updated last year