huggingface / optimum-furiosa
Accelerated inference of 🤗 models using FuriosaAI NPU chips.
☆26Updated 10 months ago
Alternatives and similar repositories for optimum-furiosa:
Users that are interested in optimum-furiosa are comparing it to the libraries listed below
- **ARCHIVED** Filesystem interface to 🤗 Hub☆58Updated 2 years ago
- ☆17Updated 2 months ago
- [WIP] A 🔥 interface for running code in the cloud☆85Updated 2 years ago
- Evaluate Transformers from the Hub 🔥☆13Updated last year
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated 8 months ago
- Blazing fast training of 🤗 Transformers on Graphcore IPUs☆85Updated last year
- Local emulator for Hugging Face Inference Endpoints customer handlers☆25Updated last year
- ML/DL Math and Method notes☆60Updated last year
- 🎨 Imagine what Picasso could have done with AI. Self-host your StableDiffusion API.☆50Updated last year
- PB-LLM: Partially Binarized Large Language Models☆151Updated last year
- ☆66Updated 10 months ago
- Google TPU optimizations for transformers models☆108Updated 3 months ago
- Load compute kernels from the Hub☆115Updated last week
- Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)☆185Updated this week
- ☆54Updated last year
- ☆26Updated 4 months ago
- ☆39Updated 2 years ago
- Hugging Face's Zapier Integration 🤗⚡️☆48Updated 2 years ago
- A place to store reusable transformer components of my own creation or found on the interwebs☆49Updated last week
- GPTQLoRA: Efficient Finetuning of Quantized LLMs with GPTQ☆103Updated last year
- Python bindings for ggml☆140Updated 7 months ago
- AMD related optimizations for transformer models☆75Updated 5 months ago
- ☆27Updated last year
- implementation of https://arxiv.org/pdf/2312.09299☆20Updated 9 months ago
- 🚀🤗 A collection of templates for Hugging Face Spaces☆35Updated last year
- ☆68Updated 3 weeks ago
- ☆16Updated 2 years ago
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆86Updated this week
- ☆50Updated last year
- ☆22Updated last year