huggingface / api-inference-communityLinks
☆171Updated 4 months ago
Alternatives and similar repositories for api-inference-community
Users that are interested in api-inference-community are comparing it to the libraries listed below
Sorting:
- The package used to build the documentation of our Hugging Face repos☆117Updated last week
- manage histories of LLM applied applications☆90Updated last year
- Hugging Face's Zapier Integration 🤗⚡️☆47Updated 2 years ago
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆168Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs☆78Updated last year
- Exploring finetuning public checkpoints on filter 8K sequences on Pile☆115Updated 2 years ago
- [WIP] A 🔥 interface for running code in the cloud☆85Updated 2 years ago
- A Multilingual Dataset for Parsing Realistic Task-Oriented Dialogs☆114Updated 2 years ago
- ☆199Updated last year
- Unofficial python bindings for the rust llm library. 🐍❤️🦀☆75Updated last year
- Accelerated inference of 🤗 models using FuriosaAI NPU chips.☆26Updated 2 weeks ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆82Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆77Updated 8 months ago
- Use OpenAI with HuggingChat by emulating the text_generation_inference_server☆44Updated 2 years ago
- Reimplementation of the task generation part from the Alpaca paper☆119Updated 2 years ago
- Google TPU optimizations for transformers models☆113Updated 5 months ago
- ☆124Updated 7 months ago
- HuggingChat like UI in Gradio☆71Updated 2 years ago
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…☆147Updated last year
- An OpenAI Completions API compatible server for NLP transformers models☆65Updated last year
- ☆130Updated 3 years ago
- ☆141Updated last year
- Experiments with generating opensource language model assistants☆97Updated 2 years ago
- Techniques used to run BLOOM at inference in parallel☆37Updated 2 years ago
- ☆84Updated last year
- The data processing pipeline for the Koala chatbot language model☆117Updated 2 years ago
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA☆103Updated last month
- Patch for MPT-7B which allows using and training a LoRA☆58Updated 2 years ago
- Command Line Interface for Hugging Face Inference Endpoints☆66Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆33Updated last month