huggingface / api-inference-community
☆172 Updated 11 months ago
Alternatives and similar repositories for api-inference-community
Users who are interested in api-inference-community are comparing it to the libraries listed below.
- The package used to build the documentation of our Hugging Face repos ☆135 Updated this week
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app… ☆169 Updated 2 years ago
- [WIP] A 🔥 interface for running code in the cloud ☆86 Updated 2 years ago
- ☆198 Updated last year
- Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub ☆161 Updated 2 years ago
- Hugging Face's Zapier Integration 🤗⚡️ ☆49 Updated 2 years ago
- ☆142 Updated 2 years ago
- QLoRA with Enhanced Multi GPU Support ☆37 Updated 2 years ago
- manage histories of LLM applied applications ☆91 Updated 2 years ago
- Drop in replacement for OpenAI, but with Open models. ☆154 Updated 2 years ago
- Use OpenAI with HuggingChat by emulating the text_generation_inference_server ☆44 Updated 2 years ago
- HuggingChat like UI in Gradio ☆70 Updated 2 years ago
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA ☆104 Updated 7 months ago
- experiments with inference on llama ☆103 Updated last year
- ☆85 Updated 2 years ago
- 🎨 Imagine what Picasso could have done with AI. Self-host your StableDiffusion API. ☆50 Updated 2 years ago
- GitHub action that'll sync files from a GitHub Repo with the Hugging Face Hub 🤗 ☆78 Updated last year
- Reimplementation of the task generation part from the Alpaca paper ☆119 Updated 2 years ago
- Experiments with generating opensource language model assistants ☆97 Updated 2 years ago
- Unofficial python bindings for the rust llm library. 🐍❤️🦀 ☆76 Updated 2 years ago
- **ARCHIVED** Filesystem interface to 🤗 Hub ☆58 Updated 2 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs ☆79 Updated last year
- Tune MPTs ☆84 Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆53 Updated 2 years ago
- ☆125 Updated last year
- Exploring finetuning public checkpoints on filter 8K sequences on Pile ☆116 Updated 2 years ago
- inference code for mixtral-8x7b-32kseqlen ☆105 Updated 2 years ago
- Google TPU optimizations for transformers models ☆132 Updated last month
- DiffusionWithAutoscaler ☆29 Updated last year
- A Multilingual Dataset for Parsing Realistic Task-Oriented Dialogs ☆115 Updated 2 years ago