bentoml / IF-multi-GPUs-demo
☆12Updated last year
Alternatives and similar repositories for IF-multi-GPUs-demo:
Users that are interested in IF-multi-GPUs-demo are comparing it to the libraries listed below
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated 8 months ago
- implementation of https://arxiv.org/pdf/2312.09299☆20Updated 9 months ago
- API serving for your diffusers models☆11Updated last year
- ☆32Updated last year
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated last year
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆68Updated this week
- ☆21Updated this week
- [WIP] Transformer to embed Danbooru labelsets☆13Updated last year
- Generate interleaved text and image content in a structured format you can directly pass to downstream APIs.☆27Updated 6 months ago
- ☆31Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- 🎨 Imagine what Picasso could have done with AI. Self-host your StableDiffusion API.☆50Updated last year
- ☆17Updated 2 months ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Updated last year
- Local emulator for Hugging Face Inference Endpoints customer handlers☆25Updated last year
- ☆29Updated last year
- Using modal.com to process FineWeb-edu data☆20Updated last week
- Recaption large (Web)Datasets with vllm and save the artifacts.☆50Updated 4 months ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆57Updated last year
- Gradio UI for a Cog API☆67Updated last year
- This repository shows how to use Q8 kernels with `diffusers` to optimize inference of LTX-Video on ADA GPUs.☆17Updated 3 months ago
- Modified Beam Search with periodical restart☆12Updated 7 months ago
- ☆37Updated last year
- 🦖 X—LLM: Simple & Cutting Edge LLM Finetuning☆11Updated last year
- Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"☆16Updated 5 months ago
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)☆32Updated last month
- ☆20Updated 10 months ago
- ☆30Updated last year
- Retrieve the source code for any model made available on replicate.com!☆34Updated last year
- Cog wrapper for collabora/WhisperSpeech☆25Updated last year