IBM / text-generation-inferenceLinks
IBM development fork of https://github.com/huggingface/text-generation-inference
☆61Updated 4 months ago
Alternatives and similar repositories for text-generation-inference
Users that are interested in text-generation-inference are comparing it to the libraries listed below
Sorting:
- Benchmark suite for LLMs from Fireworks.ai☆83Updated last week
- vLLM adapter for a TGIS-compatible gRPC server.☆39Updated last week
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆89Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆266Updated 11 months ago
- Inference server benchmarking tool☆98Updated 4 months ago
- 🚀 Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.☆47Updated last week
- ☆57Updated 3 months ago
- experiments with inference on llama☆104Updated last year
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆137Updated last year
- ☆239Updated last week
- 🦄 Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data …☆208Updated this week
- ☆199Updated last year
- LM engine is a library for pretraining/finetuning LLMs☆65Updated this week
- Train, tune, and infer Bamba model☆131Updated 3 months ago
- Data preparation code for Amber 7B LLM☆91Updated last year
- 👷 Build compute kernels☆136Updated this week
- Google TPU optimizations for transformers models☆120Updated 7 months ago
- ☆63Updated 5 months ago
- ☆15Updated this week
- ☆54Updated 10 months ago
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research☆224Updated last week
- A collection of all available inference solutions for the LLMs☆91Updated 6 months ago
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆84Updated 2 weeks ago
- Pre-training code for CrystalCoder 7B LLM☆55Updated last year
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry☆42Updated last year
- ☆39Updated 2 years ago
- Easy and Efficient Quantization for Transformers☆203Updated 2 months ago
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆88Updated 3 months ago
- ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs)☆208Updated last week
- ☆51Updated last year