IBM / text-generation-inference
IBM development fork of https://github.com/huggingface/text-generation-inference
☆63 Updated 4 months ago
Alternatives and similar repositories for text-generation-inference
Users who are interested in text-generation-inference are comparing it to the libraries listed below.
- Benchmark suite for LLMs from Fireworks.ai ☆89 Updated this week
- vLLM adapter for a TGIS-compatible gRPC server. ☆50 Updated last week
- LM engine is a library for pretraining/finetuning LLMs ☆113 Updated this week
- 🚀 Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP. ☆56 Updated last month
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆267 Updated last month
- ☆278 Updated last week
- Python library for Synthetic Data Generation ☆52 Updated last month
- ☆67 Updated 10 months ago
- experiments with inference on llama ☆103 Updated last year
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs ☆93 Updated this week
- ☆198 Updated last year
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research ☆282 Updated this week
- Data preparation code for Amber 7B LLM ☆94 Updated last year
- Google TPU optimizations for transformers models ☆135 Updated last week
- Pre-training code for CrystalCoder 7B LLM ☆57 Updated last year
- ☆39 Updated 3 years ago
- A collection of all available inference solutions for the LLMs ☆94 Updated 11 months ago
- ☆44 Updated this week
- ☆16 Updated 2 months ago
- Pretrain, finetune and serve LLMs on Intel platforms with Ray ☆131 Updated 4 months ago
- Module, Model, and Tensor Serialization/Deserialization ☆286 Updated 5 months ago
- ☆56 Updated last year
- 🦄 Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data … ☆213 Updated last week
- A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM ☆220 Updated this week
- Dynamic batching library for Deep Learning inference. Tutorials for LLM, GPT scenarios. ☆106 Updated last year
- Easy and Efficient Quantization for Transformers ☆204 Updated last week
- Train, tune, and infer Bamba model ☆138 Updated 7 months ago
- InstructLab Training Library - Efficient Fine-Tuning with Message-Format Data ☆47 Updated last week
- ☆140 Updated 5 months ago
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models. ☆138 Updated last year