IBM / text-generation-inference
IBM development fork of https://github.com/huggingface/text-generation-inference
☆61 Updated 3 months ago
Alternatives and similar repositories for text-generation-inference
Users who are interested in text-generation-inference are comparing it to the libraries listed below.
- Benchmark suite for LLMs from Fireworks.ai ☆79 Updated 3 weeks ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆266 Updated 10 months ago
- vLLM adapter for a TGIS-compatible gRPC server. ☆35 Updated this week
- LM engine is a library for pretraining/finetuning LLMs ☆63 Updated this week
- experiments with inference on llama ☆104 Updated last year
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models. ☆139 Updated last year
- ☆53 Updated 2 months ago
- ☆63 Updated 4 months ago
- ☆199 Updated last year
- Train, tune, and infer Bamba model ☆131 Updated 2 months ago
- Python library for Synthetic Data Generation ☆42 Updated this week
- Data preparation code for Amber 7B LLM ☆91 Updated last year
- ☆238 Updated last week
- 👷 Build compute kernels ☆106 Updated last week
- ☆133 Updated last week
- 🚀 Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP. ☆47 Updated last week
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs ☆88 Updated this week
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research ☆222 Updated this week
- InstructLab Training Library - Efficient Fine-Tuning with Message-Format Data ☆41 Updated this week
- 🦄 Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data … ☆206 Updated this week
- Inference server benchmarking tool ☆93 Updated 3 months ago
- Google TPU optimizations for transformers models ☆118 Updated 7 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers. ☆33 Updated 3 months ago
- ☆15 Updated this week
- ☆55 Updated 9 months ago
- ☆66 Updated last year
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera… ☆87 Updated last week
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models. ☆83 Updated 2 weeks ago
- A collection of reproducible inference engine benchmarks ☆32 Updated 4 months ago
- Dynamic batching library for Deep Learning inference. Tutorials for LLM, GPT scenarios. ☆102 Updated last year