IBM / text-generation-inferenceLinks
IBM development fork of https://github.com/huggingface/text-generation-inference
β60Updated 3 weeks ago
Alternatives and similar repositories for text-generation-inference
Users that are interested in text-generation-inference are comparing it to the libraries listed below
Sorting:
- π Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.β44Updated this week
- β214Updated this week
- LM engine is a library for pretraining/finetuning LLMsβ55Updated this week
- Inference server benchmarking toolβ67Updated last month
- Benchmark suite for LLMs from Fireworks.aiβ75Updated 2 weeks ago
- Python library for Synthetic Data Generationβ42Updated this week
- InstructLab Training Library - Efficient Fine-Tuning with Message-Format Dataβ42Updated this week
- β60Updated 2 months ago
- β49Updated 6 months ago
- vLLM adapter for a TGIS-compatible gRPC server.β30Updated this week
- β53Updated 8 months ago
- ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs)β98Updated this week
- β66Updated last year
- β99Updated this week
- Google TPU optimizations for transformers modelsβ112Updated 4 months ago
- Docker image NVIDIA GH200 machines - optimized for vllm serving and hf trainer finetuningβ41Updated 3 months ago
- Pre-training code for CrystalCoder 7B LLMβ54Updated last year
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMsβ86Updated this week
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data generaβ¦β60Updated this week
- β38Updated last month
- Train, tune, and infer Bamba modelβ127Updated last month
- πΉοΈ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.β137Updated 10 months ago
- β121Updated last month
- β52Updated 6 months ago
- SGLang is fast serving framework for large language models and vision language models.β23Updated 3 months ago
- β53Updated last year
- Cray-LM unified training and inference stack.β22Updated 4 months ago
- Module, Model, and Tensor Serialization/Deserializationβ232Updated last week
- Large Language Model Text Generation Inference on Habana Gaudiβ33Updated 2 months ago
- Data preparation code for Amber 7B LLMβ90Updated last year