IBM / text-generation-inferenceLinks
IBM development fork of https://github.com/huggingface/text-generation-inference
â63Updated 4 months ago
Alternatives and similar repositories for text-generation-inference
Users that are interested in text-generation-inference are comparing it to the libraries listed below
Sorting:
- Benchmark suite for LLMs from Fireworks.aiâ89Updated this week
- đ Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.â56Updated last week
- vLLM adapter for a TGIS-compatible gRPC server.â50Updated last week
- LM engine is a library for pretraining/finetuning LLMsâ113Updated this week
- â67Updated 10 months ago
- experiments with inference on llamaâ103Updated last year
- â16Updated 2 months ago
- â76Updated 7 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMsâ267Updated 2 months ago
- Pretrain, finetune and serve LLMs on Intel platforms with Rayâ130Updated 4 months ago
- đšī¸ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.â138Updated last year
- Python library for Synthetic Data Generationâ52Updated last month
- â280Updated this week
- â198Updated 2 years ago
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMsâ93Updated this week
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.â32Updated 4 months ago
- â56Updated last year
- SGLang is fast serving framework for large language models and vision language models.â32Updated 2 months ago
- Accelerating your LLM training to full speed! Made with â¤ī¸ by ServiceNow Researchâ287Updated this week
- Google TPU optimizations for transformers modelsâ134Updated 2 weeks ago
- â44Updated this week
- OpenAI compatible API for TensorRT LLM triton backendâ220Updated last year
- A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLMâ228Updated this week
- A collection of reproducible inference engine benchmarksâ38Updated 9 months ago
- đĻ Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data âĻâ212Updated 2 weeks ago
- Train, tune, and infer Bamba modelâ137Updated 8 months ago
- A collection of all available inference solutions for the LLMsâ94Updated 11 months ago
- Large Language Model Text Generation Inference on Habana Gaudiâ34Updated 10 months ago
- InstructLab Training Library - Efficient Fine-Tuning with Message-Format Dataâ49Updated this week
- Lightweight demos for finetuning LLMs. Powered by đ¤ transformers and open-source datasets.â77Updated last year