IBM / text-generation-inference
IBM development fork of https://github.com/huggingface/text-generation-inference
☆60Updated 3 months ago
Alternatives and similar repositories for text-generation-inference:
Users that are interested in text-generation-inference are comparing it to the libraries listed below
- Benchmark suite for LLMs from Fireworks.ai☆70Updated last month
- ☆174Updated this week
- 🚀 Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.☆38Updated this week
- Inference server benchmarking tool☆38Updated this week
- Dolomite Engine is a library for pretraining/finetuning LLMs☆46Updated this week
- vLLM adapter for a TGIS-compatible gRPC server.☆25Updated this week
- Python library for Synthetic Data Generation☆35Updated this week
- ☆24Updated 6 months ago
- ☆49Updated 4 months ago
- ☆55Updated 2 months ago
- experiments with inference on llama☆104Updated 9 months ago
- ☆27Updated 4 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆59Updated last year
- A collection of all available inference solutions for the LLMs☆82Updated last month
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 3 months ago
- Large Language Model Text Generation Inference on Habana Gaudi☆32Updated last week
- Cray-LM unified training and inference stack.☆21Updated 2 months ago
- Train, tune, and infer Bamba model☆87Updated 2 months ago
- A pipeline for LLM knowledge distillation☆99Updated this week
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research☆151Updated this week
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs☆236Updated this week
- ☆34Updated 8 months ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆35Updated 11 months ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆73Updated 5 months ago
- Google TPU optimizations for transformers models☆104Updated 2 months ago
- ☆66Updated 10 months ago
- ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs)☆52Updated this week
- Docker image NVIDIA GH200 machines - optimized for vllm serving and hf trainer finetuning☆38Updated last month
- Data preparation code for Amber 7B LLM☆86Updated 10 months ago
- codebase release for EMNLP2023 paper publication☆19Updated last year