IBM / text-generation-inference
IBM development fork of https://github.com/huggingface/text-generation-inference
☆60Updated 3 months ago
Alternatives and similar repositories for text-generation-inference:
Users that are interested in text-generation-inference are comparing it to the libraries listed below
- 🚀 Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.☆40Updated this week
- Using open source LLMs to build synthetic datasets for direct preference optimization☆59Updated last year
- Benchmark suite for LLMs from Fireworks.ai☆70Updated 2 months ago
- ☆48Updated 5 months ago
- ☆66Updated 10 months ago
- Dolomite Engine is a library for pretraining/finetuning LLMs☆47Updated this week
- Data preparation code for Amber 7B LLM☆87Updated 11 months ago
- Inference server benchmarking tool☆49Updated 2 weeks ago
- A toolkit for fine-tuning, inferencing, and evaluating GreenBitAI's LLMs.☆82Updated last month
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆36Updated 11 months ago
- vLLM adapter for a TGIS-compatible gRPC server.☆26Updated this week
- experiments with inference on llama☆104Updated 10 months ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆73Updated 5 months ago
- Train, tune, and infer Bamba model☆88Updated 3 months ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated 11 months ago
- ☆113Updated last week
- ☆57Updated 2 weeks ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆34Updated this week
- ☆190Updated 2 weeks ago
- SGLang is fast serving framework for large language models and vision language models.☆22Updated 2 months ago
- ☆53Updated 10 months ago
- ☆33Updated 9 months ago
- Python library for Synthetic Data Generation☆40Updated this week
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31Updated 10 months ago
- Pre-training code for CrystalCoder 7B LLM☆54Updated 11 months ago
- Google TPU optimizations for transformers models☆107Updated 2 months ago
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.☆120Updated last year
- ☆50Updated 5 months ago
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research☆156Updated this week
- Large Language Model Text Generation Inference on Habana Gaudi☆32Updated 3 weeks ago