A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.
β32Sep 19, 2025Updated 8 months ago
Alternatives and similar repositories for py-txi
Users that are interested in py-txi are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The backend behind the LLM-Perf Leaderboardβ11May 5, 2024Updated 2 years ago
- π Fine-tune OpenAI models for text classification, question answering, and moreβ17May 1, 2023Updated 3 years ago
- π€ Optimum ONNX: Export your model to ONNX and run inference with ONNX Runtimeβ148Updated this week
- β32Nov 14, 2024Updated last year
- β14Mar 30, 2026Updated 2 months ago
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Implementation of the dilated self attention as described in "LongNet: Scaling Transformers to 1,000,000,000 Tokens"β13Jul 23, 2023Updated 2 years ago
- Are foundation LMs multilingual knowledge bases? (EMNLP 2023)β19Dec 8, 2023Updated 2 years ago
- A RAG that can scale π§π»βπ»β11May 28, 2024Updated 2 years ago
- π€ Collection of examples on how to train, deploy and monitor HuggingFace models in Google Cloud Vertex AIβ23Feb 26, 2024Updated 2 years ago
- π LLM inference optimization simulator, modeling compute-bound prefill and memory-bound decode phases.β14Jul 12, 2025Updated 11 months ago
- Model implementation for the contextual embeddings projectβ47Jun 2, 2025Updated last year
- German dataset for DPR model trainingβ19Jul 21, 2024Updated last year
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numbaβ38Oct 16, 2025Updated 8 months ago
- β48Nov 8, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddingsβ13May 22, 2025Updated last year
- β17Jan 5, 2023Updated 3 years ago
- Manage scalable open LLM inference endpoints in Slurm clustersβ288Jul 11, 2024Updated last year
- A framework for evaluating semantic search across custom datasets, metrics, and embedding backends.β39May 31, 2026Updated 2 weeks ago
- Sparse Embedding Compression for Scalable Retrieval in Recommender Systemsβ36Nov 21, 2025Updated 6 months ago
- Semantically Search Emojis From the Command Line!β13Nov 26, 2023Updated 2 years ago
- Confusion Matrix in Python: plot a pretty confusion matrix (like Matlab) in python using seaborn and matplotlibβ19Nov 19, 2021Updated 4 years ago
- A missing piece of the Python multitask (both threads and processes) API: An extension that supports stateful worker pools & size-aware iβ¦β29Mar 8, 2026Updated 3 months ago
- Efficiently find the best-suited language model (LM) for your NLP taskβ134Jul 26, 2025Updated 10 months ago
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- portable Python ML-powered data botβ25Sep 27, 2024Updated last year
- A framework for few-shot evaluation of autoregressive language models.β13Jul 14, 2025Updated 11 months ago
- Starbucks: Improved Training for 2D Matryoshka Embeddingsβ23Jun 30, 2025Updated 11 months ago
- Visualize expert firing frequencies across sentences in the Mixtral MoE modelβ18Dec 22, 2023Updated 2 years ago
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on taskβ¦β186Sep 23, 2024Updated last year
- [NeurIPS 2024] πΈ GlotCC Dataset and Piplineβ20Apr 6, 2025Updated last year
- A template code for running modular and reproducible experiments in pytorchβ13Sep 3, 2025Updated 9 months ago
- A proposed standard `NOCK` for a Parquet format that supports efficient distributed serialization of multiple kinds of graph technologiesβ21Apr 27, 2026Updated last month
- Accurate word segmentation for hashtags and text, powered by Transformers and Beam Search. A scalable alternative to heuristic splitters β¦β77May 29, 2026Updated 2 weeks ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Python library for automatic training, optimization and comparison of Transformer models on most NLP tasks.β20May 6, 2023Updated 3 years ago
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.β96May 28, 2026Updated 3 weeks ago
- A simple semi-supervised approach for creating huggingface data script loaders and upload to the hub.β11Jun 23, 2024Updated last year
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.β97Feb 9, 2023Updated 3 years ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)β77Oct 19, 2024Updated last year
- This repository contains the code for the publication "Harnessing the Power of Multi-Task Pretraining for Ground-Truth Level Natural Langβ¦β10Oct 26, 2023Updated 2 years ago
- β59Aug 19, 2025Updated 9 months ago