IlyasMoutawwakil / py-txiLinks

A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.

☆33

Alternatives and similar repositories for py-txi

Users that are interested in py-txi are comparing it to the libraries listed below

Sorting:

AnswerDotAI / ModernBERT-Instruct-mini-cookbook
☆49Updated 5 months ago
Knowledgator / FlashDeBERTa
Trully flash implementation of DeBERTa disentangled attention mechanism.
☆62Updated 2 months ago
chainyo / tensorshare
🤝 Trade any tensors over the network
☆30Updated last year
pacman100 / peft-codegen-25
☆23Updated 2 years ago
daniel-furman / sft-demos
Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.
☆77Updated 9 months ago
deshwalmahesh / PHUDGE
Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…
☆49Updated last year
Upaya07 / NeurIPS-llm-efficiency-challenge
Code for NeurIPS LLM Efficiency Challenge
☆59Updated last year
huggingface / huggingface-inference-toolkit
Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.
☆83Updated last week
mrmps / ai-chunker
Chunk your text using gpt4o-mini more accurately
☆44Updated 11 months ago
jxmorris12 / bm25_pt
minimal pytorch implementation of bm25 (with sparse tensors)
☆104Updated last year
davanstrien / data-for-fine-tuning-llms
☆77Updated last year
ChrisHayduk / qlora-multi-gpu
QLoRA with Enhanced Multi GPU Support
☆37Updated last year
davanstrien / haiku-dpo
Using open source LLMs to build synthetic datasets for direct preference optimization
☆65Updated last year
PrithivirajDamodaran / SPLADERunner
Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…
☆32Updated 11 months ago
hamelsmu / llama-inference
experiments with inference on llama
☆104Updated last year
PrithivirajDamodaran / blitz-embed
C++ inference wrappers for running blazing fast embedding services on your favourite serverless like AWS Lambda. By Prithivi Da, PRs welc…
☆22Updated last year
MinishLab / tokenlearn
Pre-train Static Word Embeddings
☆84Updated 2 months ago
argilla-io / distilabel-spin-dibt
Repository containing the SPIN experiments on the DIBT 10k ranked prompts
☆24Updated last year
nbroad1881 / strideformer
Using short models to classify long texts
☆21Updated 2 years ago
hamelsmu / ft-drift
Check for data drift between two OpenAI multi-turn chat jsonl files.
☆37Updated last year
premAI-io / benchmarks
🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.
☆137Updated last year
rwightman / genalog
Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…
☆42Updated last year
s-smits / modernbert-finetune
Fine-tune ModernBERT on a large Dataset with Custom Tokenizer Training
☆67Updated 5 months ago
unicamp-dl / InRanker
☆48Updated last year
TIGER-AI-Lab / StructLM
Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)
☆75Updated 9 months ago
illuin-tech / contextual-embeddings
Model implementation for the contextual embeddings project
☆35Updated 2 months ago
louisbrulenaudet / ragoon
High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡
☆66Updated 8 months ago
IlyasMoutawwakil / llm-perf-backend
The backend behind the LLM-Perf Leaderboard
☆10Updated last year
huggingface / competitions
☆124Updated 9 months ago
HITsz-TMG / KaLM-Embedding
Code for KaLM-Embedding models
☆86Updated last month