huggingface / dedupe_estimatorLinks

Chunk Dedupe Estimation

☆15

Alternatives and similar repositories for dedupe_estimator

Users that are interested in dedupe_estimator are comparing it to the libraries listed below

Sorting:

xetdata / xet-core
Rust crates for XetHub
☆51Updated 9 months ago
huggingface / candle-cublaslt
☆13Updated last year
aeturrell / smartrappy
Smart reproducible analytical pipeline inspection
☆17Updated 3 months ago
Florents-Tselai / tsellm
tsellm: LLMs in SQLite and DuckDB
☆25Updated 3 months ago
robinvandernoord / uvenv
uvenv: pipx for uv (🦀) on Linux and macOS
☆73Updated last week
lucyknada / detective-needle-llm
☆12Updated 10 months ago
huggingface / xet-core
xet client tech, used in huggingface_hub
☆148Updated last week
skeeto / illume
scriptable command line program for LLM interfacing
☆82Updated 2 weeks ago
dwarvesf / llm-hosting
This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg…
☆23Updated 5 months ago
antirez / LLM-FTC-sampling
First token cutoff sampling inference example
☆30Updated last year
hyehudai / wireduck
Duckdb extension to read pcap files
☆43Updated 3 months ago
oxylabs / httpx-vs-requests-vs-aiohttp
See how HTTPX, Requests, and AIOHTTP libraries compare for sending network requests and find out which one may fit your case better.
☆19Updated last month
rabilrbl / llamafile-builder
A simple github actions script to build a llamafile and uploads to huggingface
☆15Updated last year
tabsdata / tabsdata
A Pub/Sub for Tables based data integration platform, to discover, publish, modify and consume data effortlessly.
☆35Updated 2 weeks ago
grll / open-mcp-proxy
An open source MCP proxy.
☆13Updated 7 months ago
Codys12 / airllm
AirLLM 70B inference with single 4GB GPU
☆14Updated last month
voyage-ai / voyageai-python
Voyage AI Official Python Library
☆66Updated 2 weeks ago
kagisearch / kagiapi
A Python package for Kagi Search API.
☆61Updated 3 months ago
mozilla-ai / lm-buddy
Your buddy in the (L)LM space.
☆64Updated 10 months ago
furiousteabag / vram-calculator
Transformer GPU VRAM estimator
☆66Updated last year
prem-research / prem-operator
📡 Deploy AI models and apps to Kubernetes without developing a hernia
☆32Updated last year
coreweave / ml-containers
☆38Updated this week
loicalleyne / bodkin
Go library for decoding generic map values and native Go structures into Arrow.
☆16Updated this week
friendliai / friendli-client
[⛔️ DEPRECATED] Friendli: the fastest serving engine for generative AI
☆48Updated last month
mathpn / llm-docsmith
Generate Python docstrings automatically with LLM and syntax trees
☆16Updated last month
fsspec / opendalfs
OpenDAL fsspec integration
☆31Updated 2 months ago
cdalar / onctl
🤖 manage virtual machines 🖥️ in multi cloud ☁️
☆48Updated this week
duckdb / duckdb-mysql
☆71Updated last month
ibm-granite / granite-3.1-language-models
Granite 3.1 Language Models
☆117Updated last month
auxten / SQL-On-Everything
Query on Everything with SQL
☆17Updated 9 months ago