huggingface / dedupe_estimatorLinks
Chunk Dedupe Estimation
☆18Updated 11 months ago
Alternatives and similar repositories for dedupe_estimator
Users that are interested in dedupe_estimator are comparing it to the libraries listed below
Sorting:
- Rust crates for XetHub☆69Updated 11 months ago
- Smart reproducible analytical pipeline inspection☆20Updated last week
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg…☆23Updated 7 months ago
- ☆12Updated last year
- Datasette enrichment for analyzing row data using OpenAI's GPT models☆21Updated last year
- tsellm: LLMs in SQLite and DuckDB☆24Updated 5 months ago
- Vector Database with support for late interaction and token level embeddings.☆55Updated 3 months ago
- ☆12Updated last year
- See how HTTPX, Requests, and AIOHTTP libraries compare for sending network requests and find out which one may fit your case better.☆19Updated 2 weeks ago
- First token cutoff sampling inference example☆30Updated last year
- FalkorDB-Browser is a visualization UI for FalkorDB.☆56Updated this week
- Like grep but with natural language queries☆50Updated last year
- Your buddy in the (L)LM space.☆64Updated last year
- Granite 3.1 Language Models☆127Updated 3 months ago
- Blueprint by Mozilla.ai for answering questions about structured documents☆36Updated 7 months ago
- ☆39Updated last week
- Embedding models from Jina AI☆65Updated last year
- Run models distributed as GGUF files using LLM☆76Updated 10 months ago
- Python SDK for XetHub☆59Updated 11 months ago
- This is an opensource project allowing you to compare two LLM's head to head with a given prompt, it has a wide range of supported models…☆23Updated 6 months ago
- A simple github actions script to build a llamafile and uploads to huggingface☆15Updated last year
- Modular, open source LLMOps stack that separates concerns: LiteLLM unifies LLM APIs, manages routing and cost controls, and ensures high-…☆118Updated 7 months ago
- ColBERT for live vector indexes☆28Updated 11 months ago
- Radio is a DuckDB extension by Query.Farm that brings real-time event streams into your SQL workflows. It enables DuckDB to receive and s…☆30Updated this week
- An open source MCP proxy.☆15Updated 9 months ago
- ☆30Updated 6 months ago
- AirLLM 70B inference with single 4GB GPU☆14Updated 3 months ago
- Rats is a collection of tools to help researchers define and run experiments. It is designed to be a modular and extensible framework cur…☆26Updated last week
- Tree-based indexes for neural-search☆32Updated last year
- Datasette plugin for searching all searchable tables at once☆25Updated last year