huggingface / dedupe_estimator
Chunk Dedupe Estimation
☆14Updated 5 months ago
Alternatives and similar repositories for dedupe_estimator:
Users that are interested in dedupe_estimator are comparing it to the libraries listed below
- ☆13Updated last year
- Smart reproducible analytical pipeline inspection☆12Updated this week
- Rust crates for XetHub☆42Updated 6 months ago
- Generate Python docstrings automatically with LLM and syntax trees☆14Updated this week
- Feature selection for tabular datasets using advanced filter and wrapper methods☆17Updated last month
- xet client tech, used in huggingface_hub☆86Updated this week
- Adding Marimo to Datasette☆20Updated last month
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg…☆22Updated last month
- An open source MCP proxy.☆8Updated 3 months ago
- Hybrid Search (BM25 & Vector) with SQLite☆15Updated 8 months ago
- The (B)ig (F)unction (T)axonomy is a detailed reference for common compute functions executed by different libraries, databases, and tool…☆16Updated 4 months ago
- Runner in charge of collecting metrics from LLM inference endpoints for the Unify Hub☆17Updated last year
- A text-to-SQL prototype on the northwind sqlite dataset☆12Updated 7 months ago
- LLM plugin for asking questions of LLM's own documentation, and related packages☆14Updated last week
- A simple github actions script to build a llamafile and uploads to huggingface☆14Updated last year
- tsellm: LLMs in SQLite and DuckDB☆23Updated 8 months ago
- ☆15Updated 3 weeks ago
- A Python framework for building AI agent systems with robust task management in the form of a graph execution engine, inference capabilit…☆23Updated this week
- FalkorDB-Browser is a visualization UI for FalkorDB.☆30Updated this week
- This repository contains statistics about the AI Infrastructure products.☆18Updated last month
- Visualize expert firing frequencies across sentences in the Mixtral MoE model☆17Updated last year
- ☆17Updated 10 months ago
- ☆26Updated last week
- A Python Client for Hive Metastore☆12Updated last year
- A Model Context Protocol (MCP) server for interacting with Kong Konnect APIs, allowing AI assistants to query and analyze Kong Gateway co…☆23Updated 2 weeks ago
- First token cutoff sampling inference example☆30Updated last year
- Voyage AI Official Python Library☆58Updated 4 months ago
- Datasette enrichment for analyzing row data using OpenAI's GPT models☆19Updated 11 months ago
- A fork of llama3.c used to do some R&D on inferencing☆21Updated 4 months ago
- A framework for simulating e-commerce data and interactions that can be used to build recommendation systems☆10Updated last year