huggingface / dedupe_estimatorLinks
Chunk Dedupe Estimation
☆20Updated last year
Alternatives and similar repositories for dedupe_estimator
Users that are interested in dedupe_estimator are comparing it to the libraries listed below
Sorting:
- Rust crates for XetHub☆75Updated last year
- ☆12Updated last year
- Smart reproducible analytical pipeline inspection☆21Updated last month
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg…☆26Updated 9 months ago
- Radio is a DuckDB extension by Query.Farm that brings real-time event streams into your SQL workflows. It enables DuckDB to receive and s…☆34Updated 2 weeks ago
- ColBERT for live vector indexes☆28Updated last year
- tsellm: LLMs in SQLite and DuckDB☆24Updated 7 months ago
- FalkorDB-Browser is a visualization UI for FalkorDB.☆78Updated this week
- Like grep but with natural language queries☆50Updated last year
- Blueprint by Mozilla.ai for answering questions about structured documents☆36Updated 9 months ago
- Your buddy in the (L)LM space.☆64Updated last year
- Datasette enrichment for analyzing row data using OpenAI's GPT models☆21Updated last year
- See how HTTPX, Requests, and AIOHTTP libraries compare for sending network requests and find out which one may fit your case better.☆20Updated 2 months ago
- Embedding models from Jina AI☆65Updated last year
- Framework-Agnostic RL Environments for LLM Fine-Tuning☆39Updated 3 weeks ago
- An open source MCP proxy.☆16Updated 11 months ago
- First token cutoff sampling inference example☆31Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated last year
- Code and data for the Walert large language model-based chatbot☆12Updated 4 months ago
- Modular, open source LLMOps stack that separates concerns: LiteLLM unifies LLM APIs, manages routing and cost controls, and ensures high-…☆128Updated 10 months ago
- Chrome Extension for exploring Hugging Face datasets 🔎☆49Updated last year
- ☆21Updated last year
- Python module for running GPTScript☆13Updated this week
- A simple github actions script to build a llamafile and uploads to huggingface☆15Updated last year
- Python SDK for XetHub☆60Updated last year
- 🛠 Self-hosted, fast, and consistent remote configuration for apps.☆16Updated 3 years ago
- Hybrid Search (BM25 & Vector) with SQLite☆24Updated last year
- Visualize expert firing frequencies across sentences in the Mixtral MoE model☆18Updated last year
- PuppyGraph standalone web server for visualize graph queries.☆45Updated 9 months ago
- Datasette plugin for searching all searchable tables at once☆27Updated last month