SnowPilotOrg / dedupe_it
Simple fuzzy deduplication
☆24Updated 2 months ago
Alternatives and similar repositories for dedupe_it:
Users that are interested in dedupe_it are comparing it to the libraries listed below
- A Model Context Protocol (MCP) server implementation for DuckDB, providing database interaction capabilities☆36Updated 2 weeks ago
- Documentation for the Krixik Python client.☆38Updated 2 months ago
- Multi-model transactional embedded database☆67Updated last month
- Chat strategies for LLMs☆92Updated 5 months ago
- Mark web pages for use with vision-language models☆20Updated last week
- A Chrome extension to extract any data from any website☆47Updated 3 months ago
- Extremely memory-efficient vector database☆61Updated 4 months ago
- Migrate vector workloads to Postgres☆34Updated last month
- An experimental project to convert HTML websites into a format compatible with large language models (LLMs), enabling seamless website na…☆18Updated last month
- Supercompat allows you to use any AI provider like Anthropic, Groq or Mistral with OpenAI-compatible Assistants API.☆61Updated this week
- A bring-your-own-key browser extension for summarizing Hacker News articles with LLMs☆53Updated last month
- COBOL for serverless headless browsers☆24Updated 3 months ago
- Simple, opinionated, JSON-typed, and traced LLM framework for TypeScript.☆35Updated 10 months ago
- A monorepo for Jamsocket client libraries, the CLI, and examples☆34Updated last month
- Streamable multi-format serialization with schema☆22Updated last month
- ☆35Updated 8 months ago
- Postgres extension that speeds up analytics queries by upto 90%☆49Updated 7 months ago
- Fast similarity search using DuckDB☆115Updated 2 months ago
- create dynamic pipelines on github workflows☆16Updated last week
- Gato Prompt Language (GPL): A system for generating focused instructions and short-form outputs.☆20Updated 3 months ago
- Piazza-Updater automates updates to a Weaviate database with real-time vectorial data. By continuously searching the internet and integra…☆28Updated last month
- Semantic Code Search Using Vectorized Abstract Syntax Trees☆17Updated last year
- Excalichart is an open source BI tool for finding insights in your data.☆39Updated last year
- Experimental jsxdom☆93Updated 3 months ago
- Visual inference exploration & experimentation playground☆84Updated last month
- Team-oriented tldraw built w/ InstantDB☆49Updated 3 months ago
- Magna is an AI-powered embedding similarity search tool for searching within large documents.☆29Updated 2 weeks ago
- A collection of tamagotchi characters to give AI assistants an identity.☆53Updated 2 months ago
- AI-assisted writing tool.☆64Updated 11 months ago