SnowPilotOrg / dedupe_it
Simple fuzzy deduplication
☆23 Updated last week
Related projects
Alternatives and complementary repositories for dedupe_it
- ☆70 Updated this week
- ➖ Stripped down, stable version of firecrawl optimized for self-hosting and ease of contribution. Billing logic and AI features are compl… ☆154 Updated this week
- Extremely memory-efficient vector database ☆56 Updated last month
- OMF is a compact, user-friendly specification that defines a lightweight API contract between client and server for building conversation… ☆58 Updated 2 months ago
- AI-assisted writing tool. ☆64 Updated 9 months ago
- Experimental jsxdom ☆92 Updated last month
- SyncLite : Build Anything Sync Anywhere ☆143 Updated 3 weeks ago
- Chat strategies for LLMs ☆90 Updated 2 months ago
- Fast similarity search using DuckDB ☆106 Updated 2 weeks ago
- Documentation for the Krixik Python client. ☆37 Updated this week
- A collection of tamagotchi characters to give AI assistants an identity. ☆50 Updated 2 weeks ago
- GUI for selecting text files for concatenation and submission to LLMs ☆131 Updated last week
- Build super simple end-to-end data & ETL pipelines for your vector databases and Generative AI applications ☆77 Updated last month
- A Chrome extension to extract any data from any website ☆41 Updated last month
- RΞASON is a minimalistic TypeScript framework for building great LLM apps ☆49 Updated 3 months ago
- Secure, locally-run Retrieval-Augmented Generation system for document-based question-answering, utilizing Llama 3, Mistral, and Gemini m… ☆20 Updated last month
- Postgres extension that speeds up analytics queries by up to 90% ☆48 Updated 5 months ago
- Structured Output Is All You Need! ☆48 Updated 7 months ago
- 360M model running in the browser on WebGPU ☆20 Updated 2 months ago
- Command-line interface for the Arcane Engine ☆43 Updated 2 weeks ago
- Supercompat allows you to use any AI provider like Anthropic, Groq or Mistral with OpenAI-compatible Assistants API. ☆54 Updated 2 weeks ago
- Python SDK for Inngest: Durable functions and workflows in Python, hosted anywhere ☆51 Updated 2 weeks ago
- ☆35 Updated 6 months ago
- What if an HNSW index was just a file, and you could serve it from a CDN, and search it directly in the browser? ☆85 Updated 5 months ago
- BuildFlow is an open-source framework for building large-scale systems using Python. All you need to do is describe where your input is … ☆193 Updated 10 months ago
- Analyzing Hacker News in real-time with Bytewax and Proton ☆38 Updated 9 months ago
- A SQLite extension for generating text embeddings from remote APIs (OpenAI, Nomic, Ollama, llamafile...) ☆85 Updated last week
- An electronic data capture platform for administering remote and in-person clinical instruments ☆103 Updated last week
- ☆26 Updated 2 months ago