High-Performance Text Deduplication Toolkit
☆62 · Aug 25, 2025 · Updated 8 months ago
Alternatives and similar repositories for text_dedup
Users that are interested in text_dedup are comparing it to the libraries listed below.
- This is a training method to produce a split brain model ☆14 · Mar 7, 2025 · Updated last year
- A natural language file search tool that uses LLMs to help you find files by describing what you're looking for. ☆27 · Mar 8, 2025 · Updated last year
- Using deep research workflow to generate datasets for finetuning LLMs. ☆39 · Oct 9, 2025 · Updated 7 months ago
- Radix set/map implementation ☆19 · Mar 15, 2024 · Updated 2 years ago
- Source code for DPTree: Differential Indexing for Persistent Memory ☆61 · Jul 28, 2021 · Updated 4 years ago
- Minimal web client for chatting and roleplay with AI characters ☆26 · Aug 21, 2025 · Updated 8 months ago
- Course Project for COMP4471 on RWKV ☆17 · Feb 11, 2024 · Updated 2 years ago
- Seamlessly bridge LM Studio and OpenWebUI with zero configuration ☆26 · Aug 24, 2025 · Updated 8 months ago
- ☆43 · Aug 2, 2025 · Updated 9 months ago
- Load and run Llama from safetensors files in C ☆15 · Oct 24, 2024 · Updated last year
- ScribePal is an Open Source intelligent browser extension that leverages AI to empower your web experience by providing contextual insigh… ☆21 · Apr 6, 2026 · Updated last month
- A simple streamlit app, dockerized, to do OCR on documents. I'm lazy, idk. ☆26 · Aug 18, 2025 · Updated 8 months ago
- Code-regrouping to reduce latency in Julia code compilation ☆15 · Mar 11, 2022 · Updated 4 years ago
- C++ client for rqlite ☆13 · Oct 22, 2017 · Updated 8 years ago
- Implementation from scratch in C of the Multi-head latent attention used in the Deepseek-v3 technical paper. ☆18 · Jan 15, 2025 · Updated last year
- An educational Rust project for exporting and running inference on Qwen3 LLM family ☆43 · Aug 3, 2025 · Updated 9 months ago
- An OpenVoice-based voice cloning tool, single executable file (~14M), supporting multiple formats without dependencies on ffmpeg, Python,… ☆48 · Jan 18, 2026 · Updated 3 months ago
- ☆51 · Oct 1, 2025 · Updated 7 months ago
- A VS Code extension that helps you explore CPython internals ☆14 · Mar 3, 2023 · Updated 3 years ago
- GPT-4 Level Conversational QA Trained In a Few Hours ☆68 · Aug 21, 2024 · Updated last year
- Lightweight C inference for Qwen3 GGUF. Multiturn prefix caching & batch processing. ☆25 · Sep 1, 2025 · Updated 8 months ago
- Precision Knowledge Editing (PKE): A novel method to reduce toxicity in LLMs while preserving performance, with robust evaluations and ha… ☆11 · Nov 26, 2024 · Updated last year
- ☆31 · Sep 19, 2025 · Updated 7 months ago
- Metal GPU implementation of the Qwen3 transformer model on macOS with complete Apple Silicon compute shader acceleration. ☆45 · Oct 6, 2025 · Updated 7 months ago
- ☆83 · Sep 9, 2025 · Updated 8 months ago
- experimental parallel json parser, scj3 (claujson) ☆14 · Apr 4, 2026 · Updated last month
- ☆11 · Jan 21, 2019 · Updated 7 years ago
- Implementation of a fast semantic chunker in C++, installable in python 3.7+ projects. ☆22 · Sep 20, 2025 · Updated 7 months ago
- Groq-powered MAD: The first work to explore Multi-Agent Debate with Large Language Models :D ☆12 · Jul 5, 2024 · Updated last year
- A text-grid web renderer for AI agents — see the web without screenshots ☆83 · Mar 10, 2026 · Updated last month
- ☆12 · May 23, 2024 · Updated last year
- LiteLLM model integration for Pydantic AI framework - access 100+ LLM providers through a unified interface ☆22 · Apr 15, 2026 · Updated 3 weeks ago
- A unique_ptr implementation with small object optimization ☆20 · Feb 8, 2026 · Updated 3 months ago
- ☆18 · Jul 2, 2024 · Updated last year
- fully local, temporally aware natural language file search on your pc! even without a GPU. find relevant files using natural language i… ☆189 · Apr 9, 2026 · Updated last month
- ☆15 · May 24, 2019 · Updated 6 years ago
- Experience the power of AI with this free AI voice generator demo. Utilizing Deepgram and Groq, we transform text into voice seamlessly. … ☆37 · Jun 12, 2024 · Updated last year
- House of the Paiperwork ☆35 · Updated this week
- The rag pipeline for optimizing dynamic data editing. ☆21 · Oct 30, 2025 · Updated 6 months ago