conanhujinming / text_dedupLinks
High-Performance Text Deduplication Toolkit
☆61Updated 5 months ago
Alternatives and similar repositories for text_dedup
Users that are interested in text_dedup are comparing it to the libraries listed below
Sorting:
- Lightweight Llama 3 8B Inference Engine in CUDA C☆53Updated 10 months ago
- Running Microsoft's BitNet via Electron, React & Astro☆51Updated 4 months ago
- Enhancing LLMs with LoRA☆206Updated 3 months ago
- ☆51Updated 4 months ago
- ☆15Updated 9 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆88Updated this week
- Produce your own Dynamic 3.0 Quants and achieve optimum accuracy & SOTA quantization performance! Input your VRAM and RAM and the toolcha…☆76Updated this week
- Editor with LLM generation tree exploration☆82Updated 11 months ago
- instinct.cpp provides ready to use alternatives to OpenAI Assistant API and built-in utilities for developing AI Agent applications (RAG,…☆57Updated last year
- Sparse Inferencing for transformer based LLMs☆218Updated 5 months ago
- A minimalistic C++ Jinja templating engine for LLM chat templates☆202Updated 4 months ago
- Load and run Llama from safetensors files in C☆15Updated last year
- Inference RWKV v7 in pure C.☆43Updated 3 months ago
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools such as web search …☆48Updated 5 months ago
- A real-time shared memory layer for multi-agent LLM systems.☆53Updated 2 weeks ago
- InferX: Inference as a Service Platform☆154Updated this week
- Lightweight C inference for Qwen3 GGUF. Multiturn prefix caching & batch processing.☆21Updated 5 months ago
- Open source implementation for computer use, using light OCR models and LLMs. Get Android app in link below.☆30Updated this week
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆41Updated last year
- ☆42Updated 5 months ago
- ☆209Updated 3 weeks ago
- llmbasedos — Local-First OS Where Your AI Agents Wake Up and Work☆281Updated 3 weeks ago
- Local Qwen3 LLM inference. One easy-to-understand file of C source with no dependencies.☆156Updated 6 months ago
- ☆108Updated 2 months ago
- A natural language file search tool that uses LLMs to help you find files by describing what you're looking for.☆27Updated 10 months ago
- Super simple python connectors for llama.cpp, including vision models (Gemma 3, Qwen2-VL). Compile llama.cpp and run!☆29Updated last month
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆100Updated 7 months ago
- ☆27Updated 7 months ago
- From-scratch implementation of OpenAI's GPT-OSS model in Python. No Torch, No GPUs.☆108Updated 2 months ago
- ☆178Updated 5 months ago