conanhujinming / text_dedup
High-Performance Text Deduplication Toolkit
☆40 · Updated last week
Alternatives and similar repositories for text_dedup
Users interested in text_dedup are comparing it to the libraries listed below.
- Lightweight Llama 3 8B Inference Engine in CUDA C · ☆48 · Updated 5 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with a limited amount of VRAM/other resources by exposing them on differe… · ☆73 · Updated last week
- Lightweight C inference for Qwen3 GGUF, with the smallest (0.6B) at full precision (FP32) · ☆16 · Updated 2 weeks ago
- Editor with LLM generation tree exploration · ☆73 · Updated 6 months ago
- instinct.cpp provides ready-to-use alternatives to the OpenAI Assistant API and built-in utilities for developing AI Agent applications (RAG,… · ☆53 · Updated last year
- Enhancing LLMs with LoRA · ☆100 · Updated 3 weeks ago
- InferX is an Inference Function-as-a-Service platform · ☆129 · Updated last week
- ☆42 · Updated 2 weeks ago
- Running Microsoft's BitNet via Electron, React & Astro · ☆43 · Updated 2 months ago
- A minimalistic C++ Jinja templating engine for LLM chat templates · ☆170 · Updated 3 weeks ago
- Load and run Llama from safetensors files in C · ☆11 · Updated 10 months ago
- Sparse inferencing for transformer-based LLMs · ☆197 · Updated 2 weeks ago
- A Field-Theoretic Approach to Unbounded Memory in Large Language Models · ☆20 · Updated 4 months ago
- Guaranteed Structured Output from any Language Model via Hierarchical State Machines · ☆145 · Updated 2 months ago
- A chat UI for Llama.cpp · ☆15 · Updated last week
- Local Qwen3 LLM inference. One easy-to-understand file of C source with no dependencies. · ☆102 · Updated last month
- ☆209 · Updated last month
- ☆13 · Updated 4 months ago
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools such as web search… · ☆42 · Updated last week
- Simple Node proxy for llama-server that enables MCP use · ☆13 · Updated 3 months ago
- Input your VRAM and RAM and the toolchain will produce a GGUF model tuned to your system within seconds — flexible model sizing and lowes… · ☆33 · Updated this week
- ☆29 · Updated 4 months ago
- ggml implementation of embedding models, including SentenceTransformer and BGE · ☆59 · Updated last year
- A real-time shared memory layer for multi-agent LLM systems. · ☆47 · Updated 2 months ago
- ☆64 · Updated 8 months ago
- 1.58-bit LLaMa model · ☆82 · Updated last year
- General-purpose GPU compute framework built on Vulkan to support 1000s of cross-vendor graphics cards (AMD, Qualcomm, NVIDIA & friends).… · ☆52 · Updated 6 months ago
- AirLLM 70B inference with a single 4GB GPU · ☆14 · Updated 2 months ago
- ☆98 · Updated 2 months ago
- vLLM port of the Chatterbox TTS model · ☆283 · Updated this week