interactive semantic search demo using Qwen3-0.6B-Embedding in your browser
☆56Feb 25, 2026Updated last week
Alternatives and similar repositories for qwen3-semantic-search
Users that are interested in qwen3-semantic-search are comparing it to the libraries listed below
Sorting:
- AdaLLM is an NVFP4-first inference runtime for Ada Lovelace (RTX 4090) with FP8 KV cache and custom decode kernels. This repo targets NVF…☆97Feb 15, 2026Updated 3 weeks ago
- FamilyBench evaluation tool for testing the relational reasoning capabilities of Large Language Models (LLMs).☆41Oct 6, 2025Updated 5 months ago
- Random llm scripts☆37Feb 25, 2026Updated last week
- Simple node proxy for llama-server that enables MCP use☆17May 10, 2025Updated 9 months ago
- Docker/podman container for llama.cpp/vllm/exllamav{2,3} orchestrated using llama-swap☆17Feb 22, 2026Updated 2 weeks ago
- Authenticated Knowledge & Trust Architecture for AI Agents☆30Dec 17, 2025Updated 2 months ago
- ☆15Apr 28, 2023Updated 2 years ago
- ☆17Dec 16, 2024Updated last year
- ☆24Aug 26, 2025Updated 6 months ago
- ☆37Aug 4, 2025Updated 7 months ago
- Structured assembly rewriting library/mod for RW☆17Feb 7, 2026Updated last month
- A tool for testing and comparing the performance of different Large Language Model APIs. 一个用于测试和比较不同大语言模型API性能的工具。☆40Dec 9, 2025Updated 3 months ago
- practical claude code commands and subagents☆70Jan 23, 2026Updated last month
- ☆58Sep 10, 2025Updated 5 months ago
- A lightweight LLaMA.cpp HTTP server Docker image based on Alpine Linux.☆32Oct 3, 2025Updated 5 months ago
- Dashboard v5 Coming Soon!!☆63Feb 15, 2026Updated 3 weeks ago
- documentation used in my projects☆16Mar 2, 2026Updated last week
- llama-swap + a minimal ollama compatible api☆51Feb 13, 2026Updated 3 weeks ago
- Efficient non-uniform quantization with GPTQ for GGUF☆61Sep 17, 2025Updated 5 months ago
- Llama.cpp runner/swapper and proxy that emulates LMStudio / Ollama backends☆52Aug 21, 2025Updated 6 months ago
- Maximum hackage☆10Feb 16, 2026Updated 3 weeks ago
- TK's Tree Style Tab Outliner -- Organize your online life and sync between browsers, using your own private server☆21Mar 2, 2026Updated last week
- MVC fastify decorator Dependency injection Inversion of Control Typescript☆11Jan 5, 2023Updated 3 years ago
- Ping Legacy gives a legacy experience to test ping to get connection status and quality to network or internet.☆20Feb 19, 2026Updated 2 weeks ago
- vTPM with SGX protection☆11May 30, 2019Updated 6 years ago
- A real-time shared memory layer for multi-agent LLM systems.☆57Jan 12, 2026Updated last month
- Efforts toward giving Qwen 3 Coder 30B A3B proper agentic tool calling capabilities at or near 100% reliability.☆65Aug 10, 2025Updated 6 months ago
- The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"☆30Updated this week
- Real-Time equilibrium reconstruction code☆15Updated this week
- Wordpress for Voice AI.☆59Feb 26, 2026Updated last week
- Source code to accompany research paper on training multi token prediction language models using self-distillation.☆24Feb 21, 2026Updated 2 weeks ago
- NAIJA OSINT INTEL is a comprehensive open-source intelligence gathering tool specifically designed for Nigerian cybersecurity professiona…☆19Aug 21, 2025Updated 6 months ago
- A blueprint for next-gen AI. Project Infinity uses a token-efficient, Codified Agent Protocol to create specialized, secure, and imaginat…☆26Oct 2, 2025Updated 5 months ago
- Convert Confluence MIME exports (.doc) to clean Markdown☆34Jan 13, 2026Updated last month
- CypherFlow.ai is a cutting-edge Open Source platform providing private, AI conversations powered by Bitcoin micropayments. Experience unc…☆19Aug 11, 2025Updated 6 months ago
- ☆17Aug 5, 2025Updated 7 months ago
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools such as web search …☆51Feb 10, 2026Updated 3 weeks ago
- A transformer that decodes swipes across a smartphone keyboard into words (gesture / swipe / glide typing) (enhanced yandex cup solution)☆16Feb 20, 2026Updated 2 weeks ago
- A django-yolov5 starter webapp. Based on yolov5-flask example.☆11Mar 6, 2022Updated 4 years ago