interactive semantic search demo using Qwen3-0.6B-Embedding in your browser
☆58Feb 25, 2026Updated last month
Alternatives and similar repositories for qwen3-semantic-search
Users that are interested in qwen3-semantic-search are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Random llm scripts☆39Apr 12, 2026Updated last week
- Authenticated Knowledge & Trust Architecture for AI Agents☆32Dec 17, 2025Updated 4 months ago
- ☆17Dec 16, 2024Updated last year
- AdaLLM is an NVFP4-first inference runtime for Ada Lovelace (RTX 4090) with FP8 KV cache and custom decode kernels. This repo targets NVF…☆109Feb 15, 2026Updated 2 months ago
- FamilyBench evaluation tool for testing the relational reasoning capabilities of Large Language Models (LLMs).☆43Oct 6, 2025Updated 6 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆20Jul 4, 2025Updated 9 months ago
- Simple node proxy for llama-server that enables MCP use☆19May 10, 2025Updated 11 months ago
- ☆24Aug 26, 2025Updated 7 months ago
- A lightweight LLaMA.cpp HTTP server Docker image based on Alpine Linux.☆34Apr 9, 2026Updated last week
- llama-swap + a minimal ollama compatible api☆56Mar 14, 2026Updated last month
- Docker/podman container for llama.cpp/vllm/exllamav{2,3} orchestrated using llama-swap☆18Apr 10, 2026Updated last week
- Automated multi-account farming tool for Kite AI decentralized payment network with faucet claims, token staking, DEX swaps, daily quiz c…☆254Mar 13, 2026Updated last month
- Structured assembly rewriting library/mod for RW☆17Feb 7, 2026Updated 2 months ago
- Recursive Self-Aggregation evals on ARC-AGI☆31Jan 26, 2026Updated 2 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- MCP Server for Jaeger☆18May 13, 2025Updated 11 months ago
- MCP server that saves Claude Code tokens by delegating bounded tasks to local or cloud LLMs. 93% token savings benchmarked. Works with LM…☆56Apr 6, 2026Updated last week
- ☆19Aug 23, 2025Updated 7 months ago
- Vector functions and indexing for SQLite☆10Mar 26, 2023Updated 3 years ago
- Efficient non-uniform quantization with GPTQ for GGUF☆63Sep 17, 2025Updated 7 months ago
- The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"☆31Mar 26, 2026Updated 3 weeks ago
- Efforts toward giving Qwen 3 Coder 30B A3B proper agentic tool calling capabilities at or near 100% reliability.☆63Aug 10, 2025Updated 8 months ago
- ProxCLMC is a lightweight tool to determine the maximum CPU compatibility level that is supported across all nodes in a Proxmox VE cluste…☆35Jan 15, 2026Updated 3 months ago
- vTPM with SGX protection☆11May 30, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools such as web search …☆52Feb 10, 2026Updated 2 months ago
- ☆15Apr 28, 2023Updated 2 years ago
- MiniMax-Provider-Verifier offers a rigorous, vendor-agnostic way to verify whether third-party deployments of the Minimax M2 model are co…☆33Apr 1, 2026Updated 2 weeks ago
- An AI tool designed to generate explanations for every file in a project☆14Mar 7, 2025Updated last year
- Cleanai (https://github.com/willmil11/cleanai) except I'm making it in c now. Fast and clean from the start this time :)☆17Mar 6, 2026Updated last month
- Awesome AI Benchmarks☆29Jan 16, 2026Updated 3 months ago
- libtpms / swtpm software emulation of a Trusted Platform Module (TPM 1.2 and TPM 2.0) compile script☆13Sep 16, 2020Updated 5 years ago
- ☆20Dec 9, 2025Updated 4 months ago
- Simple Python script to download the entirety of Wikipedia on a weekly basis.☆14May 2, 2025Updated 11 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- deadsimple immersive navigation: a single-player-verse component☆15Mar 11, 2026Updated last month
- Metadata Editor user and practice guide☆17Mar 11, 2026Updated last month
- A Monitoring and Status Page using Cloudflare Worker and Pages. Inspired by https://github.com/eidam/cf-workers-status-page☆12Oct 7, 2024Updated last year
- Source code to accompany research paper on training multi token prediction language models using self-distillation.☆33Feb 21, 2026Updated last month
- Verify that any MCP server is running the intended and untampered code via hardware attestation.☆18Mar 28, 2025Updated last year
- Simple Tool Caller for llama.cpp☆11Aug 12, 2024Updated last year
- ☆58Sep 10, 2025Updated 7 months ago