Python Implementation of MUVERA (Multi-Vector Retrieval via Fixed Dimensional Encodings)
☆398Dec 10, 2025Updated 3 months ago
Alternatives and similar repositories for muvera-py
Users that are interested in muvera-py are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The Python Implementation of CRISP: Clustering Multi-Vector Representations for Denoising and Pruning☆27Jul 27, 2025Updated 8 months ago
- Trainable embedding transformation for confidence estimation, feature extraction, explainability and conversion from dense to sparse.☆26Jun 9, 2025Updated 9 months ago
- 🎹 Instruct.KR 2025 Summer Meetup: 오픈소스 LLM, vLLM으로 Production까지 🎹☆23Aug 2, 2025Updated 7 months ago
- Generate fixed dimensional embeddings for multi-dimensional vectors in python based on Muvera from Google.☆20Jun 28, 2025Updated 9 months ago
- AutoRAG example about benchmarking Korean embeddings.☆43Oct 2, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆19May 16, 2024Updated last year
- A tool for manual conversion of BGE-M3 models with preserved trainable variables and direct control over model outputs.☆44Sep 7, 2025Updated 6 months ago
- 1-Click is all you need.☆63Apr 29, 2024Updated last year
- This repository aims to develop CoT Steering based on CoT without Prompting. It focuses on enhancing the model’s latent reasoning capabil…☆115Jun 25, 2025Updated 9 months ago
- Korean Sentence Embedding Model Performance Benchmark for RAG☆50Jan 27, 2025Updated last year
- ☆53Jul 10, 2025Updated 8 months ago
- High-performance vector search engine with no loss of accuracy through GPU and dynamic placement☆31Jul 12, 2025Updated 8 months ago
- The training codes of Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 4 months ago
- Contextualized per-token embeddings☆34May 11, 2025Updated 10 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆39Mar 11, 2025Updated last year
- A framework for benchmarking embedding models in hybrid search scenarios (BM25 + vector search) using Weaviate.☆38Updated this week
- Auto Thinking Mode switch for Qwen3 in Open webui☆71May 8, 2025Updated 10 months ago
- Bayesian probability transforms for BM25 retrieval scores☆65Updated this week
- MoLing is a computer-use and browser-use based MCP server. It is a locally deployed, dependency-free office AI assistant.☆332Mar 15, 2026Updated 2 weeks ago
- LLM-as-SERP☆68Mar 3, 2025Updated last year
- Wanna breeze through some papers?☆95Mar 17, 2026Updated last week
- Plug-and-play document AI with zero-shot models.☆125Feb 16, 2026Updated last month
- YAST - Yet Another SPLADE or Sparse Trainer☆21Jun 16, 2025Updated 9 months ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- A flexible, adaptive classification system for dynamic text classification☆540Oct 7, 2025Updated 5 months ago
- (Siggraph Asia 2023) Project Page of "HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image"☆10Dec 9, 2023Updated 2 years ago
- Code for paper "Spider: Any-to-Many Multimodal LLM"☆15Apr 26, 2025Updated 11 months ago
- Code for the paper "Multi-Field Adaptive Retrieval," a research project on a semi-structured document retrieval☆18Feb 13, 2026Updated last month
- ☆12Dec 20, 2024Updated last year
- Generative Modeling with Bayesian Sample Inference☆24May 17, 2025Updated 10 months ago
- The open-source RAG platform: built-in citations, deep research, 22+ file formats, partitions, MCP server, and more.☆1,929Mar 21, 2026Updated last week
- A Deep Research agent from scratch☆219May 18, 2025Updated 10 months ago
- It shows how to deploy and use an agent with LLM.☆19Mar 1, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Late Interaction Models Training & Retrieval☆770Mar 6, 2026Updated 3 weeks ago
- 구글에서 발표한 Chain-of-Thought Reasoning without Prompting을 코드로 구현한 레포입니다.☆65Sep 28, 2024Updated last year
- StrategyQA 데이터 세트 번역☆23Apr 12, 2024Updated last year
- ☆14Jul 7, 2024Updated last year
- ☆15Mar 18, 2026Updated last week
- 🚂 Fine-tune OpenAI models for text classification, question answering, and more☆17May 1, 2023Updated 2 years ago
- Adaptive Reasoning Engine for Efficient and Context-Aware Intelligence☆43Nov 15, 2025Updated 4 months ago