bb25 is a fast, self-contained BM25 + Bayesian calibration implementation with a minimal Python API.
☆141Mar 17, 2026Updated 3 weeks ago
Alternatives and similar repositories for bb25
Users who are interested in bb25 are comparing it to the libraries listed below.
- AutoRAG example about benchmarking Korean embeddings.☆44Oct 2, 2024Updated last year
- Performs benchmarking on two Korean datasets with minimal time and effort.☆46Jan 22, 2026Updated 2 months ago
- Question and answer retrieval in Turkish with BERT☆14Nov 30, 2021Updated 4 years ago
- Kor-IR: Korean Information Retrieval Benchmark☆87Jul 3, 2024Updated last year
- LangChain / LangGraph Q&A agent☆35Apr 15, 2025Updated 11 months ago
- Code for the paper "Multi-Field Adaptive Retrieval," a research project on semi-structured document retrieval☆18Feb 13, 2026Updated 2 months ago
- Create offline "desktop" app using Next.js without Electron!☆20Jun 3, 2024Updated last year
- Wrinkl is an AI context management system with ledger-based feature tracking for better AI-assisted development☆33Jul 20, 2025Updated 8 months ago
- 2024 PyCon Korea tutorial☆12Nov 8, 2024Updated last year
- KURE: an embedding model specialized for Korean retrieval, developed at Korea University☆209Apr 4, 2026Updated last week
- ☆10Oct 24, 2024Updated last year
- An experimental desktop client for using Claude Desktop's MCP with Novelcrafter codices.☆10Dec 3, 2024Updated last year
- Korean-MTEB☆81Mar 12, 2026Updated last month
- Marketplace ML experiment - training without backprop☆27Sep 9, 2025Updated 7 months ago
- ☆79Mar 28, 2026Updated 2 weeks ago
- Data science with Pandas and NumPy: EDA, binning, distribution functions, simulations, regression analysis☆11Dec 26, 2024Updated last year
- GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddings☆44Mar 6, 2024Updated 2 years ago
- Clustered Compositional Embeddings☆12Oct 25, 2023Updated 2 years ago
- Code implementation that prevents LLMs from generating foreign-language tokens☆85Aug 7, 2025Updated 8 months ago
- K-HALU: Multiple Answer Korean Hallucination Benchmark for Large Language Models☆38Dec 30, 2025Updated 3 months ago
- This repository will contain a demo using Weaviate with data and metadata from the arXiv dataset.☆15Mar 8, 2022Updated 4 years ago
- Tool to migrate data into Qdrant☆74Mar 30, 2026Updated 2 weeks ago
- Unofficial implementation of the paper "Exploring the Space of Key-Value-Query Models with Intention"☆12May 24, 2023Updated 2 years ago
- NLP daily-commit challenge ("planting grass") on GitHub, season 5☆10Aug 19, 2024Updated last year
- A local chatbot for managing docs☆24Jun 14, 2025Updated 10 months ago
- Continual learning layer for coding agents☆61Mar 31, 2026Updated 2 weeks ago
- Generates and optimizes Haiku system and user prompts for classification☆15Oct 27, 2025Updated 5 months ago
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Aug 24, 2024Updated last year
- GPU accelerated Perlin Noise in python☆11Oct 23, 2020Updated 5 years ago
- ☆22Oct 27, 2025Updated 5 months ago
- KEN: Relational Data Embeddings☆34Jan 2, 2024Updated 2 years ago
- 🇰🇷 Korean LLM Datasets | Pre-training, SFT, DPO, RLHF, CoT | Korean LLM dataset curation☆36Jan 20, 2026Updated 2 months ago
- A library to encode text as DNA and decode DNA to text.☆13Nov 21, 2022Updated 3 years ago
- Official Documentation for DSPy Library☆23Mar 27, 2026Updated 2 weeks ago
- Build super simple end-to-end data & ETL pipelines for your vector databases and Generative AI applications☆108Oct 5, 2024Updated last year
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization, in pure C.☆24Jul 6, 2024Updated last year
- ☆12Jun 17, 2023Updated 2 years ago
- Mistral-7B finetuned for function calling☆16Jan 28, 2024Updated 2 years ago
- Mini Callcenter Simulator simulates a call center and takes into account many parameters not covered by the Erlang C formula.☆12Updated this week