softwaredoug / searcharray
Full text search in your Pandas dataframe
☆209Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for searcharray
- Neural Search☆344Updated 5 months ago
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…☆162Updated 2 months ago
- Neural Search☆325Updated 5 months ago
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆287Updated last year
- Extra product metadata for the Amazon ESCI dataset☆35Updated last year
- hnsqlite integrates hnswlib and sqlite for simple text embedding search☆154Updated last year
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆183Updated last month
- Late Interaction Models Training & Retrieval☆165Updated this week
- Python API for https://vespa.ai, the open big data serving engine☆103Updated this week
- ⚡️A Blazing-Fast Python Library for Ranking Evaluation, Comparison, and Fusion 🐍☆478Updated 4 months ago
- ☆44Updated last week
- Labelling platform for text using weak supervision.☆260Updated 2 years ago
- Efficient BM25 with DuckDB 🦆☆29Updated last month
- ☆66Updated this week
- Information extraction from English and German texts based on predicate logic☆135Updated last year
- HNSW tutorial☆113Updated 9 months ago
- minimal pytorch implementation of bm25 (with sparse tensors)☆90Updated 8 months ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆103Updated 6 months ago
- The Fast Vector Similarity Library is designed to provide efficient computation of various similarity measures between vectors.☆350Updated 2 months ago
- Visualize text embeddings☆33Updated last year
- CuVS integration for Lucene☆29Updated 5 months ago
- Vespa application making an index of the CORD-19 dataset.☆39Updated this week
- The codebase for the book "AI-Powered Search" (Manning Publications, 2024)☆193Updated this week
- A large-scale information-rich web dataset, featuring millions of real clicked query-document labels☆308Updated 5 months ago
- ☆108Updated this week
- Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy☆899Updated last week
- provides a common interface to many IR measure tools☆78Updated 3 weeks ago
- NLP with Rust for Python 🦀🐍☆59Updated 5 months ago
- My personal frontpage app☆78Updated this week
- ☆42Updated last year