maxdotio / mighty-batch
Highly concurrent and fast content processing for Mighty Inference Server
☆10Updated 2 years ago
Alternatives and similar repositories for mighty-batch:
Users who are interested in mighty-batch are comparing it to the libraries listed below
- Binary vector search example using Unum's USearch engine and pre-computed Wikipedia embeddings from Co:here and MixedBread☆18Updated 11 months ago
- Neural Solr = Solr 9 + Mighty Inference + Node☆17Updated 2 years ago
- A proposed standard `NOCK` for a Parquet format that supports efficient distributed serialization of multiple kinds of graph technologies☆19Updated 2 years ago
- Documentation effort for the BookCorpus dataset☆34Updated 3 years ago
- This repository implements DSPy programs for tasks in Indian languages☆13Updated last year
- Library for fast text representation and classification.☆28Updated last year
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated 2 years ago
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine☆31Updated 3 years ago
- Utilities for loading and running text embeddings with ONNX☆44Updated 7 months ago
- ☆30Updated 2 years ago
- History of Open-Source IR Systems☆11Updated 2 months ago
- Framework for Self-Organizing Python Agents☆29Updated last year
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)☆17Updated last year
- ☆19Updated 6 years ago
- Fast and versatile tokenizer for language models, compatible with SentencePiece, Tokenizers, Tiktoken and more. Supports BPE, Unigram and…☆19Updated 2 weeks ago
- Embedding models from Jina AI☆58Updated last year
- spaCy entry points for Curated Transformers☆27Updated 6 months ago
- A text-to-SQL prototype on the northwind sqlite dataset☆12Updated 6 months ago
- ☆19Updated 4 years ago
- HNSW implemented in Python☆20Updated 5 years ago
- Pre-train Static Word Embeddings☆52Updated 3 weeks ago
- Local emulator for Hugging Face Inference Endpoints custom handlers☆25Updated last year
- Vespa application making an index of the CORD-19 dataset.☆39Updated 2 months ago
- Efficient BM25 with DuckDB 🦆☆44Updated 3 months ago
- Prototyping a question and answer bot over PDFs☆39Updated last year
- NLP with Rust for Python 🦀 🐍☆61Updated 10 months ago
- LLM plugin for models hosted by Anyscale Endpoints☆33Updated 11 months ago
- LLM plugin for clustering embeddings☆72Updated last year
- Tool to apply Legal Matter Specification Standard (LMSS) to documents☆13Updated 7 months ago
- Code for SaGe subword tokenizer (EACL 2023)☆24Updated 4 months ago