maxdotio / mighty-batch
Highly concurrent and fast content processing for Mighty Inference Server
☆10 · Updated last year
Alternatives and similar repositories for mighty-batch:
Users who are interested in mighty-batch are comparing it to the libraries listed below.
- Utilities for loading and running text embeddings with ONNX ☆43 · Updated 5 months ago
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine ☆31 · Updated 3 years ago
- Documentation effort for the BookCorpus dataset ☆33 · Updated 3 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization. ☆22 · Updated last year
- LLM plugin for clustering embeddings ☆66 · Updated 10 months ago
- ☆29 · Updated last year
- Binary vector search example using Unum's USearch engine and pre-computed Wikipedia embeddings from Co:here and MixedBread ☆18 · Updated 9 months ago
- This repository implements DSPy programs for tasks in Indian languages ☆11 · Updated last year
- A library for squeakily cleaning and filtering language datasets. ☆45 · Updated last year
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial) ☆17 · Updated 10 months ago
- Efficiently computing & storing token n-grams from large corpora ☆17 · Updated 3 months ago
- A proposed standard `NOCK` for a Parquet format that supports efficient distributed serialization of multiple kinds of graph technologies ☆19 · Updated 2 years ago
- Pre-train Static Word Embeddings ☆42 · Updated this week
- Neural Solr = Solr 9 + Mighty Inference + Node ☆16 · Updated 2 years ago
- Inference engine for GLiNER models, in Rust ☆36 · Updated this week
- Training hybrid models for dummies. ☆18 · Updated 2 weeks ago
- spaCy entry points for Curated Transformers ☆26 · Updated 4 months ago
- A CLI tool for managing OpenAI batch processing jobs with ease. ☆29 · Updated 5 months ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy ☆63 · Updated 10 months ago
- Library for fast text representation and classification. ☆28 · Updated last year
- HNSW implemented in Python ☆19 · Updated 5 years ago
- GPU accelerated client-side embeddings for vector search, RAG etc. ☆66 · Updated last year
- ☆30 · Updated 2 years ago
- Framework for Self-Organizing Python Agents ☆29 · Updated 11 months ago
- ☆19 · Updated last year
- NLP with Rust for Python 🦀🐍 ☆60 · Updated 7 months ago
- ☆27 · Updated 4 months ago
- Latent Large Language Models ☆17 · Updated 5 months ago
- Using modal.com to process FineWeb-edu data ☆19 · Updated last month
- ☆18 · Updated 9 months ago