Plug-and-play document AI with zero-shot models.
☆125Feb 16, 2026Updated last month
Alternatives and similar repositories for sieves
Users that are interested in sieves are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Lightweight piece tokenization library☆12Apr 15, 2024Updated last year
- Modular Rust transformer/LLM library using Candle☆38May 5, 2024Updated last year
- spaCy entry points for Curated Transformers☆32Mar 27, 2026Updated last week
- Efficient BM25 with DuckDB 🦆☆65Dec 20, 2024Updated last year
- Pre-train Static Word Embeddings☆98Mar 27, 2026Updated 2 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Next-generation Punkt sentence boundary detection with zero dependencies☆30Nov 18, 2025Updated 4 months ago
- Wrapper for the macOS signpost API☆16Apr 24, 2023Updated 2 years ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆84Feb 10, 2026Updated last month
- CMU Linguistic Annotation Backend☆15Sep 22, 2025Updated 6 months ago
- Load embeddings and featurize your sentences.☆31Oct 23, 2024Updated last year
- 🔢 Work with static vector models☆38Apr 21, 2025Updated 11 months ago
- Parent repository for the MOJ Analytics Platform☆14Nov 16, 2021Updated 4 years ago
- spaCy extension for Visual Studio Code☆32Mar 10, 2025Updated last year
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.☆21Aug 15, 2024Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Customize, control, and enhance LLM generation with logits processors, featuring visualization capabilities to inspect and understand sta…☆46Jan 8, 2026Updated 3 months ago
- ☄️ Parallel and distributed training with spaCy and Ray☆56Jul 31, 2023Updated 2 years ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆47Sep 5, 2024Updated last year
- Kernel sources for https://huggingface.co/kernels-community☆90Updated this week
- Evaluation framework for document processing models and services.☆67Apr 2, 2026Updated last week
- Synthetic Text Dataset Generation for LLM projects☆58Mar 26, 2026Updated 2 weeks ago
- ☆69Mar 17, 2022Updated 4 years ago
- ☆15May 8, 2019Updated 6 years ago
- C inference engine for running GLiClass (Generalist and Lightweight Classification) models☆17May 21, 2025Updated 10 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- KL3M training data collection and preprocessing☆21Apr 14, 2025Updated 11 months ago
- ☆23Jan 2, 2023Updated 3 years ago
- A full fledged mistral+wandb☆13Aug 16, 2024Updated last year
- simple grpo☆12May 28, 2025Updated 10 months ago
- Interactive, searchable organiastion charts in d3.js☆21Mar 11, 2018Updated 8 years ago
- 🧪 Cutting-edge experimental spaCy components and features☆105Apr 23, 2024Updated last year
- The privacy-preserving record linkage toolkit: a proof-of-concept public demo of next-gen data linkage techniques.☆16May 22, 2024Updated last year
- Retired repository for Machine Learning utils at the Wellcome Trust (now deprecated).☆31Aug 9, 2023Updated 2 years ago
- Nearly Inference Free Embeddings: make your RAG queries 500x faster☆71Feb 20, 2026Updated last month
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆30Jun 23, 2022Updated 3 years ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆23Mar 12, 2024Updated 2 years ago
- Python driver for MobilityDB☆11Apr 12, 2023Updated 2 years ago
- A spaCy wrapper for GliNER☆133Jan 29, 2025Updated last year
- Legal Matter Standard Specification (LMSS) library for Python☆17Nov 14, 2023Updated 2 years ago
- Neural syntax annotator, supporting sequence labeling, lemmatization, and dependency parsing.☆80Oct 22, 2023Updated 2 years ago
- ☆162Dec 2, 2024Updated last year