Load embeddings and featurize your sentences.
☆31Oct 23, 2024Updated last year
Alternatives and similar repositories for reach
Users that are interested in reach are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Variational inference and disentangled representations through unsupervised learning☆21Mar 2, 2020Updated 6 years ago
- ☆15Apr 28, 2020Updated 6 years ago
- Python package that offers text scrubbing functionality, providing building blocks for string cleaning as well as normalizing geographica…☆22Aug 26, 2024Updated last year
- Trainable embedding transformation for confidence estimation, feature extraction, explainability and conversion from dense to sparse.☆28Jun 5, 2026Updated last week
- Pre-train Static Word Embeddings☆106Updated this week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Clinical spelling correction with word and character n-gram embeddings.☆77Jun 21, 2022Updated 3 years ago
- Noise Reduction Methods for Distantly Supervised Biomedical Relation Extraction☆11Oct 25, 2017Updated 8 years ago
- Check-Worthiness Detection in Dutch☆14Oct 25, 2024Updated last year
- Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia description…☆11Dec 8, 2022Updated 3 years ago
- Uniform sampling on various geometric shapes☆10Jul 18, 2023Updated 2 years ago
- ☆13Nov 7, 2025Updated 7 months ago
- Plug-and-play document AI with zero-shot models.☆126May 11, 2026Updated last month
- Datamodels for hugging face tokenizers☆107May 26, 2026Updated 2 weeks ago
- benchmarks for LLM tokenizers☆18Mar 25, 2026Updated 2 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Official Rust Implementation of Model2Vec☆191May 24, 2026Updated 3 weeks ago
- Numerical experiments for nested cross-validation paper☆14Jun 10, 2022Updated 4 years ago
- 🍺 a Homebrew keg that specialized in Natural Language Processing.☆22May 23, 2018Updated 8 years ago
- A blueprint for AI development, focusing on applied examples of RAG, information extraction, analysis and fine-tuning in the age of LLMs …☆66Feb 6, 2025Updated last year
- A scalable implementation of diffusion and flow-matching with XGBoost models, applied to calorimeter data.☆22Mar 23, 2026Updated 2 months ago
- Essential NLP & ML, short & fast pure Python code☆79Mar 29, 2026Updated 2 months ago
- A pipeline for detecting novel information about entities from a stream of text, updating a knowledge base about the entities, and genera…☆32Aug 29, 2019Updated 6 years ago
- Nearly Inference Free Embeddings: make your RAG queries 500x faster☆77Apr 27, 2026Updated last month
- A new tool for harmonizing volumetric MRI data from unseen scanners (Garcia-Dias et al. 2020)☆18Jan 6, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A framework to compare entity linking systems.☆38Jul 29, 2018Updated 7 years ago
- Official details for: [1803.08493] Context is Everything: Finding Meaning Statistically in Semantic Spaces☆39Aug 4, 2019Updated 6 years ago
- FlexiTokens☆23Dec 27, 2025Updated 5 months ago
- ☆12Nov 17, 2018Updated 7 years ago
- Code for "SCL-RAI: Span-based Contrastive Learning with Retrieval Augmented Inference for Unlabeled Entity Problem in NER" @COLING-2022☆11Aug 20, 2022Updated 3 years ago
- ☆16Dec 10, 2025Updated 6 months ago
- Fake news detection, Google Summer of Code 2017☆92May 2, 2018Updated 8 years ago
- Supervised and unsupervised self-organising maps☆13Mar 11, 2026Updated 3 months ago
- Getting interpretable dimensions in word embedding spaces.☆15Jul 6, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Python library for fitting massive mixture models using DP priors and GPU computation.☆23Apr 7, 2016Updated 10 years ago
- Knowledge graph Entity and Word Embeddings for Retrieval☆11Nov 19, 2021Updated 4 years ago
- 💵 Code for Less is More for Long Document Summary Evaluation by LLMs (Wu*, Iso* et al; EACL 2024)☆11Feb 22, 2024Updated 2 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆20Feb 7, 2023Updated 3 years ago
- 🫧 Code for Holistic Reasoning with Long-Context LMs: A Benchmark for Database Operations on Massive Textual Data (Maekawa*, Iso* et al.…☆12Feb 25, 2025Updated last year
- Echo State Network☆17May 2, 2014Updated 12 years ago
- Code repo for the ICML 2021 paper "Making Paper Reviewing Robust to Bid Manipulation Attacks".☆10Sep 15, 2021Updated 4 years ago