Load embeddings and featurize your sentences.
☆31Oct 23, 2024Updated last year
Alternatives and similar repositories for reach
Users that are interested in reach are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆15Apr 28, 2020Updated 5 years ago
- Python package that offers text scrubbing functionality, providing building blocks for string cleaning as well as normalizing geographica…☆22Aug 26, 2024Updated last year
- Trainable embedding transformation for confidence estimation, feature extraction, explainability and conversion from dense to sparse.☆26Jun 9, 2025Updated 9 months ago
- Experiment, Storage and Visualization Framework for Machine Learning research.☆31May 19, 2021Updated 4 years ago
- 🔢 Work with static vector models☆38Apr 21, 2025Updated 11 months ago
- Clinical spelling correction with word and character n-gram embeddings.☆75Jun 21, 2022Updated 3 years ago
- Noise Reduction Methods for Distantly Supervised Biomedical Relation Extraction☆11Oct 25, 2017Updated 8 years ago
- Check-Worthiness Detection in Dutch☆14Oct 25, 2024Updated last year
- Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia description…☆11Dec 8, 2022Updated 3 years ago
- Unsupervised concept extraction from clinical text☆14Jun 17, 2024Updated last year
- Bayesian probability transforms for BM25 retrieval scores☆62Updated this week
- Datamodels for hugging face tokenizers☆104Mar 12, 2026Updated last week
- benchmarks for LLM tokenizers☆17Feb 27, 2026Updated 3 weeks ago
- 🍺 a Homebrew keg that specialized in Natural Language Processing.☆22May 23, 2018Updated 7 years ago
- Automatically exported from code.google.com/p/transducersaurus☆11Apr 1, 2015Updated 10 years ago
- FlexiTokens☆18Dec 27, 2025Updated 2 months ago
- Essential NLP & ML, short & fast pure Python code☆79Sep 17, 2025Updated 6 months ago
- A framework to compare entity linking systems.☆38Jul 29, 2018Updated 7 years ago
- Code for "SCL-RAI: Span-based Contrastive Learning with Retrieval Augmented Inference for Unlabeled Entity Problem in NER" @COLING-2022☆11Aug 20, 2022Updated 3 years ago
- Nanoloop source files for the album "Prime 16"☆11Mar 7, 2026Updated 2 weeks ago
- ☆12Jul 6, 2023Updated 2 years ago
- The H3 Compressor: A compression scheme tailored for H3 cell indexes.☆16Mar 25, 2024Updated last year
- ☆12Sep 1, 2021Updated 4 years ago
- Supervised and unsupervised self-organising maps☆12Mar 11, 2026Updated last week
- Application for Math formula detection in image/pdf and then recognition☆12Jan 14, 2025Updated last year
- Model implementation for the contextual embeddings project☆43Jun 2, 2025Updated 9 months ago
- Python library for fitting massive mixture models using DP priors and GPU computation.☆23Apr 7, 2016Updated 9 years ago
- 💵 Code for Less is More for Long Document Summary Evaluation by LLMs (Wu*, Iso* et al; EACL 2024)☆11Feb 22, 2024Updated 2 years ago
- implementation of aided LLM codeplan algorithm in java☆10Jan 13, 2024Updated 2 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆21Feb 7, 2023Updated 3 years ago
- Echo State Network☆17May 2, 2014Updated 11 years ago
- 🫧 Code for Holistic Reasoning with Long-Context LMs: A Benchmark for Database Operations on Massive Textual Data (Maekawa*, Iso* et al.…☆12Feb 25, 2025Updated last year
- Code repo for the ICML 2021 paper "Making Paper Reviewing Robust to Bid Manipulation Attacks".☆10Sep 15, 2021Updated 4 years ago
- Bad link reporter for GitHub repositories☆13Mar 25, 2024Updated last year
- ☆12Apr 17, 2021Updated 4 years ago
- Annotato is a React component that helps to annotate or display and add interactivity to previously made annotations in a given text.☆12Aug 15, 2023Updated 2 years ago
- ☆44Nov 30, 2017Updated 8 years ago
- ☆11Dec 21, 2022Updated 3 years ago
- Plagiarism checker plugin for OJS 3/OMP☆14Mar 3, 2026Updated 3 weeks ago