TFIDF / KNN based string matching
☆56Apr 6, 2023Updated 3 years ago
Alternatives and similar repositories for tfidf_matcher
Users that are interested in tfidf_matcher are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Locate and tag named entities in text☆25Sep 17, 2025Updated 8 months ago
- Multi-platform native package builder toolkit☆15Aug 18, 2025Updated 9 months ago
- ☆39Sep 26, 2020Updated 5 years ago
- Detect and invoke build systems☆23May 4, 2026Updated 3 weeks ago
- Research into identifying and correcting incorrect labels in the CoNLL-2003 corpus.☆12May 11, 2021Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Dataset used to evaluate Skill Extraction systems based on the ESCO skills taxonomy.☆17Jul 18, 2024Updated last year
- Linear-time sampling algorithm for geometric inhomogeneous random graphs with a special case implementation for hyperbolic random graphs.☆16Dec 4, 2023Updated 2 years ago
- Sentence transformers models for SpaCy☆108Mar 9, 2023Updated 3 years ago
- Concatenate videos for playback by videojs/http-streaming in a Video.js player☆12May 18, 2021Updated 5 years ago
- CLI for loading Wikidata subsets (or all of it) into Elasticsearch☆71Feb 3, 2022Updated 4 years ago
- ☆17Jan 29, 2020Updated 6 years ago
- R interface to Spark TensorFlow Connector☆13Sep 13, 2021Updated 4 years ago
- Population and demographics projection module, developed for ITRC/MISTRAL☆13Dec 8, 2022Updated 3 years ago
- LazyText is inspired by the idea of lazypredict, a library which helps build lot of basic models without much code. LazyText is for text …☆18Feb 19, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- CEBaB: Estimating the Causal Effects of Real-World Concepts on NLP Model Behavior☆13Sep 21, 2022Updated 3 years ago
- Dataset for Unified Editing, EMNLP 2023. This is a model editing dataset where edits are natural language phrases.☆24Sep 4, 2024Updated last year
- AzureAIOBalancer is a Terraform repository for automating the deployment of a load-balanced Azure OpenAI environment across multiple regi…☆10Nov 3, 2023Updated 2 years ago
- Reticulate wrapper for hyperopt☆11Jan 14, 2020Updated 6 years ago
- Work in progress - Module to Convert a json sequence into an FCPX XML. For BBC News Labs digital paper edit project☆26Jul 8, 2023Updated 2 years ago
- Population prediction model based on extracted OSM features☆15Dec 8, 2022Updated 3 years ago
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)☆12Oct 11, 2024Updated last year
- Dataset of 4.6m GitHub repository names☆16Jul 3, 2016Updated 9 years ago
- Fast and thread safe C++11 implementation of of the Aho-Corasick algorithm.☆10Mar 4, 2020Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- The inverted index exchange format as defined as part of the Open-Source IR Replicability Challenge (OSIRRC) initiative☆11Aug 6, 2025Updated 9 months ago
- A “Hello World” of calling Rust code from a Python program with CFFI, in order to show packaging issues☆11Jul 14, 2016Updated 9 years ago
- Finds snippets in iambic pentameter in English-language text and tries to combine them to a rhyming sonnet.☆13Jan 5, 2023Updated 3 years ago
- MERLIN is a global, model-agnostic, contrastive explainer for any tabular or text classifier. It provides contrastive explanations of how…☆19Sep 15, 2023Updated 2 years ago
- Wikimedia Pageview API client☆29May 17, 2026Updated last week
- ☆17Jun 23, 2020Updated 5 years ago
- spaCy match and replace, maintaining conjugation☆35Dec 9, 2022Updated 3 years ago
- LIDA: Lightweight Interactive Dialogue Annotator (in EMNLP 2019)☆10Oct 18, 2021Updated 4 years ago
- making printf work for you☆16Updated this week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Platform for making incremental changes to code in VCSes☆14May 6, 2026Updated 2 weeks ago
- ☆26Jan 5, 2023Updated 3 years ago
- Study Guide for Microsoft 70-773☆12Jun 21, 2017Updated 8 years ago
- DartMinHash: Fast Sketching for Weighted Sets☆12Dec 8, 2025Updated 5 months ago
- ☆10Oct 19, 2020Updated 5 years ago
- A library for parsing security advisories☆13Apr 13, 2026Updated last month
- Implementation of DeepMind's "Sobolev Training for Neural Networks"☆11Apr 2, 2018Updated 8 years ago