neuml / txtmarkerLinks
ποΈ Highlight text in documents
β109Updated 2 months ago
Alternatives and similar repositories for txtmarker
Users that are interested in txtmarker are comparing it to the libraries listed below
Sorting:
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.β59Updated last year
- Python package for deduplication/entity resolution using active learningβ80Updated 10 months ago
- π₯ Use Hugging Face text and token classification pipelines directly in spaCyβ63Updated last year
- β43Updated 2 years ago
- Streamlit component for Jina neural searchβ41Updated 3 years ago
- Aim-spaCy integrationβ34Updated last year
- Efficient few-shot learning with cross-encoders.β52Updated last year
- Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector searchβ26Updated last year
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progrβ¦β32Updated 2 months ago
- Pre-train Static Word Embeddingsβ79Updated 3 weeks ago
- Command Line Interface for Hugging Face Inference Endpointsβ66Updated last year
- Tool to apply Legal Matter Specification Standard (LMSS) to documentsβ13Updated 10 months ago
- Repository for deepdoctection tutorial notebooksβ45Updated this week
- Generalist and Lightweight Model for Text Classificationβ133Updated last week
- Information extraction from English and German texts based on predicate logicβ137Updated 2 years ago
- π’ Work with static vector modelsβ28Updated 2 months ago
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K β¦β82Updated 5 months ago
- Python API for https://vespa.ai, the open big data serving engineβ127Updated this week
- NeatText a simple NLP package for cleaning textual data and text preprocessingβ72Updated last year
- Few-shot Named Entity Recognitionβ123Updated 3 years ago
- Streamlit demo app to demonstrate the features of transformers interpret with multiple models.β25Updated 4 years ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.β109Updated last year
- π€ Trade any tensors over the networkβ30Updated last year
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created byβ¦β31Updated 10 months ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.β155Updated last year
- Clean, filter and sample URLs to optimize data collection β Python & command-line β Deduplication, spam, content and language filtersβ140Updated 5 months ago
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GROβ¦β50Updated 3 months ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP modelsβ¦β36Updated 3 years ago
- This is the repo for the container that holds the models for the text2vec-transformers moduleβ51Updated 2 months ago
- Topic Inference with Zeroshot modelsβ61Updated 2 years ago