neuml / txtmarker
ποΈ Highlight text in documents
β107Updated 2 weeks ago
Alternatives and similar repositories for txtmarker:
Users that are interested in txtmarker are comparing it to the libraries listed below
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.β59Updated last year
- π’ Work with static vector modelsβ28Updated 2 weeks ago
- Generalist and Lightweight Model for Text Classificationβ124Updated last week
- Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector searchβ24Updated last year
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K β¦β80Updated 4 months ago
- Python package for deduplication/entity resolution using active learningβ79Updated 8 months ago
- Repository for deepdoctection tutorial notebooksβ44Updated 5 months ago
- Efficient few-shot learning with cross-encoders.β51Updated last year
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progrβ¦β30Updated 3 weeks ago
- Pre-train Static Word Embeddingsβ58Updated 3 weeks ago
- A spaCy wrapper for GliNERβ114Updated 3 months ago
- π₯ Use Hugging Face text and token classification pipelines directly in spaCyβ63Updated last year
- Information extraction from English and German texts based on predicate logicβ135Updated last year
- β43Updated 2 years ago
- KeypartX is a graph-based approach to represent perception (text in general) by key parts of speech.Updated last year
- β78Updated 2 years ago
- GLiNER model in a FastAPI microservice.β42Updated 4 months ago
- A Python library aimed at dissecting and augmenting NER training data.β58Updated last year
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.β108Updated 11 months ago
- Aim-spaCy integrationβ34Updated last year
- spaCy match and replace, maintaining conjugationβ35Updated 2 years ago
- A component orchestration engineβ28Updated last year
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Daβ102Updated last month
- Clean, filter and sample URLs to optimize data collection β Python & command-line β Deduplication, spam, content and language filtersβ137Updated 4 months ago
- 𦦠weasel: A small and easy workflow systemβ83Updated 10 months ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.β153Updated 11 months ago
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created byβ¦β30Updated 8 months ago
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extractionβ71Updated 9 months ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality β¦β106Updated last year
- A framework for converting natural language text inputs to corresponding Pandas, MongoDB, Kusto and Neo4j (Cypher) queries.β78Updated last year