neuml / txtmarkerLinks
ποΈ Highlight text in documents
β108Updated 4 months ago
Alternatives and similar repositories for txtmarker
Users that are interested in txtmarker are comparing it to the libraries listed below
Sorting:
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K β¦β83Updated 7 months ago
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.β59Updated last year
- π₯ Use Hugging Face text and token classification pipelines directly in spaCyβ63Updated last year
- Python package for deduplication/entity resolution using active learningβ81Updated last year
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progrβ¦β33Updated 4 months ago
- π Datasets and models for instruction-tuningβ238Updated last year
- Generalist and Lightweight Model for Text Classificationβ155Updated 2 months ago
- Clean, filter and sample URLs to optimize data collection β Python & command-line β Deduplication, spam, content and language filtersβ143Updated 7 months ago
- NeatText a simple NLP package for cleaning textual data and text preprocessingβ72Updated last year
- Python API for https://vespa.ai, the open big data serving engineβ136Updated this week
- Information extraction from English and German texts based on predicate logicβ138Updated 2 years ago
- Repository for deepdoctection tutorial notebooksβ46Updated 2 months ago
- Pre-train Static Word Embeddingsβ85Updated 2 months ago
- A framework for converting natural language text inputs to corresponding Pandas, MongoDB, Kusto and Neo4j (Cypher) queries.β88Updated last year
- Efficient few-shot learning with cross-encoders.β56Updated last year
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. Iβ¦β113Updated last month
- π Reference-Free automatic summarization evaluation with potential hallucination detectionβ103Updated last year
- Blazing fast fuzzy text search for Python.β45Updated 4 months ago
- This is the repo for the container that holds the models for the text2vec-transformers moduleβ54Updated 2 weeks ago
- Streamlit component for Jina neural searchβ42Updated 3 years ago
- Aim-spaCy integrationβ34Updated 2 years ago
- Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector searchβ27Updated last year
- 𦦠weasel: A small and easy workflow systemβ85Updated last year
- Tutorial and template for a semantic search app powered by the Atlas Embedding Database, Langchain, OpenAI and FastAPIβ115Updated last year
- Command Line Interface for Hugging Face Inference Endpointsβ66Updated last year
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.β80Updated last year
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extractionβ77Updated last year
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.β108Updated last year
- π’ Work with static vector modelsβ28Updated 4 months ago
- π« SpaCy wrapper for ConceptNet π«β93Updated 2 years ago