neuml / txtmarker
Highlight text in documents
☆111 · Updated 9 months ago
Alternatives and similar repositories for txtmarker
Users who are interested in txtmarker are comparing it to the libraries listed below.
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K … ☆86 · Updated last year
- NeatText a simple NLP package for cleaning textual data and text preprocessing ☆75 · Updated 2 years ago
- OCR, Archive, Index and Search: Implementation agnostic OCR framework. ☆224 · Updated 2 years ago
- A framework for converting natural language text inputs to corresponding Pandas, MongoDB, Kusto and Neo4j (Cypher) queries. ☆92 · Updated last year
- Repository for deepdoctection tutorial notebooks ☆50 · Updated last month
- Tool to apply Legal Matter Specification Standard (LMSS) to documents ☆12 · Updated last year
- Use Hugging Face text and token classification pipelines directly in spaCy ☆63 · Updated last year
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO… ☆53 · Updated 10 months ago
- Datasets and models for instruction-tuning ☆238 · Updated 2 years ago
- Python package for deduplication/entity resolution using active learning ☆83 · Updated last year
- GLiNER model in a FastAPI microservice. ☆47 · Updated last year
- Streamlit component for Jina neural search ☆42 · Updated 4 years ago
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax. ☆59 · Updated last year
- Efficient few-shot learning with cross-encoders. ☆62 · Updated last year
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy. ☆81 · Updated 2 years ago
- weasel: A small and easy workflow system ☆90 · Updated 2 months ago
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. I… ☆121 · Updated 2 weeks ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models. ☆111 · Updated last year
- A pythonic library providing light-weighted interface with LLMs ☆131 · Updated 8 months ago
- Generalist and Lightweight Model for Text Classification ☆169 · Updated 2 weeks ago
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr… ☆34 · Updated 5 months ago
- Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters ☆158 · Updated last month
- Effortlessly extract information from unstructured data with this library, utilizing advanced AI techniques. Compose AI in customizable p… ☆87 · Updated last year
- Fine-tune OpenAI models for text classification, question answering, and more ☆17 · Updated 2 years ago
- Unstructured Data Connectors for Haystack 2.0 ☆17 · Updated 2 years ago
- Evaluation framework for document processing models and services. ☆63 · Updated this week
- Python library that allows you to get structured responses in the form of Pydantic models and Python types from Anthropic, Google Vertex … ☆81 · Updated 4 months ago
- ☆20 · Updated 4 months ago
- Reference-Free automatic summarization evaluation with potential hallucination detection ☆103 · Updated 2 years ago
- Work with static vector models ☆36 · Updated 9 months ago