aphp / edspdf
EDS-PDF is a generic, pure-Python framework for text extraction from PDF documents. It provides the machinery to use rule-based or machine-learning-based approaches to classify text blocks as body or metadata.
☆38Updated last month
Related projects:
- Modular, fast NLP framework, compatible with Pytorch and spaCy, offering tailored support for French clinical notes.☆112Updated last week
- EDS-Pseudo is a hybrid model for detecting personally identifying entities in clinical reports☆44Updated last month
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx-like syntax.☆57Updated 4 months ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i…☆46Updated 5 months ago
- Python package for deduplication/entity resolution using active learning☆77Updated 3 weeks ago
- eds-scikit is a Python library providing tools to process and analyse OMOP data☆35Updated 2 months ago
- A Python library aimed at dissecting and augmenting NER training data.☆56Updated last year
- ☆53Updated 8 months ago
- A spaCy custom component that extracts and normalizes temporal expressions☆53Updated last year
- Communication about the pseudonymization engine of the Cour de Cassation☆17Updated last year
- Question Answering annotation platform☆87Updated 3 years ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆61Updated 6 months ago
- 🤗 Push your spaCy pipelines to the Hugging Face Hub☆42Updated 3 months ago
- A spaCy wrapper for GliNER☆77Updated 2 months ago
- 🧪 Cutting-edge experimental spaCy components and features☆94Updated 4 months ago
- The CleanCoNLL dataset from our EMNLP 2023 paper where we corrected annotation errors and inconsistencies in CoNLL-03.☆16Updated 2 months ago
- DrBERT: A Robust Pre-trained Model in French for Biomedical and Clinical domains☆16Updated 7 months ago
- Tools for interactive visual exploration of semantic embeddings.☆24Updated 2 weeks ago
- 🔎 A Prodigy plugin for evaluating spaCy pipelines☆12Updated 5 months ago
- Source code and data for Like a Good Nearest Neighbor☆28Updated 7 months ago
- spaCy entry points for Curated Transformers☆23Updated 2 weeks ago
- Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)☆77Updated this week
- A High-level Library for Named Entity Recognition in Python.☆23Updated 9 months ago
- Aim-spaCy integration☆34Updated last year
- 💫 SpaCy wrapper for ConceptNet 💫☆88Updated last year
- Link raw affiliations to ROR IDs☆24Updated last year
- A Serverless Text Annotation Tool for Corpus Development☆50Updated 8 months ago
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️☆131Updated 3 months ago
- Blue Brain text mining toolbox for semantic search and structured information extraction☆40Updated last year