louisgeisler / Doc2Map
Transform a corpus of text documents (any kind) into a map with different zoom levels and topics names to summarise sub corpus of similar docs.
☆29Updated last year
Alternatives and similar repositories for Doc2Map:
Users that are interested in Doc2Map are comparing it to the libraries listed below
- Finds linguistic patterns effortlessly☆35Updated last year
- KeypartX is a graph-based approach to represent perception (text in general) by key parts of speech.Updated last year
- Vectorizers for a range of different data types☆99Updated last week
- ☆54Updated last year
- A PyPI package for easy text annotation in a Jupyter Notebook.☆28Updated 3 years ago
- Pipeline components that support partial_fit.☆45Updated 7 months ago
- Collection of public APIs for embedding scientific papers☆56Updated 3 years ago
- Blue Brain text mining toolbox for semantic search and structured information extraction☆44Updated last year
- ☆17Updated 2 years ago
- NERtwork is a collection of scripts to help you create a network graph of co-occurring named entities using open source tools. This is do…☆49Updated 10 months ago
- Source code and data for Like a Good Nearest Neighbor☆28Updated last month
- Train, evaluate, and use different unsupervised topic modelling algorithms using a RESTful API.☆36Updated last year
- 🤗 Push your spaCy pipelines to the Hugging Face Hub☆43Updated 8 months ago
- RaKUn 2.0 - A fast keyword detection algorithm☆65Updated last week
- A spaCy custom component that extracts and normalizes temporal expressions☆54Updated 2 years ago
- Preprocessing and analysis for training SNOMED-CT concept embeddings from CORD-19 corpus☆14Updated last year
- 🧪 Cutting-edge experimental spaCy components and features☆96Updated 9 months ago
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated last year
- A python library for the Semantic Scholar (S2) API with typed pydantic objects and various nifty functionalities.☆19Updated 3 years ago
- A library of tools for dictionary-based Named Entity Recognition (NER), based on word vector representations to expand dictionary terms.☆24Updated last year
- Explaining dimensionality results using SHAP values☆53Updated last month
- Package to help with scientific literature research☆25Updated 2 years ago
- Implementation of the Paper "Towards an Automated Argument Mining Pipeline to Transform Plain Text to Argument Graphs"☆22Updated last year
- spaCy match and replace, maintaining conjugation☆35Updated 2 years ago
- Fuzzy Topic Models☆26Updated 9 months ago
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.☆14Updated 6 months ago
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020☆62Updated 9 months ago
- Python package for deduplication/entity resolution using active learning☆76Updated 5 months ago
- Python framework for graph analytics and co-occurrence analysis☆32Updated 7 months ago
- Analyze Argumentation and Rhetorical Aspects in Scientific Writing.☆19Updated 2 years ago