knmnyn / ParsCit
An open-source CRF Reference String Parsing Package
☆160 Updated 5 years ago
Alternatives and similar repositories for ParsCit
Users who are interested in ParsCit are comparing it to the libraries listed below.
- Neuralized version of the Reference String Parser component of the ParsCit package. ☆81 Updated 3 years ago
- System for building, visualizing, and working with LDA topic models ☆97 Updated 2 weeks ago
- High-level build project for all LAPDF-Text submodules ☆103 Updated 10 years ago
- (Mental) maps of texts with kernel density estimation and force-directed networks. ☆108 Updated 10 years ago
- Version 1.0 of the CrowdTruth Framework for crowdsourcing ground truth data, for training and evaluation of cognitive computing systems. … ☆60 Updated 7 years ago
- Bibliographic Entity Automatic Recognition and Disambiguation ☆65 Updated 5 years ago
- Linguistic search for large annotated text corpora, based on Apache Lucene ☆119 Updated last week
- Text Re-use Alignment Visualization ☆38 Updated 8 years ago
- Multi-Entity Extraction Framework for Academic Documents (with default extraction tools) ☆31 Updated 2 years ago
- Download metadata for all DOIs using the Crossref API ☆66 Updated 7 years ago
- Python module for bibliographic network analysis. ☆86 Updated 5 years ago
- Content ExtRactor and MINEr ☆511 Updated 3 years ago
- Build tables of information by extracting facts from indexed text corpora via a simple and effective query language. ☆56 Updated 6 years ago
- Python scripts for retrieving CSV data from the Google Ngram Viewer and plotting it in XKCD style. The Python script for retrieving ngram… ☆254 Updated 5 years ago
- SerendipSlim is a visualization tool for exploring topic models built on large collections of text documents. ☆39 Updated 7 years ago
- ☆40 Updated 7 years ago
- Functional and structural analysis of tables in research papers (Table disentangling) ☆20 Updated 8 years ago
- Repository for the allofplos project. ☆66 Updated 8 months ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu… ☆66 Updated 2 months ago
- Quickly extract multi-word phrases from a corpus ☆195 Updated 5 years ago
- Corpus of Open Access articles from multiple fields in Science, Technology, and Medicine. ☆74 Updated 8 years ago
- Topic Words in Context (TWiC) is a highly-interactive, browser-based visualization for MALLET topic models ☆51 Updated 8 years ago
- An implementation of latent Dirichlet allocation in JavaScript ☆185 Updated 3 years ago
- An intelligent reading agent that understands text and translates it into Wikidata statements. ☆116 Updated 9 years ago
- A Named-Entity Recogniser based on Grobid. ☆54 Updated 8 months ago
- ☆98 Updated 4 years ago
- Data Server for Topic Models ☆122 Updated 2 years ago
- Quantitative Text Analysis for the digitale Geisteswissenschaften ☆47 Updated 10 years ago
- Extraction Toolkit ☆83 Updated 4 years ago
- Command line tool to extract figures, tables, and captions from scholarly documents in PDF form. ☆129 Updated 7 years ago