eliask / pdfssa4met
PDF Structure and Syntactic Analysis for Metadata Extraction and Tagging - https://code.google.com/p/pdfssa4met/
☆19 Updated 12 years ago
Alternatives and similar repositories for pdfssa4met
Users who are interested in pdfssa4met compare it to the libraries listed below.
- Formal concept analysis lattice generation and query in Python ☆13 Updated 11 years ago
- Functional and structural analysis of tables in research papers (Table disentangling) ☆20 Updated 7 years ago
- Python module for bibliographic network analysis. ☆85 Updated 4 years ago
- ☆40 Updated 7 years ago
- This repository contains the Domain Discovery Tool (DDT) project. DDT is an interactive system that helps users explore and better unders… ☆45 Updated 3 years ago
- modification of bibliotools 2.2 from Sébastian Grauwin ☆11 Updated 6 years ago
- Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia description… ☆11 Updated 2 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu… ☆64 Updated last year
- A Named-Entity Recogniser based on Grobid. ☆53 Updated last month
- Open Access PDF harvester ☆40 Updated last year
- Disambiguating biomedical and clinical concepts with word embeddings ☆14 Updated 7 years ago
- Finds linguistic patterns effortlessly ☆36 Updated last year
- Regex like pattern tree matching but on sentence's tree instead of Strings ☆42 Updated 7 years ago
- Python text processing, pattern matching, and NLP framework ☆66 Updated last year
- Bibliographic Entity Automatic Recognition and Disambiguation ☆65 Updated 4 years ago
- Processing OpenCitations Data ☆20 Updated 7 years ago
- An open-source CRF Reference String Parsing Package ☆158 Updated 5 years ago
- Multi-Entity Extraction Framework for Academic Documents (with default extraction tools) ☆31 Updated last year
- SpExtor: Sparse Entity Extractor ☆11 Updated 5 years ago
- A machine learning software for extracting information from scholarly documents ☆23 Updated 4 years ago
- A browser extension providing Open Access bibliographical services ☆17 Updated 2 years ago
- CrowdTruth framework for crowdsourcing ground truth for training & evaluation of AI systems ☆61 Updated last year
- A modular annotation system that supports complex, interactive annotation graphs embedded on top of sequences of text. ☆95 Updated 3 years ago
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions ☆19 Updated 2 years ago
- Using word embeddings (word2vec) for ontology learning ☆19 Updated 8 years ago
- Python library that classifies content from scientific papers with the topics of the Computer Science Ontology (CSO). ☆90 Updated 6 months ago
- Named entity recognition for the legal domain ☆42 Updated 4 years ago
- Trying to generate name synonyms from wikidata ☆32 Updated 4 years ago
- sequence tagging with spaCy and crfsuite ☆20 Updated 2 years ago
- Extraction Toolkit ☆83 Updated 3 years ago