instituutnederlandsetaal / BlackLabLinks
Linguistic search for large annotated text corpora, based on Apache Lucene
☆115Updated this week
Alternatives and similar repositories for BlackLab
Users that are interested in BlackLab are comparing it to the libraries listed below
Sorting:
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g…☆113Updated 8 months ago
- Multi Tier Annotation Search☆26Updated 4 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆65Updated last year
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts☆27Updated 4 years ago
- BlackLab Frontend, a feature-rich corpus search interface for BlackLab.☆22Updated last week
- ANNIS is an open source, versatile web browser-based search and visualization architecture for complex multilevel linguistic corpora with…☆75Updated 4 months ago
- linguistics backend☆41Updated 2 years ago
- Multi Tier Annotation Search☆12Updated last year
- ☆32Updated 2 years ago
- A Corpus Data Retrieval Index using Lucene for Look-Ups☆18Updated last week
- Python package for stylometry☆63Updated 4 years ago
- This repository contains the Framester resource, the main outcome of the framester project.☆33Updated 5 years ago
- Framework for creating and accessing UBY resources – sense-linked lexical resources in standard UBY-LMF format☆22Updated 7 years ago
- A Named-Entity Recogniser based on Grobid.☆54Updated 4 months ago
- A set of workflows for corpus building through OCR, post-correction and normalisation☆49Updated 3 years ago
- SerendipSlim is a visualization tool for exploring topic models built on large collections of text documents.☆39Updated 7 years ago
- A Java UIMA-based toolbox for multilingual and efficient terminology extraction an multilingual term alignment☆42Updated 8 years ago
- An advanced, extensible web front-end for the Manatee-open corpus search engine☆75Updated 3 weeks ago
- High-performance text aligner for large collections of texts☆52Updated this week
- System for building, visualizing, and working with LDA topic models☆97Updated 2 months ago
- Citation Classification using hybrid neural network model for Wikipedia References☆30Updated 2 years ago
- Command-line tool to extract a ranked list of relevant keywords from a corpus with the option of using either topic modeling or tf-idf sc…☆40Updated 8 years ago
- 🆕 Work continues on INCEpTION 👉 https://github.com/inception-project/inception 👈 -- ⚠️ The official WebAnno repository has reached the…☆249Updated 2 years ago
- Anafora is a web-based raw text annotation tool☆244Updated 3 years ago
- Open Access PDF harvester☆42Updated last year
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate…☆52Updated 5 years ago
- Software for multi-level annotation of linguistic corpora☆17Updated 5 years ago
- An open-source CRF Reference String Parsing Package☆160Updated 5 years ago
- A point-and-click tool for creating and analyzing topic models produced by MALLET.☆111Updated 4 years ago
- An intelligent reading agent that understands text and translates it into Wikidata statements.☆116Updated 9 years ago