dohliam / more-stoplistsLinks
stoplists for African languages generated from the ASP corpus
☆14Updated 10 years ago
Alternatives and similar repositories for more-stoplists
Users that are interested in more-stoplists are comparing it to the libraries listed below
Sorting:
- SerendipSlim is a visualization tool for exploring topic models built on large collections of text documents.☆39Updated 7 years ago
- linguistics backend☆42Updated 2 years ago
- Topic Words in Context (TWiC) is a highly-interactive, browser-based visualization for MALLET topic models☆51Updated 8 years ago
- Tools for tracking stories on news homepages☆48Updated 6 years ago
- command-line tool to extract taxonomies from Wikidata☆129Updated 6 years ago
- Newsclipse: The IDE for news production.☆91Updated 11 years ago
- Command-line tool to extract a ranked list of relevant keywords from a corpus with the option of using either topic modeling or tf-idf sc…☆41Updated 8 years ago
- Wiktionary parser tool for many language editions.☆54Updated 3 years ago
- An offline/online field database which adapts to its user's terminology and I-Language. http://fielddb.github.io☆82Updated 2 weeks ago
- Website content for annotatorjs.org☆16Updated 5 years ago
- An implementation of latent Dirichlet allocation in javascript☆185Updated 3 years ago
- Extract case law citations with Node☆59Updated 11 years ago
- Explore your own text collection with a topic model – without prior knowledge.☆66Updated this week
- Open Access PDF harvester☆42Updated last year
- A visual timeline authoring tool that extracts temporal information from freeform text☆65Updated 2 years ago
- Text Thresher crowd sourced text annotator☆17Updated 8 years ago
- A full-stack publishing solution involving different technologies to power digital archives☆158Updated 5 years ago
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts☆28Updated 4 years ago
- A modular annotation system that supports complex, interactive annotation graphs embedded on top of sequences of text.☆98Updated 4 years ago
- A simple interface to the Project Gutenberg corpus.☆17Updated 10 years ago
- A Python library for topic modeling and visualization☆67Updated 5 years ago
- A cloud-based, open-source system for writing and publishing dictionaries.☆99Updated 2 years ago
- Linguistic search for large annotated text corpora, based on Apache Lucene☆119Updated last week
- Python package for stylometry☆64Updated 4 years ago
- ☆99Updated 4 years ago
- bilingual dictionary extractor from parallel corpora☆23Updated 11 years ago
- An online annotation platform for teaching and learning in the humanities.☆108Updated 2 weeks ago
- Diary for qualitative analysis☆28Updated last week
- Quickly extract multi-word phrases from a corpus☆195Updated 5 years ago
- A Knowledge Base for research software relying on large-scale text mining and curated knowledge sources☆16Updated 2 years ago