dereckson / extract-proper-nouns
Extract proper nouns from an English text with NLTK POS tagging
☆22 Updated 7 years ago
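As a rough illustration of the technique named in the description (not the repository's actual code), proper nouns can be picked out of POS-tagged tokens by keeping the Penn Treebank tags `NNP` and `NNPS` and joining adjacent hits into multi-word names. The function name `extract_proper_nouns` and the hand-tagged sample are assumptions made for this sketch. With NLTK installed, the `(word, tag)` pairs would typically come from `nltk.pos_tag(nltk.word_tokenize(text))`.

```python
from itertools import groupby

# Penn Treebank tags for singular and plural proper nouns.
PROPER_NOUN_TAGS = {"NNP", "NNPS"}


def extract_proper_nouns(tagged_tokens):
    """Join runs of consecutive proper-noun tokens into names, in text order.

    `tagged_tokens` is a list of (word, tag) pairs, the shape returned by
    a POS tagger such as nltk.pos_tag.
    """
    names = []
    # groupby splits the sequence into runs of proper / non-proper tokens.
    for is_proper, run in groupby(
        tagged_tokens, key=lambda pair: pair[1] in PROPER_NOUN_TAGS
    ):
        if is_proper:
            names.append(" ".join(word for word, _tag in run))
    return names


# Hand-tagged sample (hypothetical), standing in for real tagger output.
sample = [
    ("Ada", "NNP"), ("Lovelace", "NNP"), ("worked", "VBD"), ("with", "IN"),
    ("Charles", "NNP"), ("Babbage", "NNP"), ("in", "IN"), ("London", "NNP"),
    (".", "."),
]

print(extract_proper_nouns(sample))
# → ['Ada Lovelace', 'Charles Babbage', 'London']
```

Working on pre-tagged pairs keeps the sketch independent of NLTK's downloadable tagger models. The accuracy of a real pipeline depends on the tagger, which can mislabel capitalized sentence-initial words.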
Alternatives and similar repositories for extract-proper-nouns
Users who are interested in extract-proper-nouns are comparing it to the libraries listed below
- Convert Wiktionary entries to various formats such as StarDict or DB (MariaDB/MySQL). This used to be the main repository for this projec… ☆15 Updated 3 years ago
- Serving content from a WARC ☆62 Updated 13 years ago
- source man pages for explainshell.com ☆17 Updated 10 years ago
- Attempts to determine the natural language of a selection of Unicode (utf-8) text (a clone of http://code.google.com/p/guess-language wit… ☆48 Updated 15 years ago
- Wiktionary parser tool for many language editions. ☆54 Updated 3 years ago
- Sort-friendly URI Reordering Transform (SURT) python module ☆44 Updated 4 months ago
- An LL parser for extracting information from Wiki text, particularly Wiktionary. ☆49 Updated 2 years ago
- A small command-line utility that allows you to download closed captions from YouTube as a SRT file. ☆30 Updated 9 years ago
- Some convenient natural language tools that build on NLTK. ☆85 Updated 11 years ago
- Offline bilingual dictionaries made using data from Wiktionary ☆62 Updated 10 years ago
- Shell scripts to assist downloading & processing the Google n-grams corpora ☆14 Updated 8 years ago
- A list of tools related to W(eb)ARC(hive) ☆67 Updated 11 years ago
- A Python script to speak some text with Google Translate. ☆23 Updated 12 years ago
- Language checker and hyphenator extension for LibreOffice ☆12 Updated 6 years ago
- A collection of small scripts to do various things ☆32 Updated 10 years ago
- Resources for conservation, development, and documentation of low resource (human) languages. ☆432 Updated 9 months ago
- track changes to the news, where news is anything with an RSS feed ☆182 Updated 5 years ago
- iPython-based tutorial in Noun Phrase chunking with the NLTK. Written to accompany PyCon 2015 poster presentation. ☆17 Updated 10 years ago
- Automatically exported from code.google.com/p/guess-language ☆54 Updated 3 months ago
- A dynamically generated thesaurus using Syntactic N-grams parsed by Google Research. Rather than providing synonyms, this thesaurus provi… ☆15 Updated 12 years ago
- Lexical lemmatizer of Italian text ☆13 Updated 8 years ago
- Command-line interface for After the Deadline language checker ☆106 Updated 6 years ago
- automate incrementally producing word pronunciation recordings for Wiktionary through Wikimedia Commons ☆22 Updated 7 years ago
- An offline/online field database which adapts to its user's terminology and I-Language. http://fielddb.github.io ☆82 Updated this week
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings. ☆57 Updated 4 years ago
- An online annotation platform for teaching and learning in the humanities. ☆108 Updated last week
- A simple interface to the Project Gutenberg corpus. ☆331 Updated 3 years ago
- Python library for reading and writing warc files ☆247 Updated 3 years ago
- Greek treebank from the Perseus Digital Library ☆12 Updated 9 years ago
- This repository contains a tool and a collections dataset for detecting off-topic pages in Web archive collections. ☆18 Updated 10 years ago