kent37 / guess-language
Automatically exported from code.google.com/p/guess-language
☆53Updated 7 months ago
Related projects:
- An index data structure for approximate string search.☆23Updated 5 years ago
- Stanford Tregex-inspired language for rule-based dependency tree manipulation.☆21Updated 7 years ago
- Text readability metrics in Python.☆12Updated 11 years ago
- Wikipedia API wrapper for humans and elk. (en.wikipedia.org/w/api.php, get it?)☆36Updated 10 years ago
- Aho-Corasick string replacement utility☆23Updated 4 years ago
- Markdown -> IPython conversion tool☆15Updated 9 years ago
- Some convenient natural language tools that build on NLTK.☆85Updated 10 years ago
- Scrapes some Finnish word definitions from English Wiktionary.☆7Updated last year
- A spell-checker extending Peter Norvig's with multi-typo correction, Hamming distance weighting, and more.☆97Updated 3 years ago
- Python's missing statistical Swiss Army knife☆15Updated 9 years ago
- Python port for IWNLP.Lemmatizer☆17Updated 11 months ago
- A web application for exploring documents topically.☆26Updated 8 years ago
- Find RSS, Atom, XML, and RDF feeds on web pages☆30Updated last year
- Python wrapper for aspell (C extension and Python version)☆81Updated last year
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆42Updated 5 years ago
- Clone of https://code.google.com/p/splitta/ so it can be used as a Git submodule☆34Updated 11 years ago
- A Python implementation of the Metaphone and Double Metaphone algorithms☆80Updated 6 months ago
- Find which links on a web page are pagination links☆29Updated 7 years ago
- Frontend for Korp, a tool using the IMS Open Corpus Workbench (CWB).☆16Updated this week
- A Python library for generating word tree diagrams☆24Updated 4 years ago
- Common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text☆34Updated 7 years ago
- Extract the differences between two HTML pages☆32Updated 6 years ago
- Python bindings to the Compact Language Detector☆32Updated 4 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆60Updated 4 months ago
- Python wrapper for Apache OpenNLP tools☆34Updated 7 years ago
- Scrapy project with spiders to extract article content from various German news sites☆21Updated 11 years ago
- (Deprecated - please use https://github.com/gmarmstrong/python-datamuse) Python wrapper for the Datamuse API☆15Updated 6 years ago
- Python bindings for libwapiti☆66Updated 4 years ago