worldwise001 / stylometry
Stylometric framework in Python
☆13Updated 9 years ago
Alternatives and similar repositories for stylometry:
Users that are interested in stylometry are comparing it to the libraries listed below
- code and slides for my PyGotham 2016 talk, "Higher-level Natural Language Processing with textacy"☆15Updated 8 years ago
- MetroMaps Release☆16Updated 10 years ago
- A workflow system for Natural Language Processing.☆21Updated 5 years ago
- 💫 Runtime performance comparison of spaCy against other NLP libraries☆20Updated 2 years ago
- A repository for the "Combining DBpedia and Topic Modeling" GSoC 2016 idea☆13Updated 8 years ago
- Whit is an open source SMS service, which allows you to query CrunchBase, Wikipedia, and several other data APIs.☆198Updated 11 years ago
- Take streaming tweets, extract hashtags & usernames, create graph, export graphml for Gephi visualisation☆34Updated 11 years ago
- Discussion Summarization is the process of condensing a text document which is a collection of discussion threads, using CBS (Cluster Bas…☆12Updated 10 years ago
- framework for making streamcorpus data☆11Updated 7 years ago
- Python SDK for the TextRazor Text Analytics API☆20Updated last year
- Linking Entities in CommonCrawl Dataset onto Wikipedia Concepts☆59Updated 12 years ago
- An index data structure for approximate string search.☆23Updated 5 years ago
- classify a job description (or noisy job title) into a ONET job title☆18Updated 8 years ago
- A python wrapper for Semaphore, a Shallow Semantic Parser that identifies roles in a text.☆12Updated 11 years ago
- Browser-based annotation tool for Framenet☆15Updated 10 years ago
- This is the text partitioner project for Python.☆21Updated 6 years ago
- A web application for exploring documents topically.☆26Updated 8 years ago
- RESTful API around the PETRARCH coding software☆10Updated 3 years ago
- Regex like pattern tree matching but on sentence's tree instead of Strings☆42Updated 6 years ago
- Semanticizest: dump parser and client☆20Updated 8 years ago
- Virtual patent marking crawler at iproduct.epfl.ch☆14Updated 7 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 5 years ago
- extract relationships from standardized terms from corpus of interest with deep learning☆20Updated 5 years ago
- bigram / trigram analysis of wikipedia; mainly mutual info☆22Updated 12 years ago
- Opinion miner based of Machine Learning that can be trained on a corpus of KAF/NAF files☆9Updated 6 years ago
- Automated NLP sentiment predictions- batteries included, or use your own data☆18Updated 7 years ago
- Second project for UW LING 572. Automatic text summarization system.☆13Updated 11 years ago
- python-timbl, originally developed by Sander Canisius, is a Python extension module wrapping the full TiMBL C++ programming interface. Wi…☆18Updated 3 weeks ago
- common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text☆35Updated 8 years ago