faridani / PyNLPLinks
... just because nltk is too heavy
☆35Updated 14 years ago
Alternatives and similar repositories for PyNLP
Users that are interested in PyNLP are comparing it to the libraries listed below
Sorting:
- Lightweight, multilingual natural language processing☆63Updated 12 years ago
- Preprocess text for NLP (tokenizing, lowercasing, stemming, sentence splitting, etc.)☆29Updated 14 years ago
- A Python version (almost a port) of ProPublica's TableFu☆231Updated 11 years ago
- A Python implementation of the Double Metaphone algorithm☆61Updated 14 years ago
- Python's missing statistical Swiss Army knife☆15Updated 9 years ago
- Convert URL's to a normalized unicode format☆67Updated 7 years ago
- Tool to visualize data quickly with no brain usage for plot creation☆46Updated 6 years ago
- A visualizer for multi-dimensional semantic data☆38Updated 13 years ago
- Latent Dirichlet Allocation for topic modeling of streamed data sources☆100Updated 10 years ago
- Crab - A recommendation engine library for Python☆87Updated 13 years ago
- Updates to Zope's keyphrase extractor (forked from 1.1.0)☆67Updated 8 years ago
- Memory Mapped Stats Tools☆100Updated 11 years ago
- python-readability, but faster (mirror-ish)☆83Updated 13 years ago
- MapReduce platform in python☆34Updated 10 years ago
- feedparser but faster and worse☆104Updated 3 years ago
- Plots various graphs for a series of plaintext files in a directory☆19Updated 9 years ago
- Find which links on a web page are pagination links☆29Updated 8 years ago
- Modularly extensible semantic metadata validator☆84Updated 9 years ago
- simple python datastructure wrappings for redis☆105Updated 4 years ago
- A Python framework for describing binary file formats☆62Updated 11 years ago
- This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first step in almost any Natural Language Processing task, yet…☆29Updated 6 months ago
- Slinky, a high-performance web crawler / text analytics in Python, Redis, Hadoop, R, Gephi☆41Updated 14 years ago
- TweeQL is a Query Language for Tweets: SELECT brand(text) AS brand, sentiment(text) AS sentiment FROM twitter_sample;☆193Updated 11 years ago
- Pythonic interface to redis-py☆98Updated 7 years ago
- templatemaker is a Python library that can extract data from files with a similar format, like HTML pages.☆63Updated 4 years ago
- Python interface to Solr☆277Updated last year
- a python port of https://github.com/twitter/twitter-text-rb also available via `pip install twitter_text`☆82Updated 7 years ago
- Ultra simple API for geocoding a single string against various web services.☆183Updated 11 years ago
- csvcat☆22Updated 9 years ago
- Python library for creating word clouds from text☆51Updated 6 years ago