kevinmcmahon / tagger
A Python module for extracting relevant tags from text documents.
☆15Updated 13 years ago
Related projects
Alternatives and complementary repositories for tagger
- Reduction is a Python script that automatically summarizes a text by extracting the sentences deemed most important.☆54Updated 9 years ago
- common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text☆34Updated 8 years ago
- templatemaker is a Python library that can extract data from files with a similar format, like HTML pages.☆63Updated 4 years ago
- iCQA - Intelligent Community Question Answering Framework☆32Updated 8 years ago
- [hibernating] Dynamic topic models☆39Updated 9 years ago
- Automated NLP sentiment predictions: batteries included, or use your own data☆18Updated 6 years ago
- Knowledge extraction from web data☆92Updated 6 years ago
- Efficiently search the most similar strings against the query in Python.☆18Updated 6 years ago
- A Python library for finding feed links on websites.☆50Updated 2 years ago
- implement some outlier detection algorithms☆11Updated 9 years ago
- yael (Yet Another EPUB Library) is a Python library for reading, manipulating, and writing EPUB 2/3 files☆17Updated 9 years ago
- A whoosh-based CLI indexer and searcher for your files.☆16Updated 8 years ago
- Find which links on a web page are pagination links☆29Updated 7 years ago
- Python 2 & 3 wrapper around the Stanford Topic Modeling Toolbox. Intended to be used for hassle-free supervised topic classification with…☆59Updated 6 years ago
- Using Centroids of Word Embeddings and Word Mover's Distance for Biomedical Document Retrieval in Question Answering.☆15Updated 7 years ago
- Second project for UW LING 572. Automatic text summarization system.☆14Updated 11 years ago
- Paragraph Vector Implementation☆56Updated 7 years ago
- Wikipedia API wrapper for humans and elk. (en.wikipedia.org/w/api.php, get it?)☆36Updated 10 years ago
- Extract data from an HTML table and store results to a csv file.☆38Updated 9 years ago
- Scrapy extension which writes crawled items to Kafka☆30Updated 6 years ago
- SearchBetter: query rewriting for search engines on small corpuses (Harvard research project)☆31Updated 7 years ago
- ☆21Updated 9 years ago
- Python search module for fast approximate string matching☆53Updated last year
- Extract, parse and populate templates from strings☆27Updated 5 years ago
- LSTM-based query classification for Mandarin, implemented using TensorFlow☆20Updated 8 years ago
- A python implementation of DEPTA☆83Updated 7 years ago
- Latent Dirichlet Allocation with Gibbs sampling☆16Updated 10 years ago
- A disk-based key/value store in Python with no dependencies.☆21Updated 9 years ago
- ... just because nltk is too heavy☆36Updated 14 years ago
- C++ Ternary Search Tree implementation with Python bindings☆43Updated 6 years ago