Tools, wrappers, etc... for data science with a concentration on text processing
☆207Nov 9, 2022Updated 3 years ago
Alternatives and similar repositories for rosetta
Users that are interested in rosetta are comparing it to the libraries listed below
Sorting:
- This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first step in almost any Natural Language Processing task, yet…☆31Feb 2, 2026Updated last month
- Sklearn implementation of GBM to predict mu(X) and std(X) on heteroscedastic data☆25Jun 3, 2016Updated 9 years ago
- python-timbl, originally developed by Sander Canisius, is a Python extension module wrapping the full TiMBL C++ programming interface. Wi…☆18May 2, 2025Updated 10 months ago
- Fast, easy and intuitive machine learning prototyping.☆124Jun 3, 2014Updated 11 years ago
- ☆13Nov 30, 2015Updated 10 years ago
- Machine learning in nim☆12Aug 16, 2014Updated 11 years ago
- PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, an…☆477Sep 14, 2023Updated 2 years ago
- topics Models extension for Mallet & scikit-learn☆49Mar 27, 2017Updated 8 years ago
- A project for clustering text streams using locality-sensitive hashing (LSH) in Python☆26Sep 23, 2011Updated 14 years ago
- ☆10Feb 13, 2024Updated 2 years ago
- A collection of documents and materials for the EMNLP-2015 Semantic Similarity tutorial☆30Sep 30, 2015Updated 10 years ago
- Yet another tensor library☆23Mar 29, 2017Updated 8 years ago
- Pitman-Yor processes in python☆26Apr 18, 2014Updated 11 years ago
- Automated NLP sentiment predictions- batteries included, or use your own data☆18Dec 15, 2017Updated 8 years ago
- Distributed text analysis suite based on Celery☆95Dec 15, 2022Updated 3 years ago
- A Python wrapper for MADlib(http://madlib.net) - an open source library for scalable in-database machine learning algorithms☆63Nov 18, 2020Updated 5 years ago
- ☆20Jun 26, 2017Updated 8 years ago
- The experiment software underlying two papers published at ECIR-2015 and SEMEVAL-2015.☆37Mar 27, 2015Updated 10 years ago
- common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text☆35Sep 30, 2016Updated 9 years ago
- Low-level primitives for collapsed Gibbs sampling in python and C++☆33Mar 27, 2024Updated last year
- Quick informal survey at the Los Angeles Machine learning meetup about tools used for machine learning.☆51Jun 28, 2015Updated 10 years ago
- Peter Taylor research☆10Jul 18, 2015Updated 10 years ago
- Notebooks (and slides) for my PyData NYC 2014 tutorial on the more advanced features of scikit-learn.☆68Dec 4, 2014Updated 11 years ago
- C++ library for modeling with Pitman-Yor processes☆34Nov 28, 2017Updated 8 years ago
- Notes to accompany Thomas Piketty, Capital in the Twenty-First Century (Harvard University Press, 2014)☆12May 10, 2020Updated 5 years ago
- ☆10Jan 30, 2017Updated 9 years ago
- Jupyter notebook containing code from text preprocessing blog post☆10Nov 29, 2016Updated 9 years ago
- ☆11Apr 13, 2017Updated 8 years ago
- ☆10Jul 5, 2016Updated 9 years ago
- A lightweight Python script that fetches data from a Google spreadsheet, transforms to JSON, then optionally commits a data file to a Git…☆10Feb 23, 2023Updated 3 years ago
- Finance 6470: Derivatives Markets☆10Apr 15, 2021Updated 4 years ago
- Turbo topics find significant multiword phrases in topics.☆46Jun 16, 2015Updated 10 years ago
- Tools for tracking stories on news homepages☆48Oct 22, 2019Updated 6 years ago
- Scikit-learn compatible tools using theano☆365Feb 28, 2017Updated 9 years ago
- People. Places. Things. Graphs.☆93Oct 2, 2014Updated 11 years ago
- Efficient, concise stream data processing.☆12Jul 22, 2015Updated 10 years ago
- A web application that recommends songs via "country arithmetic" and hand-rolled Implicit Matrix Factorization☆10May 5, 2017Updated 8 years ago
- This is the repo with the code snippets that supply the "R + Google Analytics = FUN" post regarding getting speed metrics and clickstream…☆32Jun 24, 2016Updated 9 years ago
- Calculation of electricity CO₂ intensity at national, state, and NERC regions from 2001-present☆13Nov 25, 2019Updated 6 years ago