A PyData 2013 talk on straightforward, data-driven ways to handle natural language text in Python.
☆51 · Oct 23, 2014 · Updated 11 years ago
Alternatives and similar repositories for text-as-data
Users who are interested in text-as-data compare it with the repositories listed below.
- Bokeh tutorial, PyData Berlin ☆10 · May 29, 2015 · Updated 10 years ago
- Browser-based annotation tool for Framenet ☆16 · Jan 27, 2015 · Updated 11 years ago
- Discussion Summarization is the process of condensing a text document which is a collection of discussion threads, using CBS (Cluster Bas… ☆12 · Apr 10, 2014 · Updated 11 years ago
- gnowledge studio is a Python Django project for collaboratively creating and publishing knowledge (semantic) networks as blogging graphs. ☆51 · Mar 13, 2013 · Updated 12 years ago
- An implementation of Gibbs sampling for Latent Dirichlet Allocation ☆30 · Aug 3, 2011 · Updated 14 years ago
- Easy-to-follow text classification implementation using a convolutional neural network (TensorFlow) ☆15 · Apr 22, 2017 · Updated 8 years ago
- Python API for KB data-services ☆19 · Jan 30, 2020 · Updated 6 years ago
- Low-level primitives for collapsed Gibbs sampling in Python and C++ ☆33 · Mar 27, 2024 · Updated last year
- Reduction is a Python script which automatically summarizes a text by extracting the sentences which are deemed to be most important. ☆54 · Mar 8, 2015 · Updated 10 years ago
- Tree-adjoining grammar based statistical dependency parser using a general linear model (glm). ☆28 · Feb 8, 2017 · Updated 9 years ago
- Explore different deep-learning frameworks ☆18 · Jun 14, 2018 · Updated 7 years ago
- Presentations for JuliaCon ☆70 · Mar 7, 2016 · Updated 9 years ago
- Replication software, data, and supplementary materials for the paper: O'Connor, Stewart and Smith, ACL-2013, "Learning to Extract Intern… ☆27 · Dec 14, 2020 · Updated 5 years ago
- Tools for tracking stories on news homepages ☆48 · Oct 22, 2019 · Updated 6 years ago
- Dynamic Community Finding ☆26 · Mar 12, 2018 · Updated 7 years ago
- MITIE: library and tools for information extraction ☆29 · Jan 22, 2015 · Updated 11 years ago
- Tools and Libraries for Lexicon-Based Sentiment Analysis ☆24 · Sep 18, 2016 · Updated 9 years ago
- The Summarizer from the Web IR / NLP Group (WING), hence SWING, is a modular, state-of-the-art automatic extractive text summarization sy… ☆38 · Nov 10, 2014 · Updated 11 years ago
- My Tutorial for PyData London ☆26 · Jun 18, 2015 · Updated 10 years ago
- The news homepage archive ☆80 · Oct 3, 2021 · Updated 4 years ago
- Attempts to determine the natural language of a selection of Unicode (utf-8) text (a clone of http://code.google.com/p/guess-language wit… ☆48 · Feb 22, 2010 · Updated 16 years ago
- Vector Space Model Framework developed for InPhO ☆39 · May 9, 2025 · Updated 9 months ago
- The Moodboard Plugin is pretty self-descriptive: it creates moodboards! Just type in the topic you want to be inspired on, and get a whol… ☆12 · May 27, 2018 · Updated 7 years ago
- The goal of this experiment is to take articles and certain metadata and group them by topic. ☆11 · Apr 14, 2016 · Updated 9 years ago
- Probabilistic Data Structures in Python (originally presented at PyData 2013) ☆55 · Jan 6, 2022 · Updated 4 years ago
- Cloud Mining automatically builds exploratory faceted search systems. ☆52 · Oct 15, 2013 · Updated 12 years ago
- AI program that uses word associations, a directed weighted graph, and machine learning ☆12 · Jun 5, 2010 · Updated 15 years ago
- ☆33 · Feb 27, 2014 · Updated 12 years ago
- Digitization information system built on top of Fedora repository ☆16 · Jan 15, 2019 · Updated 7 years ago
- Focused Crawler for VT's CTRNet ☆10 · May 13, 2013 · Updated 12 years ago
- Dense Wireless Connectivity Datasets for the IoT. ☆11 · Aug 13, 2019 · Updated 6 years ago
- Small utility that loads any downloaded JSON databases from www.phishtank.com into Redis cache for quick local queries ☆11 · Aug 8, 2016 · Updated 9 years ago
- EZ conversion of OmniOutliner .ooutline files to github/gitlab friendly markdown. Works via a single Python function call, can easily be … ☆11 · Dec 1, 2017 · Updated 8 years ago
- PyData Boston 2013 talks: "Intro to scikit-learn" & "Realtime Predictive Analytics: Using scikit-learn and RabbitMQ" ☆11 · Jan 5, 2014 · Updated 12 years ago
- Green SqlAlchemy extensions for pulsar ☆11 · Nov 24, 2017 · Updated 8 years ago
- Alfred Workflow to get recent files in folders ☆13 · Nov 28, 2022 · Updated 3 years ago
- Bicycle Incident reporting ☆13 · Jul 22, 2022 · Updated 3 years ago
- Sequential anomaly detection method evaluation ☆18 · Mar 9, 2013 · Updated 12 years ago
- Slides and code for "Validating Models in R" Strata 2016 RDay http://conferences.oreilly.com/strata/hadoop-big-data-ca/public/schedule/de… ☆10 · Jun 22, 2020 · Updated 5 years ago