Quickly extract multi-word phrases from a corpus
☆195Jun 25, 2020Updated 5 years ago
Alternatives and similar repositories for phrasemachine
Users that are interested in phrasemachine are comparing it to the libraries listed below
Sorting:
- Investigative tool for extracting relevant areas from many documents☆14Nov 17, 2015Updated 10 years ago
- deep inverse regression☆31Nov 3, 2015Updated 10 years ago
- A collection of various discourse segmenters☆10Jun 30, 2017Updated 8 years ago
- An R package that implements fast searching for multiple keywords in multiple texts.☆11Feb 5, 2025Updated last year
- ☆18Feb 6, 2016Updated 10 years ago
- ☆46Oct 28, 2024Updated last year
- Code for EMNLP'20 paper "When Hearst Is not Enough: Improving Hypernymy Detection from Corpus with Distributional Models"☆11Nov 10, 2020Updated 5 years ago
- Text Interchange Formats☆37Nov 26, 2023Updated 2 years ago
- ☆88Dec 5, 2021Updated 4 years ago
- NYT Risk Semantics Project☆12Mar 5, 2016Updated 10 years ago
- R wrapper to spaCy NLP☆252Feb 3, 2025Updated last year
- Interactive visualization of non-linear logistic regression decision boundaries☆28Jul 24, 2014Updated 11 years ago
- RhetoricalRecursiveNeuralNetwork(R2N2) is recursive neural network using RST for NLP Tasks such as Sentiment Analysis☆12Sep 2, 2015Updated 10 years ago
- Open Retractions API client☆13Apr 16, 2017Updated 8 years ago
- Prism is a tool for collective interpretation. It's an ongoing experiment by the Praxis Program at the University of Virginia Scholars' L…☆25Oct 6, 2022Updated 3 years ago
- sumgram is a tool that summarizes a collection of text documents by generating the most frequent sumgrams (conjoined ngrams)☆56Aug 1, 2024Updated last year
- Extract deleted tweet & politician data from the Politwoops project☆24May 14, 2017Updated 8 years ago
- Fast Word Clustering Software☆79Feb 8, 2025Updated last year
- MiTextExplorer - interactive browser of text and document covariates.☆24Jun 17, 2015Updated 10 years ago
- Resolve data table conflicts☆17Jun 11, 2015Updated 10 years ago
- source{d} MLonCode foundation - core algorithms and models.☆14Oct 17, 2019Updated 6 years ago
- Code for extracting data from a large number of PDFs, particularly FCC political ad documents☆15Oct 26, 2017Updated 8 years ago
- A toolkit for corpus linguistics☆205Jun 11, 2019Updated 6 years ago
- Python Keyphrase Extraction module☆1,588Jul 12, 2023Updated 2 years ago
- OUTDATED Markdown + Roxygen = Maxygen☆52Oct 21, 2015Updated 10 years ago
- Beautiful visualizations of how language differs among document types.☆2,330Apr 29, 2025Updated 10 months ago
- Patches for using dplyr with Databases and Big Data☆67Oct 18, 2020Updated 5 years ago
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts☆28Sep 20, 2021Updated 4 years ago
- Tools for reshaping text data☆53Apr 1, 2024Updated last year
- A deque for R.☆29Mar 13, 2022Updated 3 years ago
- Diff, patch and merge for data.frames, see http://paulfitz.github.io/daff/☆157Feb 15, 2024Updated 2 years ago
- An R package to assess the effects of text preprocessing decisions.☆66Jul 25, 2021Updated 4 years ago
- ARCHIVED Consumer for APIs that Follow the JSON API Specification☆29May 10, 2022Updated 3 years ago
- Tools for speech processing, keyword spotting☆17Mar 11, 2020Updated 5 years ago
- how I FOIA (and maybe how you can too!)☆21Mar 20, 2018Updated 7 years ago
- High Performance Text Processing in R☆100Mar 18, 2020Updated 5 years ago
- Experiment, Storage and Visualization Framework for Machine Learning research.☆31May 19, 2021Updated 4 years ago
- Fast n-Gram Tokenization☆72Dec 10, 2023Updated 2 years ago
- OSoMe API mashups☆11Jan 29, 2019Updated 7 years ago