slanglab/phrasemachine

Quickly extract multi-word phrases from a corpus

☆193

Alternatives and similar repositories for phrasemachine

Users who are interested in phrasemachine compare it to the repositories listed below.

alexbyrnes / Datapiece
Investigative tool for extracting relevant areas from many documents
☆14 · Nov 17, 2015 · Updated 10 years ago
sunlightlabs / read_FEC
Turn raw electronic FEC filings into meaningful data
☆19 · May 20, 2016 · Updated 10 years ago
TaddyLab / deepir
deep inverse regression
☆31 · Nov 3, 2015 · Updated 10 years ago
okdistribute / knead
Resolve data table conflicts
☆17 · Jun 11, 2015 · Updated 11 years ago
ropenscilabs / tif
Text Interchange Formats
☆38 · Nov 26, 2023 · Updated 2 years ago
alexbyrnes / FCC-Political-Ads_The-Code
Code for extracting data from a large number of PDFs, particularly FCC political ad documents
☆15 · Oct 26, 2017 · Updated 8 years ago
oduwsdl / sumgram
sumgram is a tool that summarizes a collection of text documents by generating the most frequent sumgrams (conjoined ngrams)
☆55 · Aug 1, 2024 · Updated last year
src-d / snippet-ranger
☆11 · Nov 17, 2017 · Updated 8 years ago
trinker / rnltk
☆18 · Feb 6, 2016 · Updated 10 years ago
ChristopherLucas / MatchingFrontier
Optimal pruning for imbalance minimization in causal inference
☆18 · Sep 7, 2020 · Updated 5 years ago
quanteda / spacyr
R wrapper to spaCy NLP
☆253 · Feb 3, 2025 · Updated last year
parry2403 / R2N2
RhetoricalRecursiveNeuralNetwork(R2N2) is recursive neural network using RST for NLP Tasks such as Sentiment Analysis
☆12 · Sep 2, 2015 · Updated 10 years ago
Ironholds / retractr
Open Retractions API client
☆13 · Apr 16, 2017 · Updated 9 years ago
brendano / mte
MiTextExplorer - interactive browser of text and document covariates.
☆24 · Jun 17, 2015 · Updated 11 years ago
scottpham / twitterBot
Twitter Bots!
☆10 · Sep 2, 2014 · Updated 11 years ago
chambm / AhoCorasickTrie
An R package that implements fast searching for multiple keywords in multiple texts.
☆11 · Feb 5, 2025 · Updated last year
kateto / PolitwoopsR
Extract deleted tweet & politician data from the Politwoops project
☆24 · May 14, 2017 · Updated 9 years ago
src-d / ml-core
source{d} MLonCode foundation - core algorithms and models.
☆13 · Oct 17, 2019 · Updated 6 years ago
AbeHandler / contracts_nlp
Uses NLP methods to parse and classify contracts from The City of New Orleans
☆10 · Mar 23, 2015 · Updated 11 years ago
sckott / rforcats
☆46 · Oct 28, 2024 · Updated last year
discourse-lab / DiscourseSegmenter
A collection of various discourse segmenters
☆10 · Jun 30, 2017 · Updated 9 years ago
matthewjdenny / preText
An R package to assess the effects of text preprocessing decisions.
☆68 · Jul 25, 2021 · Updated 4 years ago
gaborcsardi / maxygen
OUTDATED Markdown + Roxygen = Maxygen
☆52 · Oct 21, 2015 · Updated 10 years ago
quadrismegistus / lltk
Literary Language Toolkit: code, models, corpora, and web tools
☆11 · Jul 5, 2026 · Updated 2 weeks ago
matthewjdenny / SpeedReader
High Performance Text Processing in R
☆100 · Mar 18, 2020 · Updated 6 years ago
sf-wa-326 / phrase-bert-topic-model
☆86 · Dec 5, 2021 · Updated 4 years ago
wrathematics / dequer
A deque for R.
☆29 · Mar 13, 2022 · Updated 4 years ago
cbail / textnets
R package to perform automated text analysis using network techniques
☆224 · Nov 11, 2023 · Updated 2 years ago
Georgetown-IR-Lab / emnlp17-depression
☆12 · Aug 2, 2024 · Updated last year
civisanalytics / muffnn
Multilayer Feed-Forward Neural Network predictive model implementations with TensorFlow and scikit-learn
☆45 · Nov 29, 2022 · Updated 3 years ago
fnl / segtok
Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe…
☆170 · Dec 15, 2021 · Updated 4 years ago
edwindj / daff
Diff, patch and merge for data.frames, see http://paulfitz.github.io/daff/
☆159 · Feb 15, 2024 · Updated 2 years ago
WinVector / replyr
Patches for using dplyr with Databases and Big Data
☆67 · Oct 18, 2020 · Updated 5 years ago
BenjaminDHorne / The-NELA-Toolkit
The News Landscape Toolkit (NELA)
☆16 · Oct 14, 2020 · Updated 5 years ago
boudinfl / pke
Python Keyphrase Extraction module
☆1,590 · Jul 12, 2023 · Updated 3 years ago
ryanjgallagher / shifterator
Interpretable data visualizations for understanding how texts differ at the word level
☆290 · Jun 30, 2026 · Updated 3 weeks ago
jonsafari / clustercat
Fast Word Clustering Software
☆79 · Feb 8, 2025 · Updated last year
davben / arvig
An R data package containing georeferenced events of right-wing violence in Germany from 2014 onwards
☆11 · Jun 27, 2018 · Updated 8 years ago
ropensci-archive / rjsonapi
ARCHIVED Consumer for APIs that Follow the JSON API Specification
☆29 · May 10, 2022 · Updated 4 years ago