Detect common phrases in large amounts of text using a data-driven approach. Size of discovered phrases can be arbitrary. Can be used in languages other than English
☆132Jul 15, 2019Updated 6 years ago
Alternatives and similar repositories for phrase-at-scale
Users that are interested in phrase-at-scale are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Python word cloud library for use within Jupyter notebook and Python apps.☆50May 15, 2024Updated 2 years ago
- ☆15Mar 19, 2017Updated 9 years ago
- Language Tool style grammar handling with spaCy 2.0☆42Jul 27, 2018Updated 7 years ago
- Build intelligent data-driven applications with minimal effort. Sentence Clustering, Topics Extraction, Text Similarity, Opinion Summariz…☆41Oct 22, 2019Updated 6 years ago
- Free and open source Tableau alternative that generates Python Pandas code☆12Aug 23, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 📄Neural Sentential Paraphrase Generation to Augment Chatbot Training Dataset☆21Dec 7, 2022Updated 3 years ago
- Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regr…☆1,185Dec 2, 2020Updated 5 years ago
- RxNLP APIs for clustering sentences, extracting topics, counting words & n-grams, extracting text from html or URL, computing similarity …☆15Jan 24, 2020Updated 6 years ago
- List of papers on concept prerequisite learning.☆36Sep 7, 2018Updated 7 years ago
- This repo contains code and dataset for the Opinosis Summarization Framework☆53Nov 14, 2019Updated 6 years ago
- jgtextrank: Yet another Python implementation of TextRank☆14Nov 27, 2019Updated 6 years ago
- Unsupervised Multilingual Word Embeddings (EMNLP 2018)☆80Dec 28, 2021Updated 4 years ago
- Social Media Machine Translation Toolkit☆21Sep 13, 2013Updated 12 years ago
- CNN text classification using keras☆16Nov 27, 2017Updated 8 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Unsupervised domain adaptation method for relation extraction☆18Jul 16, 2018Updated 7 years ago
- An online spatiotemporal data visualizer using D3.js, Leaflet.js and Crossfilter☆18Oct 31, 2017Updated 8 years ago
- Biomedical wordlists (of drugs, genes, etc) for several text mining projects☆19Apr 3, 2023Updated 3 years ago
- RNN model to punctuate degraded text with no punctuation, and an application that combines it with Watson TTS for automated transcription…☆10Apr 9, 2017Updated 9 years ago
- Embeddings for all geonames populated locations with population greater than 0☆13May 15, 2017Updated 9 years ago
- Evaluation code and data for "Automatic Correction of Human Translations" [NAACL 2022].☆19Dec 9, 2022Updated 3 years ago
- ☆14Feb 22, 2022Updated 4 years ago
- Miscellaneous utility functions☆11Nov 17, 2016Updated 9 years ago
- store my personal project☆22Jun 4, 2020Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Corpora, tools and resources for Turkish NLP☆14May 27, 2020Updated 6 years ago
- Large-scale topic discovery with Sampled-MinHashing☆10Jul 3, 2019Updated 6 years ago
- Model for predicting categories of entities by its mentions☆31Jun 23, 2021Updated 4 years ago
- Visualize word embeddings of a vocabulary in TensorBoard, including the neighbors☆46Jul 18, 2017Updated 8 years ago
- Entity Linking within a Social Media Platform☆11May 2, 2019Updated 7 years ago
- A Web-Based Visualization Tool for Biclustering of Multivariate Time Series☆10Feb 17, 2023Updated 3 years ago
- Bit Error Rate (BER) and Frame Error Rate (FER) references. Most of those results have been simulated with AFF3CT.☆15May 15, 2026Updated last month
- Generating Dataset for Google's Text Summarization Code☆33Dec 17, 2018Updated 7 years ago
- ☆14Oct 21, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Code for the ACL 2020 Paper on Schwa Deletion in Hindi and Punjabi☆16Oct 30, 2023Updated 2 years ago
- Ensemble Machine Learning for Time Series: Ensemble of Deep Recurrent Neural Networks and Random forest using a Stacking (averaging) laye…☆33Aug 23, 2017Updated 8 years ago
- The code of EMNLP 2019 paper "A Split-and-Recombine Approach for Follow-up Query Analysis"☆18Jul 20, 2023Updated 2 years ago
- Developing different methods for expanding a query/topic in information retrieval and choosing the best expanded query using similarity m…☆11May 17, 2017Updated 9 years ago
- ☆17Aug 29, 2019Updated 6 years ago
- EmbedRank: Unsupervised Keyphrase Extraction using Sentence Embeddings (official implementation)☆439Apr 7, 2023Updated 3 years ago
- Very Simple Question Answer System using Chinese Wikipedia Data☆24May 18, 2024Updated 2 years ago