Detect common phrases in large amounts of text using a data-driven approach. Discovered phrases can be of arbitrary length. Can be used for languages other than English.
☆131 · Jul 15, 2019 · Updated 6 years ago
Alternatives and similar repositories for phrase-at-scale
Users that are interested in phrase-at-scale are comparing it to the libraries listed below.
- Python word cloud library for use within Jupyter notebook and Python apps. ☆49 · May 15, 2024 · Updated last year
- Free and open source Tableau alternative that generates Python Pandas code ☆12 · Aug 23, 2018 · Updated 7 years ago
- Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regr… ☆1,182 · Dec 2, 2020 · Updated 5 years ago
- Extracts monetary items and amounts from court judgments. ☆11 · Apr 8, 2016 · Updated 10 years ago
- RxNLP APIs for clustering sentences, extracting topics, counting words & n-grams, extracting text from html or URL, computing similarity … ☆15 · Jan 24, 2020 · Updated 6 years ago
- SWIG Wrapper for the SRILM toolkit ☆35 · Oct 5, 2020 · Updated 5 years ago
- List of papers on concept prerequisite learning. ☆36 · Sep 7, 2018 · Updated 7 years ago
- Extract Unique Word Lists From Wikipedia Database ☆13 · May 27, 2020 · Updated 5 years ago
- Unsupervised Multilingual Word Embeddings (EMNLP 2018) ☆80 · Dec 28, 2021 · Updated 4 years ago
- Social Media Machine Translation Toolkit ☆21 · Sep 13, 2013 · Updated 12 years ago
- Recom.live: the real-time recommendation system ☆10 · Jul 6, 2023 · Updated 2 years ago
- CNN text classification using keras ☆16 · Nov 27, 2017 · Updated 8 years ago
- An online spatiotemporal data visualizer using D3.js, Leaflet.js and Crossfilter ☆18 · Oct 31, 2017 · Updated 8 years ago
- ☆123 · Apr 12, 2023 · Updated 3 years ago
- Embeddings for all geonames populated locations with population greater than 0 ☆13 · May 15, 2017 · Updated 8 years ago
- RNN model to punctuate degraded text with no punctuation, and an application that combines it with Watson TTS for automated transcription… ☆10 · Apr 9, 2017 · Updated 9 years ago
- Miscellaneous utility functions ☆11 · Nov 17, 2016 · Updated 9 years ago
- store my personal project ☆22 · Jun 4, 2020 · Updated 5 years ago
- Universal segmenter based on the Universal Dependency framework, written by Y. Shao, Uppsala University ☆34 · Apr 3, 2019 · Updated 7 years ago
- Corpora, tools and resources for Turkish NLP ☆14 · May 27, 2020 · Updated 5 years ago
- Large-scale topic discovery with Sampled-MinHashing ☆10 · Jul 3, 2019 · Updated 6 years ago
- Model for predicting categories of entities by its mentions ☆31 · Jun 23, 2021 · Updated 4 years ago
- Visualize word embeddings of a vocabulary in TensorBoard, including the neighbors ☆46 · Jul 18, 2017 · Updated 8 years ago
- Entity Linking within a Social Media Platform ☆11 · May 2, 2019 · Updated 6 years ago
- Bit Error Rate (BER) and Frame Error Rate (FER) references. Most of those results have been simulated with AFF3CT. ☆15 · Oct 29, 2025 · Updated 5 months ago
- Generating Dataset for Google's Text Summarization Code ☆33 · Dec 17, 2018 · Updated 7 years ago
- Code for the ACL 2020 Paper on Schwa Deletion in Hindi and Punjabi ☆16 · Oct 30, 2023 · Updated 2 years ago
- Ensemble Machine Learning for Time Series: Ensemble of Deep Recurrent Neural Networks and Random forest using a Stacking (averaging) laye… ☆33 · Aug 23, 2017 · Updated 8 years ago
- The code of EMNLP 2019 paper "A Split-and-Recombine Approach for Follow-up Query Analysis" ☆18 · Jul 20, 2023 · Updated 2 years ago
- Developing different methods for expanding a query/topic in information retrieval and choosing the best expanded query using similarity m… ☆11 · May 17, 2017 · Updated 8 years ago
- ☆17 · Aug 29, 2019 · Updated 6 years ago
- EmbedRank: Unsupervised Keyphrase Extraction using Sentence Embeddings (official implementation) ☆439 · Apr 7, 2023 · Updated 3 years ago
- A visualisation tool for Spacy using Hierplane. ☆64 · Jan 25, 2023 · Updated 3 years ago
- Very Simple Question Answer System using Chinese Wikipedia Data ☆24 · May 18, 2024 · Updated last year
- [NAACL 2021] Designing a Minimal Retrieve-and-Read System for Open-Domain Question Answering ☆36 · Apr 20, 2021 · Updated 4 years ago
- Python source code for EMNLP 2021 Findings paper: "Subword Mapping and Anchoring Across Languages". ☆13 · Sep 17, 2021 · Updated 4 years ago
- ☆14 · Jun 9, 2019 · Updated 6 years ago
- Distributed machine learning algorithm for new word discovery. ☆15 · Jul 21, 2014 · Updated 11 years ago
- NLP pipeline using word2vec (preprocessing/embedding/prediction/clustering) ☆116 · May 3, 2024 · Updated last year