Detect common phrases in large amounts of text using a data-driven approach. Size of discovered phrases can be arbitrary. Can be used in languages other than English
β131Jul 15, 2019Updated 6 years ago
Alternatives and similar repositories for phrase-at-scale
Users that are interested in phrase-at-scale are comparing it to the libraries listed below
Sorting:
- πNeural Sentential Paraphrase Generation to Augment Chatbot Training Datasetβ21Dec 7, 2022Updated 3 years ago
- β15Mar 19, 2017Updated 8 years ago
- β17Aug 29, 2019Updated 6 years ago
- Language Tool style grammar handling with spaCy 2.0β42Jul 27, 2018Updated 7 years ago
- Entity Linking within a Social Media Platformβ11May 2, 2019Updated 6 years ago
- Large-scale topic discovery with Sampled-MinHashingβ10Jul 3, 2019Updated 6 years ago
- πΈDe-inflect Japanese wordsβ15Nov 24, 2025Updated 3 months ago
- The code of EMNLP 2019 paper "A Split-and-Recombine Approach for Follow-up Query Analysis"β18Jul 20, 2023Updated 2 years ago
- RNN model to punctuate degraded text with no punctuation, and an application that combines it with Watson TTS for automated transcriptionβ¦β10Apr 9, 2017Updated 8 years ago
- β14Feb 22, 2022Updated 4 years ago
- Model for predicting categories of entities by its mentionsβ31Jun 23, 2021Updated 4 years ago
- Free and open source Tableau alternative that generates Python Pandas codeβ12Aug 23, 2018Updated 7 years ago
- Miscellaneous utility functionsβ11Nov 17, 2016Updated 9 years ago
- Recom.live β the real-time recommendation systemβ10Jul 6, 2023Updated 2 years ago
- Corpora, tools and resources for Turkish NLPβ14May 27, 2020Updated 5 years ago
- Developing different methods for expanding a query/topic in information retrieval and choosing the best expanded query using similarity mβ¦β11May 17, 2017Updated 8 years ago
- Extract Unique Word Lists From Wikipedia Databaseβ13May 27, 2020Updated 5 years ago
- Tensorflow Implementation of Neural Conversational Model by Vinyals et.al.β12Sep 3, 2016Updated 9 years ago
- jgtextrank: Yet another Python implementation of TextRankβ13Nov 27, 2019Updated 6 years ago
- Django+echarts+py2neoθΏθ‘η₯θ―εΎθ°±ηεη«―ε±η€Ίβ16Oct 13, 2020Updated 5 years ago
- Bit Error Rate (BER) and Frame Error Rate (FER) references. Most of those results have been simulated with AFF3CT.β15Oct 29, 2025Updated 4 months ago
- Negima is a Python package to extract phrases in Japanese text by using the part-of-speeches based rules you defined.β14Aug 20, 2018Updated 7 years ago
- Dependency or Span, End-to-End Uniform Semantic Role Labelingβ32Nov 23, 2018Updated 7 years ago
- store my personal projectβ22Jun 4, 2020Updated 5 years ago
- An active annotation tool based on brat(https://github.com/nlplab/brat)β19Aug 22, 2017Updated 8 years ago
- β14Jun 9, 2019Updated 6 years ago
- β15May 29, 2021Updated 4 years ago
- Visualize word embeddings of a vocabulary in TensorBoard, including the neighborsβ46Jul 18, 2017Updated 8 years ago
- Generating Dataset for Google's Text Summarization Codeβ33Dec 17, 2018Updated 7 years ago
- [NAACL 2021] Designing a Minimal Retrieve-and-Read System for Open-Domain Question Answeringβ36Apr 20, 2021Updated 4 years ago
- Semantic parsing as machine translationβ24Nov 11, 2016Updated 9 years ago
- Code for the ACL 2020 Paper on Schwa Deletion in Hindi and Punjabiβ16Oct 30, 2023Updated 2 years ago
- this project is for Semantic role labeling using bertβ36Jan 6, 2019Updated 7 years ago
- Code to run LDA algorithm on Twitter/Foursquare scraped data.β16Aug 22, 2017Updated 8 years ago
- EmbedRank: Unsupervised Keyphrase Extraction using Sentence Embeddings (official implementation)β440Apr 7, 2023Updated 2 years ago
- List of papers on concept prerequisite learning.β36Sep 7, 2018Updated 7 years ago
- EMNLP 2019: CaRe: Open Knowledge Graph Embeddingsβ38Jul 6, 2023Updated 2 years ago
- Break Wikidata dumps into smaller knowledge graphsβ43Oct 7, 2020Updated 5 years ago
- A toolkit for generating paraphrase vector representations for words in contextβ23May 19, 2015Updated 10 years ago