hadifar / stc_clusteringView external linksLinks
☆70Mar 8, 2024Updated last year
Alternatives and similar repositories for stc_clustering
Users that are interested in stc_clustering are comparing it to the libraries listed below
Sorting:
- One short text dataset for classification and clustering extracted from StackOverflow☆59Feb 7, 2018Updated 8 years ago
- A command line tool for training deep network models for short text classification☆20Jul 15, 2019Updated 6 years ago
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago
- Author Profiling for Abuse Detection (COLING 2018)☆10Dec 8, 2022Updated 3 years ago
- Implementation of Deep Soft-K means☆29Apr 28, 2021Updated 4 years ago
- ArSarcasm-v2 is an extension to the original ArSarcasm dataset. It was used for the shared task on sarcasm detection and sentiment analys…☆12Jan 26, 2022Updated 4 years ago
- Code and data for paper "Dialog Intent Induction with Deep Multi-View Clustering", Hugh Perkins and Yi Yang, 2019, EMNLP 2019☆67Jul 6, 2023Updated 2 years ago
- This code belongs to ACL conference paper entitled as "An Online Semantic-enhanced Dirichlet Model for Short Text Stream Clustering"☆17Apr 22, 2021Updated 4 years ago
- This repository contains the code for our paper "Robust Representation Learning with Reliable Pseudo-labels Generation via Self-Adaptive …☆13Nov 23, 2023Updated 2 years ago
- ☆12Jun 6, 2020Updated 5 years ago
- 我的百度机器阅读理解竞赛模型代码 ,获得 final 第三名☆14Jul 26, 2018Updated 7 years ago
- Use CNN to realize word classification☆14Mar 16, 2019Updated 6 years ago
- Extractive and Compressive Neural Summarization Based on Summary State Representations (NAACL 2019)☆16May 12, 2020Updated 5 years ago
- Short Text Topic Modeling, JAVA☆160May 24, 2020Updated 5 years ago
- Official code and dataset for our EMNLP 2024 Findings paper: Stark: Social Long-Term Multi-Modal Conversation with Persona Commonsense Kn…☆19Dec 27, 2024Updated last year
- ICAE code(An Image Clustering Auto-Encoder Based on Predefined Evenly-Distributed Class Centroids and MMD Distance)☆18Apr 20, 2023Updated 2 years ago
- Aranizer: A Custom Tokenizer based on SentencePiece and BPE tailored for Arabic Language Modeling☆21Aug 4, 2024Updated last year
- Large scale sentential paraphrases collection and annotation☆46Dec 31, 2022Updated 3 years ago
- Deep Context Modeling for Multi-turn Response Selection in Dialogue Systems☆20Oct 12, 2020Updated 5 years ago
- "Zero-Training Sentence Embedding via Orthogonal Basis" paper implementation☆19Dec 23, 2018Updated 7 years ago
- Replication of "Auto-encoder Based Data Clustering" Song et al☆27Apr 21, 2018Updated 7 years ago
- gensim-fast2vec改造、灵活使用大规模外部词向量(具备OOV查询能力)☆23Jun 3, 2019Updated 6 years ago
- Code accompanying the paper "A Generative Framework for Zero Shot Learning with Adversarial Domain Adaptation"☆20Feb 1, 2021Updated 5 years ago
- Code for unsupervised aspect extraction, using Keras and its Backends☆91Jul 6, 2023Updated 2 years ago
- This repository contains two independent news datasets used in the 2017 study: "This Just In: Fake News Packs a Lot in Title, Uses Simple…☆30Apr 7, 2017Updated 8 years ago
- Stacked Denoising BERT for Noisy Text Classification (Neural Networks 2020)☆32Nov 28, 2022Updated 3 years ago
- A corpus of comments tagged for multiple attributes of unhealthiness.☆36Mar 25, 2021Updated 4 years ago
- This project helps to detect the mathematical formula from the given picture and the same formula is extracted and converted into the lat…☆12Oct 13, 2021Updated 4 years ago
- a framework for Medical Image Segmentation and Filtering☆10Mar 23, 2017Updated 8 years ago
- Implementation of unsupervised smoothed inverse frequency (Best Paper, Repl4NLP @ ACL 2018)☆79Apr 14, 2019Updated 6 years ago
- Neural (LSTM) version of the partial CRF model☆34Aug 4, 2019Updated 6 years ago
- Code base for representation learning of very short texts, such as tweets. By Cedric De Boom, IBCN, Ghent University, Belgium.☆34Apr 21, 2016Updated 9 years ago
- X-Transformer: Taming Pretrained Transformers for eXtreme Multi-label Text Classification☆142Apr 27, 2021Updated 4 years ago
- TextAugment: Text Augmentation Library☆431Dec 10, 2025Updated 2 months ago
- WordMoversEmbeddings(WME) is a simple code for generating the vector representation of sentence/document for text classification and clus…☆83Dec 5, 2018Updated 7 years ago
- ☆36Oct 1, 2020Updated 5 years ago
- sentence embedding by Smooth Inverse Frequency weighting scheme☆1,086Jul 23, 2019Updated 6 years ago
- Machine Learning for Healthcare☆10Mar 28, 2020Updated 5 years ago
- ☆10Apr 20, 2019Updated 6 years ago