syedsarfarazakhtar / Word-Similarity-Datasets-for-Indian-LanguagesLinks
Cite: http://www.aclweb.org/anthology/W/W17/W17-08.pdf#page=103
☆8Updated 8 years ago
Alternatives and similar repositories for Word-Similarity-Datasets-for-Indian-Languages
Users that are interested in Word-Similarity-Datasets-for-Indian-Languages are comparing it to the libraries listed below
Sorting:
- Python library for converting UTF to WX and vice-versa for Indian languages.☆47Updated 3 years ago
- Soundex Phonetic Code Algorithm Demo for Indian Languages. Supports all indian languages and English. Provides intra-indic string compari…☆58Updated 6 years ago
- This is the text partitioner project for Python.☆21Updated 6 years ago
- Featurize words into orthographic and phonological vectors.☆41Updated 2 years ago
- Religious Hate Speech Detection for Arabic Tweets☆24Updated 6 years ago
- ☆69Updated 2 years ago
- A machine learning model explainer that works on top of Apache Spark☆18Updated 8 years ago
- Code and data from our ACL 2014 paper "Humans Require Context to Infer Ironic Intent (so Computers Probably do, too)"☆15Updated 11 years ago
- Use spaCy for NLP and output to the FoLiA XML format.☆12Updated last year
- ☆30Updated 5 years ago
- TopicScan: Visualization and validation interface for NMF Topic Modeling☆23Updated 5 years ago
- Aggressive reddit scraper in node js☆13Updated 10 years ago
- Compare accuracies of udpipe models and spacy models which can be used for NLP annotation☆14Updated 7 years ago
- Miscellaneous scripts to gather and process data of wikis.☆21Updated 2 years ago
- A repo for sharing language resources related to the outbreak (in machine readable format)☆25Updated 5 years ago
- Doing things with embeddings☆66Updated 2 years ago
- allennlp tutorial for O'Reilly AI Conference, September 2019☆22Updated 5 years ago
- Lexicons for the Multilingual UCREL Semantic Analysis System☆43Updated 2 years ago
- ACL 2020 papers by authors who are members of underrepresented groups (URMs)☆17Updated 5 years ago
- spaCy + UDPipe☆162Updated 3 years ago
- Transliteration module for Indian Languages☆79Updated last year
- public repository of the interdisciplinary working group 'Hatespeech' of the research training group UCSM☆17Updated 6 years ago
- A collection of over 1.5 Million tweets data translated to French, with their sentiment.☆35Updated 8 years ago
- Visualizations of character embeddings from derived character vectors.☆13Updated 8 years ago
- KenLM extension for spaCy 2.0.☆16Updated 7 years ago
- Anonymization of legal cases (Fr) based on Flair embeddings☆88Updated 4 years ago
- Regex like pattern tree matching but on sentence's tree instead of Strings☆42Updated 7 years ago
- German lemmatization with IWNLP as extension for spaCy☆24Updated 2 years ago
- Tokenizer for Twitter and Reddit data☆46Updated 6 years ago
- 💙 Emoji handling and meta data for spaCy with custom extension attributes☆181Updated 2 years ago