☆30Sep 6, 2023Updated 2 years ago
Alternatives and similar repositories for data_generation
Users who are interested in data_generation are comparing it to the libraries listed below
- The Benchmark of Linguistic Minimal Pairs☆161Dec 13, 2022Updated 3 years ago
- A framework for Lexical Simplification.☆14Mar 27, 2018Updated 7 years ago
- ☆15Jul 1, 2020Updated 5 years ago
- ☆15Oct 21, 2023Updated 2 years ago
- The code for (explicitly) specializing distributional embedding spaces for semantic similarity☆13May 11, 2018Updated 7 years ago
- This repository houses the IMPlicature and PRESupposition diagnostic dataset (IMPPRES), consisting of >25k semiautomatically generated se…☆19Sep 15, 2021Updated 4 years ago
- Repo for the simplified text alignment tools.☆21Dec 4, 2020Updated 5 years ago
- Bayesian pragmatic models implemented in Python☆20May 11, 2025Updated 9 months ago
- Discourse Based Evaluation of Language Understanding☆21Jan 28, 2023Updated 3 years ago
- Companion site for "Analysis Methods in Neural Language Processing: A Survey"☆66Feb 28, 2020Updated 6 years ago
- Large scale unannotated Korean corpus for unsupervised tasks. (e.g. Language modeling)☆28Aug 11, 2019Updated 6 years ago
- Specialising Word Vectors for Lexical Entailment☆29Sep 13, 2018Updated 7 years ago
- Reinforcement Learning Based Text Style Transfer without Parallel Training Corpus☆27May 27, 2019Updated 6 years ago
- A retrieve and edit approach to generate sarcasm by reversing valence and adding incongruent common sense context☆32Mar 27, 2021Updated 4 years ago
- Diagnostic tests for linguistic capacities in language models☆65May 7, 2022Updated 3 years ago
- Cross-platform Python client for the CodeReef.ai portal to manage portable workflows, reusable automation actions, software detection plu…☆11Mar 27, 2020Updated 5 years ago
- Python library for Myra☆10Jan 21, 2019Updated 7 years ago
- MoFA☆34Jun 5, 2017Updated 8 years ago
- [AAAI 2019] Code for paper "A Deep Sequential Model for Discourse Parsing on Multi-Party Dialogues"☆79Mar 31, 2023Updated 2 years ago
- A neural language model that estimates incremental processing complexity☆39Oct 27, 2021Updated 4 years ago
- Using latent Dirichlet allocation (LDA) in Apache Lucene☆57Nov 19, 2012Updated 13 years ago
- ☆12Feb 22, 2021Updated 5 years ago
- GIT☆13Aug 29, 2024Updated last year
- Android client for the diplicity service.☆10Aug 3, 2021Updated 4 years ago
- Templates etc. for creating experiments using Ibex Farm.☆11Jul 21, 2018Updated 7 years ago
- Implementation of NAACL'19 Strong and Simple Baselines for Multimodal Utterance Embeddings☆10Jun 4, 2019Updated 6 years ago
- ☆11Jun 15, 2019Updated 6 years ago
- ☆20Jul 18, 2018Updated 7 years ago
- An open-access corpus of conversational bilingual speech in Cantonese and English☆40Apr 28, 2022Updated 3 years ago
- A word hashing method based on vectors of letter n-grams. Currently transforms text into sequences of numbers.☆10Feb 27, 2018Updated 8 years ago
- True facts about Jeff Leek: https://yihui.name/en/2017/04/jeff-leek-facts/☆10Oct 27, 2020Updated 5 years ago
- Repo to showcase solution examples and learning content curated by the advanced analytics experts within Microsoft Finance☆17Sep 2, 2022Updated 3 years ago
- Comment toxicity classification using Keras/TensorFlow☆10May 25, 2018Updated 7 years ago
- ☆11Apr 19, 2021Updated 4 years ago
- A crawler-building project that aims to collect news from most newspaper outlets☆10Jul 29, 2019Updated 6 years ago
- ☆13Jul 8, 2020Updated 5 years ago
- HealthFC: Verifying Health Claims with Evidence-Based Medical Fact-Checking☆12Apr 11, 2025Updated 10 months ago
- ☆13Feb 20, 2020Updated 6 years ago
- A prototype implementation of the ASDF Standard for C++☆11Mar 14, 2023Updated 2 years ago