An implementation of data augmentation methods for natural language processing tasks.
☆13Jul 25, 2024Updated last year
Alternatives and similar repositories for data-augmentation-for-nlp
Users that are interested in data-augmentation-for-nlp are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Survey on machine learning.☆14Nov 28, 2020Updated 5 years ago
- Use-cases of Hugging Face's BERT (e.g. paraphrase generation, unsupervised extractive summarization).☆20Feb 11, 2020Updated 6 years ago
- ☆12Jun 29, 2024Updated last year
- ☆13Apr 4, 2022Updated 4 years ago
- Relevant code for the "Show Your Work" paper, EMNLP 2019.☆18Sep 9, 2019Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code for paper Pushing Paraphrase Away from Original Sentence: A Multi-Round Paraphrase Generation Approach by Zhe Lin, Xiaojun Wan. This…☆14Aug 10, 2021Updated 4 years ago
- Code for the paper "REV: Information-Theoretic Evaluation of Free-Text Rationales"☆16Aug 11, 2023Updated 2 years ago
- ☆13Jul 1, 2021Updated 4 years ago
- SIR, SEIR, and beyond☆10Jul 6, 2023Updated 2 years ago
- Implementation of semi-supervised learning: UDA, MixMatch, Mean-teacher, focusing on NLP, powered by Pytorch☆12Jan 6, 2021Updated 5 years ago
- Code for paper https://arxiv.org/abs/2501.00522☆15Apr 28, 2025Updated last year
- ☆13Aug 26, 2019Updated 6 years ago
- ☆12Oct 22, 2019Updated 6 years ago
- Python library for backtranslation (with Google Translate)☆12Jan 11, 2020Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆11Dec 17, 2020Updated 5 years ago
- ☆12Apr 3, 2026Updated last month
- ☆15Jul 29, 2024Updated last year
- Open-source Human Feedback Library☆11Oct 25, 2023Updated 2 years ago
- ☆13Jan 18, 2020Updated 6 years ago
- TPU support for the fastai library☆13Apr 15, 2021Updated 5 years ago
- Evaluation cross-media retrieval using a new protocol.☆11Mar 14, 2017Updated 9 years ago
- ☆17Dec 27, 2021Updated 4 years ago
- Deep convolutional conditional GAN implementation with CelebA dataset that allows for generation of custom faces according to textual inp…☆18Jun 15, 2021Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Intro to Machine Learning and Deep Learning for Earth-Life Sciences☆14Jun 29, 2019Updated 6 years ago
- The spoken L1 corpus represents present-day spoken Chinese (Putonghua) used in mainland China, which is designed as a comparable corpus t…☆23Aug 2, 2021Updated 4 years ago
- Label Efficient Learning From Explanations☆22Mar 9, 2022Updated 4 years ago
- Please visit this repo for enhanced and updated open source code☆13Dec 14, 2025Updated 4 months ago
- A text-to-speech command line tool backed by Azure Cognitive Services.☆19Feb 16, 2026Updated 2 months ago
- An Objective-C port of Lucene 5.x☆12Jun 6, 2021Updated 4 years ago
- Paraphrase Generation Using Deep Reinforcement Learning - MSc Thesis☆18Jun 10, 2020Updated 5 years ago
- This is the official PyTorch repo for "UNIREX: A Unified Learning Framework for Language Model Rationale Extraction" (ICML 2022).☆27Feb 14, 2023Updated 3 years ago
- Proof-of-concept of global switching between numpy/jax/pytorch in a library.☆18Jun 18, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- TextComplexityDE dataset consists of 1000 sentences in the German language with subjective complexity rating, collected from German learn…☆12Apr 8, 2022Updated 4 years ago
- EnriCo: Enriched Representation and Globally Constrained Inference for Entity and Relation Extraction☆26May 22, 2024Updated last year
- This repository contains the WordNet Language Model Probing (WNLaMPro) dataset introduced in "Rare Words: A Major Problem for Contextuali…☆14Feb 2, 2020Updated 6 years ago
- ☆15Dec 12, 2019Updated 6 years ago
- Server side API for QANTA quiz bowl system☆10Jan 31, 2019Updated 7 years ago
- MILES is a multilingual text simplifier inspired by LSBert - A BERT-based lexical simplification approach proposed in 2018. Unlike LSBert…☆50May 3, 2021Updated 5 years ago
- COMET for African languages☆11Jan 24, 2025Updated last year