An implementation of data augmentation methods for natural language processing tasks.
☆13Jul 25, 2024Updated last year
Alternatives and similar repositories for data-augmentation-for-nlp
Users that are interested in data-augmentation-for-nlp are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Survey on machine learning.☆14Nov 28, 2020Updated 5 years ago
- Seminar: intro to deep learning with tensorflow☆13Jun 27, 2017Updated 8 years ago
- Use-cases of Hugging Face's BERT (e.g. paraphrase generation, unsupervised extractive summarization).☆20Feb 11, 2020Updated 6 years ago
- "Unsupervised Paraphrase Generation using Pre-trained Language Model."☆22Aug 28, 2020Updated 5 years ago
- General Purpose Point Cloud Feature Extractor☆13Mar 1, 2018Updated 8 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆25May 11, 2024Updated last year
- ☆13Apr 4, 2022Updated 3 years ago
- Chinese word segmentation with the neural seq2seq model implement in pytorch☆10Dec 13, 2017Updated 8 years ago
- Relevant code for the "Show Your Work" paper, EMNLP 2019.☆18Sep 9, 2019Updated 6 years ago
- Code for paper Pushing Paraphrase Away from Original Sentence: A Multi-Round Paraphrase Generation Approach by Zhe Lin, Xiaojun Wan. This…☆14Aug 10, 2021Updated 4 years ago
- Code for the paper "REV: Information-Theoretic Evaluation of Free-Text Rationales"☆16Aug 11, 2023Updated 2 years ago
- SIR, SEIR, and beyond☆10Jul 6, 2023Updated 2 years ago
- Humans consulting HCH☆10Sep 23, 2017Updated 8 years ago
- Implementation of semi-supervised learning: UDA, MixMatch, Mean-teacher, focusing on NLP, powered by Pytorch☆12Jan 6, 2021Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- This project compares manually curated knowledge graphs with those automatically generated by Ollama Gemma 7B, a large language model (LL…☆16Jun 14, 2024Updated last year
- Code for paper https://arxiv.org/abs/2501.00522☆14Apr 28, 2025Updated 10 months ago
- ☆13Aug 26, 2019Updated 6 years ago
- Python library for backtranslation (with Google Translate)☆12Jan 11, 2020Updated 6 years ago
- ☆11Dec 17, 2020Updated 5 years ago
- ☆12Dec 8, 2022Updated 3 years ago
- Open-source Human Feedback Library☆11Oct 25, 2023Updated 2 years ago
- ☆13Jan 18, 2020Updated 6 years ago
- ☆17Dec 27, 2021Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- TyDiP Multilingual Politeness dataset and code☆12Oct 15, 2023Updated 2 years ago
- KPI time-series analysis using deep neural networks☆13Feb 28, 2019Updated 7 years ago
- The spoken L1 corpus represents present-day spoken Chinese (Putonghua) used in mainland China, which is designed as a comparable corpus t…☆23Aug 2, 2021Updated 4 years ago
- Intro to Machine Learning and Deep Learning for Earth-Life Sciences☆14Jun 29, 2019Updated 6 years ago
- Label Efficient Learning From Explanations☆22Mar 9, 2022Updated 4 years ago
- I2T2I: Text-to-Image Synthesis with textual data augmentation☆30Mar 21, 2019Updated 7 years ago
- Please visit this repo for enhanced and updated open source code☆14Dec 14, 2025Updated 3 months ago
- A text-to-speech command line tool backed by Azure Cognitive Services.☆19Feb 16, 2026Updated last month
- An Objective-C port of Lucene 5.x☆12Jun 6, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Utilizing nbdev in Google Colaboratory☆14Apr 12, 2023Updated 2 years ago
- This repo contains my jupyter notebook for a data challenge for building a machine learning model to identify fraud in e-commerce transac…☆13Apr 3, 2017Updated 8 years ago
- TSAR2022 Shared Task on Lexical Simplification - Datasets and Evaluation scripts☆10Oct 27, 2022Updated 3 years ago
- TextComplexityDE dataset consists of 1000 sentences in the German language with subjective complexity rating, collected from German learn…☆13Apr 8, 2022Updated 3 years ago
- Proof-of-concept of global switching between numpy/jax/pytorch in a library.☆18Jun 18, 2024Updated last year
- This repository contains the WordNet Language Model Probing (WNLaMPro) dataset introduced in "Rare Words: A Major Problem for Contextuali…☆14Feb 2, 2020Updated 6 years ago
- PyTorch implementation of StackGAN paper using BERT embeddings☆12Feb 6, 2022Updated 4 years ago