bedapudi6788 / txt2txt
Extremely easy-to-use sequence-to-sequence library with attention, for text-to-text conversion tasks.
☆39 Updated 3 years ago
Related projects:
- Text and Punctuation correction with Deep Learning ☆129 Updated 4 years ago
- Many Natural Language Processing tasks rely on sentence boundary detection (SBD). Although amazing libraries like spacy provide state of … ☆61 Updated 4 years ago
- Tooling to play around with multilingual machine translation for Indian Languages. ☆21 Updated 2 years ago
- A package for fine-tuning Transformers with TPUs, written in Tensorflow2.0+ ☆37 Updated 3 years ago
- BERT models for many languages created from Wikipedia texts ☆34 Updated 4 years ago
- A Benchmark Dataset for Understanding Disfluencies in Question Answering ☆60 Updated 3 years ago
- A simple neural truecaser written in pytorch and allennlp. ☆31 Updated 3 months ago
- Fast and accurate spell correction library ☆74 Updated 2 years ago
- Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries ☆19 Updated 2 years ago
- ✨ Web interface for NeuralCoref coreference resolution ☆34 Updated last year
- Experiments with generating GPT-2 fanfiction on specified topics. ☆11 Updated 5 years ago
- Training a model without a dataset for natural language inference (NLI) ☆25 Updated 4 years ago
- Python library for converting UTF to WX and vice-versa for Indian languages. ☆48 Updated 2 years ago
- Source code for the Apple reproduction ☆30 Updated 3 years ago
- Deep neural approach to Boundary and Disfluency Detection - Based on my Master's work ☆19 Updated last month
- Dataset of sentences from Hindi stories tagged with different emotion tags ☆10 Updated 4 years ago
- A set of tools for leveraging pre-trained embeddings, active learning and model explainability for efficient document classification ☆29 Updated 2 years ago
- A python true casing utility that restores case information for texts ☆88 Updated last year
- Speech Recognition for Indic languages. ☆11 Updated 3 years ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models ☆31 Updated 3 years ago
- Code for extracting parallel corpora from pmindia ☆16 Updated 4 years ago
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline. ☆85 Updated 3 years ago
- A web application that interfaces two GEC systems. [web instance is down] ☆31 Updated last month
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end. ☆69 Updated last year
- docker for HF wav2vec2-sprint ☆12 Updated 3 years ago
- On Generating Extended Summaries of Long Documents ☆77 Updated 3 years ago
- Tool to fix bitexts and tag near-duplicates for removal ☆29 Updated last month
- German small and large versions of GPT2. ☆19 Updated 2 years ago
- Build a dialog dataset from online books in many languages ☆71 Updated last year