kamperh / nlp817
Natural Language Processing 817
☆17Updated 3 months ago
Related projects
Alternatives and complementary repositories for nlp817
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras☆11Updated 4 years ago
- fiwGAN/ciwGAN (Featural and Categorical InfoWaveGAN): Generative Adversarial Phonology and Semantics☆23Updated last year
- A guide to building language technology in new languages.☆57Updated 2 years ago
- A home for audio ML in JAX. Has common features, learnable frontends, pretrained supervised and self-supervised models.☆62Updated 2 years ago
- Code and data for Koenecke et al. (2020)☆28Updated last year
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2☆82Updated 8 months ago
- Scripts to create speech corpora from open.bible☆12Updated 2 years ago
- Implementation of the DIVA model of speech acquisition and production using PyTorch☆21Updated last year
- Repository containing an experimentation platform for training and running inference on wav2vec2 models.☆85Updated 2 years ago
- Speech in Flax/JAX☆15Updated 2 years ago
- This will hold the data pipeline to convert raw audio data to speech, which will act as the input dataset for the speech-to-text pipeline☆32Updated last year
- Transcribe your videos and translate them into Indic languages.☆29Updated last week
- Hosts text-to-speech corpus and speech synthesizers for African languages.☆13Updated last year
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…☆31Updated 3 years ago
- Scripts for working with open.bible data☆23Updated 2 years ago
- Train a fiwGAN or ciwGAN model using your own training data☆13Updated 2 years ago
- This is an intuitive explanation of Representation Learning with Contrastive Predictive Coding using code provided by jefflai108 that use…☆10Updated 3 years ago
- The Gridspace-Stanford Harper Valley speech dataset. Created in support of CS224S.☆41Updated 3 years ago
- Balanced Error Rate for Speaker Diarization☆25Updated last year
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.☆34Updated last year
- NPTEL2020: Speech2Text dataset for Indian-English Accent☆72Updated 2 years ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆15Updated 2 weeks ago
- docker for HF wav2vec2-sprint☆12Updated 3 years ago
- Audio feature extraction and baseline search implementation for the Spotify Podcast Dataset.☆11Updated 3 years ago
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…☆12Updated last year
- This is an ASR corpus for the Bemba language. It contains read speech from diverse publicly available Bemba sources: Literature Books, Radio/…☆32Updated 5 months ago