Cloud-CV / diverse-beam-search
Decoding Diverse Solutions from Neural Sequence Models
☆77Aug 13, 2018Updated 7 years ago
Alternatives and similar repositories for diverse-beam-search
Users who are interested in diverse-beam-search are comparing it to the libraries listed below.
- Python package for origami☆17Jan 10, 2019Updated 7 years ago
- ☆89Dec 18, 2016Updated 9 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- This is an extension of the Kaldi speech recognition software that allows decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- ☆24Mar 13, 2020Updated 5 years ago
- A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)☆21Sep 27, 2017Updated 8 years ago
- An implementation of the Wav2Letter Speech-to-Text model using PyTorch.☆14Mar 8, 2023Updated 2 years ago
- CloudCV - Large-Scale Distributed Computer Vision As A Cloud Service☆51May 9, 2020Updated 5 years ago
- Convert words to numbers☆21Apr 13, 2022Updated 3 years ago
- Easy to use Django admin interface.☆12Sep 22, 2016Updated 9 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆16Jul 22, 2021Updated 4 years ago
- A Chatbot based on VQA (Visual Question Answering)☆17Nov 25, 2016Updated 9 years ago
- ☆17Nov 25, 2019Updated 6 years ago
- Semantic parsing as machine translation☆24Nov 11, 2016Updated 9 years ago
- A neural language modeling toolkit built on PyTorch☆19Mar 17, 2023Updated 2 years ago
- ☆17Apr 14, 2023Updated 2 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and quickly.☆16Jun 17, 2022Updated 3 years ago
- ☆16Jun 13, 2022Updated 3 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Oct 26, 2020Updated 5 years ago
- DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.☆19Nov 23, 2021Updated 4 years ago
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- Learning visually grounded word embeddings using Abstract scenes☆18Mar 1, 2019Updated 6 years ago
- A handy dataset of noises for ASR☆22May 29, 2019Updated 6 years ago
- ☆17Aug 27, 2025Updated 5 months ago
- speech-aligner is a tool that generates phoneme-level time-alignment annotations from human speech and its corresponding text.☆15Dec 19, 2018Updated 7 years ago
- Tensorflow Implementation of Multi-Function Recurrent Unit☆23Jun 13, 2016Updated 9 years ago
- Python wrapper for kaldi's arpa2fst☆38Aug 27, 2025Updated 5 months ago
- A package for crawling audio from YouTube☆42Aug 8, 2023Updated 2 years ago
- Evaluating Visual Conversational Agents via Cooperative Human-AI Games☆23Nov 22, 2022Updated 3 years ago
- Code for the Paper Speech Recognition and Multi-Speaker Diarization of Long Conversations☆38Jun 12, 2023Updated 2 years ago
- [CVPR 2017] AMT chat interface code used to collect the Visual Dialog dataset☆78Jun 10, 2022Updated 3 years ago
- A C++ library for parsing and manipulating JSGF grammar files.☆14Feb 13, 2024Updated 2 years ago
- Scripts for recreating the Replication Dataset for Fundamental Frequency Estimation. Part of the dissertation "Pitch of Voiced Speech in …☆11Mar 29, 2021Updated 4 years ago
- Testing sets for semanticVAD☆20Feb 18, 2025Updated 11 months ago
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…☆11Jun 28, 2021Updated 4 years ago
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- Multistream CNN for Robust Acoustic Modeling☆40Jun 17, 2021Updated 4 years ago
- replicate the results of rule extract lstm☆16Jun 9, 2017Updated 8 years ago