xiabingquan / Automatic-Speech-Recognition-from-Scratch
A minimal Seq2Seq example of Automatic Speech Recognition (ASR) based on the Transformer
☆84Apr 29, 2024Updated last year
Alternatives and similar repositories for Automatic-Speech-Recognition-from-Scratch
Users that are interested in Automatic-Speech-Recognition-from-Scratch are comparing it to the libraries listed below
- Baseline system for CNVSRC2023 (Chinese Continuous Visual Speech Recognition Challenge 2023)☆22Apr 27, 2024Updated last year
- The MAVD represents Mandarin Audio-Visual dataset with Depth information. MAVD has a rich variety of modal data, including audio, RGB ima…☆20Apr 22, 2024Updated last year
- Awesome Automatic Speech Recognition (ASR) paper collection☆22Sep 4, 2020Updated 5 years ago
- a PyTorch implementation of Lip2Wav☆50Oct 2, 2022Updated 3 years ago
- Implementation of KDR-Agent, the AAAI 2025 accepted paper, focusing on knowledge-driven reasoning for autonomous agents.☆16Nov 24, 2025Updated 2 months ago
- Paster core module using KiteX☆10Aug 30, 2023Updated 2 years ago
- Some microbenchmarks and design docs before commencement☆12Feb 1, 2021Updated 5 years ago
- Shared documents☆10Aug 1, 2024Updated last year
- Pytorch implementation for DeepSpeech 2.0☆31Jul 25, 2024Updated last year
- A voice spoofing detection system, based on paper presented at ICSPIS 2021☆10Feb 11, 2022Updated 4 years ago
- ☆39Oct 19, 2025Updated 3 months ago
- ☆11Jun 15, 2019Updated 6 years ago
- Train I3D on NTU-RGB+D dataset in keras☆12Feb 5, 2019Updated 7 years ago
- ☆11Aug 20, 2025Updated 5 months ago
- Anki add-on that adds Pinyin and Zhuyin readings above Chinese characters in any field.☆12Sep 23, 2025Updated 4 months ago
- The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation”☆15Jan 3, 2025Updated last year
- [ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels☆42Mar 20, 2024Updated last year
- Simple, non-authoritative benchmarks for embedded databases running in GitHub Actions☆11Jul 11, 2024Updated last year
- A/B test knowledge system.☆12Sep 24, 2020Updated 5 years ago
- DI, IoC container☆14Nov 9, 2023Updated 2 years ago
- concurrent map implementation using bucket list like a skip list.☆10May 29, 2022Updated 3 years ago
- [Interspeech 2023] Intelligible Lip-to-Speech Synthesis with Speech Units☆47Oct 26, 2024Updated last year
- Chinese speech recognition (trains a speech recognition model on the AISHELL-3 dataset)☆11Oct 17, 2024Updated last year
- Companion code for Awe the Audience: How the Narrative Trajectories Affect Audience Perception in Public Speaking☆14Jan 6, 2018Updated 8 years ago
- An open source community implementation of the model MELLE from the paper: "Autoregressive Speech Synthesis without Vector Quantization"☆14Feb 9, 2026Updated last week
- [ICML 2022] "Linearity Grafting: Relaxed Neuron Pruning Helps Certifiable Robustness" by Tianlong Chen*, Huan Zhang*, Zhenyu Zhang, Shiyu…☆17Jun 22, 2022Updated 3 years ago
- Relation Classification via Convolutional Deep Neural Network☆13Nov 9, 2018Updated 7 years ago
- Docker base images for C++ development using vcpkg☆11Jan 27, 2026Updated 2 weeks ago
- Benchmark scripts for comparing tutorials in PyTorch and JAX☆14Aug 25, 2022Updated 3 years ago
- This repository contains the speaker labeled information of VoxCeleb2 and LRS3 audio-visual datasets. (AAAI 2025)☆12Sep 6, 2024Updated last year
- Introduction to Praat scripting☆15Apr 8, 2025Updated 10 months ago
- A simple dictionary in Manchu, Chinese and English.☆13Feb 27, 2015Updated 10 years ago
- ☆14Mar 9, 2023Updated 2 years ago
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"☆14Feb 24, 2025Updated 11 months ago
- An alternative to elasticsearch engine written in Go for small set of documents that uses inverted index to build the index and utilizes …☆15Jun 14, 2020Updated 5 years ago
- Make sure your remote stays up to date with changes to local code☆18Jun 26, 2020Updated 5 years ago
- 🚀 Hainan University Principles of Compilers course: extensions to a pl0 language compiler☆10Dec 19, 2020Updated 5 years ago
- A handwriting font with full support for Hudum Mongolian, Sibe, Manchu and Manchu Ali Gali☆11Jun 26, 2022Updated 3 years ago
- An open-source toolkit for analyzing line-oriented JSON Twitter archives with Apache Spark.☆10Dec 11, 2024Updated last year