manhph2211/ViSTT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/manhph2211/ViSTT)

manhph2211 / ViSTT

I'm building an end-to-end Vietnamese Speech Recognition System. I'll deploy it into production with the help of Flask, Uwsgi, Nginx, and AWS ...

☆17

Alternatives and similar repositories for ViSTT

Users that are interested in ViSTT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

manhph2211 / ViOCR
View on GitHub
This is our solution dealing with BKAI challenge
☆66Jul 24, 2022Updated 4 years ago
Davido111200 / QuestionAnswering_demoVbdi
View on GitHub
This project aims to build an English Question Answering web application. Instructions are given below. Have fun using our program :D
☆19Nov 13, 2022Updated 3 years ago
manhph2211 / MC-OCR
View on GitHub
The task aims at extracting required fields in receipts captured by mobile devices
☆35Nov 4, 2022Updated 3 years ago
ngocphucck / Facial-Authentification-System
View on GitHub
This project aims to build a streamlit app which includes face detection, face recognition, face anti-spoofing attacks and sentiment anal…
☆32Oct 1, 2022Updated 3 years ago
manhph2211 / ML-Deployment
View on GitHub
Pushing Deep Learning models into production using torchserve, kubernetes and react web app
☆27Jun 15, 2023Updated 3 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
manhph2211 / HRNet-Pose-Estimation
View on GitHub
This repo is about implementing pose estimation with HRNet and also, is a sub-task of the smart hospital bed project
☆12Jan 21, 2022Updated 4 years ago
manhph2211 / ViTTS
View on GitHub
In this repo, I developed a step-by-step pipeline for a standard MultiSpeaker Text-to-Speech system In general, I used Portaspeech as an…
☆12Nov 24, 2023Updated 2 years ago
manhph2211 / CP-PPG
View on GitHub
Reliable Wrist PPG Monitoring by Mitigating Poor Skin Sensor Contact (Scientific Reports)
☆21Apr 10, 2026Updated 3 months ago
manhph2211 / ViSR
View on GitHub
This repo builds an end-to-end deep learning application that supports speech recognition system. It's simple to use and understand
☆39May 23, 2023Updated 3 years ago
manhph2211 / D-BETA
View on GitHub
An ECG Foundation Model: Boosting Masked ECG-Text Auto-Encoders as Discriminative Learners (ICML 2025)
☆36Mar 7, 2026Updated 4 months ago
lacie-life / FruitCountingEngine
View on GitHub
Fruit yield estimation system using UAV
☆11Dec 7, 2022Updated 3 years ago
kh4nh12 / ViSoMeCens
View on GitHub
Vietnamese Social Media Censorship Application
☆15Sep 6, 2023Updated 2 years ago
h-munakata / Lighthouse-Wrapper-for-Audio-Moment-Retrieval
View on GitHub
☆13Mar 23, 2026Updated 4 months ago
duyichao / NPDA-KNN-ST
View on GitHub
Official implementation of EMNLP'2022 paper "Non-Parametric Domain Adaptation for End-to-End Speech Translation"
☆11Oct 26, 2022Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
fakerybakery / simpletts
View on GitHub
A lightweight Python library for running TTS models with a unified API.
☆20Feb 18, 2025Updated last year
audiodemo / voice-conversion
View on GitHub
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17Aug 18, 2023Updated 2 years ago
ArenAcikgoz / Whisper-Alignment
View on GitHub
Forced alignment decoder for Whisper.
☆16Mar 13, 2024Updated 2 years ago
thamquocdung / eCMU
View on GitHub
eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23)
☆10Oct 30, 2024Updated last year
alexisdmacintyre / SpeechBreathingToolbox
View on GitHub
Tools for the automatic detection of speech-related inhalation events and characterisation of the speech respiratory cycle.
☆11Feb 17, 2024Updated 2 years ago
ductuantruong / speaker_age_estimation_ssl_study
View on GitHub
[APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models
☆14Oct 19, 2022Updated 3 years ago
ORI-Muchim / Efficient-Speech
View on GitHub
Lightweight Korean TTS Model based on FastSpeech2
☆15Mar 4, 2026Updated 4 months ago
lacie-life / TurtleBot3-MPC
View on GitHub
TurtleBot3-MPC
☆19Jun 3, 2022Updated 4 years ago
NTIA / alignnet
View on GitHub
Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.
☆18Aug 1, 2025Updated 11 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
projectlucas / efficient_whisper
View on GitHub
Robust Speech Recognition via Large-Scale Weak Supervision
☆19Dec 1, 2022Updated 3 years ago
trangnth / ghichep-prometheus
View on GitHub
Ghi chép trong quá trình tìm hiểu Prometheus, cảnh báo qua sms, telegram, slack, gmail
☆13Sep 17, 2022Updated 3 years ago
xmos / sln_voice
View on GitHub
XCORE-VOICE Solution
☆20Apr 8, 2026Updated 3 months ago
JosefAlbers / e2tts-mlx
View on GitHub
Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX
☆29Oct 15, 2024Updated last year
R1ckShi / FrontEnd-AEC
View on GitHub
Acoustic echo cancelation(AEC) is a main algorithm in the pipe line of acoustic devices with KWS or ASR. FNLMS is used.
☆19Apr 22, 2019Updated 7 years ago
lukaszliniewicz / breath-removal
View on GitHub
Detect and remove or lower the volume of breathing in speech recordings.
☆17May 14, 2025Updated last year
kjw11 / Speaker-Aware-CTC
View on GitHub
Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.
☆22May 26, 2025Updated last year
Sreyan88 / ReCLAP
View on GitHub
☆33Dec 23, 2025Updated 7 months ago
tarun360 / SpeakerProfiling
View on GitHub
Estimating the Age, Height, and Gender of a speaker with their speech signal.
☆15Sep 19, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
nguyenhongson1902 / algorithmic-toolbox-solutions
View on GitHub
This repo includes all of the solutions to the Algorithmic Toolbox course from Coursera
☆10Oct 10, 2022Updated 3 years ago
Zhongxu-Wang / ArtSpeech
View on GitHub
ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations
☆22Sep 21, 2025Updated 10 months ago
nguyenhongson1902 / lunar-lander-solver
View on GitHub
This is my project to solve the Lunar Lander environment using the Deep Q-Learning Algorithm with Experience Replay
☆12Jan 3, 2023Updated 3 years ago
csalt-research / accented-codebooks-asr
View on GitHub
☆19Sep 10, 2024Updated last year
goepfert / noise_reduction
View on GitHub
Audio De-Noiser using a Convolutional Neural Network Architecture built with Tensorflow.js
☆22Jun 7, 2023Updated 3 years ago
gladiaio / normalization
View on GitHub
A lightweight library for normalizing speech transcripts before computing WER
☆28Jul 14, 2026Updated last week
Mddct / simple-tts
View on GitHub
（WIP）long form speech generatoins
☆30Apr 2, 2025Updated last year