Python Voice Activity Detection for Chat Bots
☆14Mar 31, 2019Updated 6 years ago
Alternatives and similar repositories for PythonVAD
Users that are interested in PythonVAD are comparing it to the libraries listed below
Sorting:
- ☆10Feb 17, 2023Updated 3 years ago
- Code for the paper "Free-View Expressive Talking Head Video Editing" (ICASSP 2023)☆12May 26, 2024Updated last year
- Codeformer Tensorrt Face Restoration☆13Apr 15, 2024Updated last year
- Chroma key (green screen removal) algorithms with Python☆11Jul 14, 2024Updated last year
- Chinese character recognition☆10Oct 27, 2020Updated 5 years ago
- Face Swap☆12Jun 2, 2023Updated 2 years ago
- A demo project demonstrating the performance improvement by cpp extension, which wrapped with pybind11.☆10Nov 16, 2021Updated 4 years ago
- Notebooks etc. Analysis of SNOMED-CT for the Clinical Coding Pilot and related work☆14Jan 10, 2021Updated 5 years ago
- English ASR Challenge organized by Speech Lab, IIT Madras☆11Feb 3, 2021Updated 5 years ago
- 基于官方提供的CosyVoice改造,整体交互适配CosyVoice2模型,开箱即用☆22Jun 15, 2025Updated 8 months ago
- This repository is a voice search demo using OpenAI Whisper, DuckDB, and the Metaphone algorithm. The associate blog post is here: https:…☆13May 15, 2024Updated last year
- image-transfer-with-background-preserved, based on AnimeGANv2 and Mask-RCNN☆15Jun 6, 2024Updated last year
- A .jar file with keywords for enaml syntax highlighting☆11Nov 23, 2020Updated 5 years ago
- Portrait matting model for academic use only.☆13Jan 7, 2024Updated 2 years ago
- find landmark from dog face☆11Jun 6, 2022Updated 3 years ago
- This repository contains the complete source code of the MedTAG annotation tool. MedTAG is a biomedical annotation tool for tagging biome…☆12Jan 1, 2023Updated 3 years ago
- Automatically generate a lip-synced avatar based off of a transcript and audio☆15Feb 17, 2023Updated 3 years ago
- Empirical Evaluation of Speaker Adaptation on DNN based Acoustic Model☆13Nov 25, 2019Updated 6 years ago
- Denoising autoencoders for speaker identification on MCE 2018 challenge☆12Nov 8, 2018Updated 7 years ago
- A Voice Command driven Virtual Assistant for running Statistical Analysis☆14Mar 30, 2019Updated 6 years ago
- Unsupervised Speaker Clustering & Speaker Recognition☆13Jan 7, 2019Updated 7 years ago
- deidentify patient notes using pre-trained BERT☆14Jan 11, 2026Updated last month
- Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding☆11May 19, 2023Updated 2 years ago
- Code Implementation of TDS Article "Semi-supervised Intent Classification with GAN-BERT"☆14Aug 19, 2020Updated 5 years ago
- A Disease-Symptoms Network and a system that predicts diseases from symptoms using a decision tree classifier.☆14Sep 23, 2020Updated 5 years ago
- Web application for easy and convenient viewing of OCR results.☆15Apr 13, 2021Updated 4 years ago
- ☆21Mar 7, 2025Updated 11 months ago
- Check the grammar of a given sentence using BERT and ULMFIT.☆15Mar 20, 2021Updated 4 years ago
- In this notebook i implement clinical text classfication on the medical transcription dataset from kaggle☆13Jul 14, 2020Updated 5 years ago
- ☆18Jul 29, 2022Updated 3 years ago
- A recipe for creating a Speaker Identification system built on Kaldi.☆15Jan 2, 2020Updated 6 years ago
- ☆23Apr 10, 2025Updated 10 months ago
- ☆16Dec 18, 2023Updated 2 years ago
- Source code for the ACL-IJCNLP 2021 paper entitled "T-DNA: Taming Pre-trained Language Models with N-gram Representations for Low-Resourc…☆19Jan 12, 2023Updated 3 years ago
- ☆18Oct 22, 2021Updated 4 years ago
- Running MedCAT as a RESTful web service☆22Jun 23, 2025Updated 8 months ago
- Arabic Speech Recognition with Whisper: Fine-tune the Whisper model from OpenAI for Arabic speech recognition tasks. This repository prov…☆21Feb 28, 2024Updated last year
- ☆17Dec 7, 2019Updated 6 years ago
- Kaldi API for Android, Python and Node. Forked from vosk-api with minimal modifications.☆16Nov 14, 2020Updated 5 years ago