Official repo for the Vietnam-Celeb dataset
☆26Aug 27, 2023Updated 2 years ago
Alternatives and similar repositories for Vietnam-Celeb.Interspeech
Users that are interested in Vietnam-Celeb.Interspeech are comparing it to the libraries listed below
Sorting:
- ☆11Mar 9, 2023Updated 3 years ago
- Ichigo Whisper is a compact (22M parameters), open-source speech tokenizer for the Whisper-medium, designed to enhance performance on mul…☆17Jan 20, 2025Updated last year
- A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022)☆22Jun 5, 2025Updated 9 months ago
- Python - NSW package for Vietnamese: Normalization system to convert numbers, abbreviations, and words that cannot be pronounced into syl…☆66Jan 1, 2025Updated last year
- ☆12Nov 12, 2024Updated last year
- VietGPT VoiceBot: Chatbot automatically recognizes Vietnamese voice and uses the ChatGPT API for natural language interaction.☆41May 21, 2024Updated last year
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆37Oct 6, 2023Updated 2 years ago
- Vi_G2P or ViG2P: G2P package for Vietnamese: based on vPhon and phonology knowledge to convert Raw text - Graphoneme to IPA☆104Jun 21, 2024Updated last year
- End-to-End Vietnamese Speech Recognition using wav2vec 2.0☆105Sep 3, 2021Updated 4 years ago
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset☆12Sep 29, 2025Updated 5 months ago
- This repository provides some useful snippets that you may need in some situations.☆12Jan 16, 2024Updated 2 years ago
- Repository for "CoMix: Comprehensive Benchmark for Multi-Task Comic Understanding"☆16Nov 20, 2024Updated last year
- ☆11Oct 24, 2022Updated 3 years ago
- zero shot NER fine tuning☆14Mar 17, 2025Updated 11 months ago
- Awesome Multimodal Fusion in Speech Emotion Recognition☆13Nov 11, 2025Updated 3 months ago
- ☆15Feb 25, 2023Updated 3 years ago
- [INTERSPEECH'24] Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detection☆54Dec 4, 2024Updated last year
- This is the implementation of the paper "Physiological-Physical Feature Fusion for Automatic Voice Spoofing Detection"☆13Jun 5, 2023Updated 2 years ago
- A Pytorch Lightning WGAN-gp to generate faces☆11Jan 26, 2021Updated 5 years ago
- ☆14Apr 23, 2025Updated 10 months ago
- vntokenizer 4.1 by LE-HONG Phuong☆11Dec 13, 2016Updated 9 years ago
- Local text-to-speech in your browser with Piper TTS☆17Aug 13, 2025Updated 6 months ago
- A search engine implementation using OpenAI's clip model☆10Jun 20, 2021Updated 4 years ago
- ☆46Jan 22, 2024Updated 2 years ago
- Deploy docs from your source tree to a GitHub wiki☆13Jun 14, 2023Updated 2 years ago
- Implementation of the paper: "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning" in pytorch☆14Mar 2, 2026Updated last week
- official implementation of paper ExPO: Explainable Phonetic Trait-Oriented Network for Speaker Verification☆14Mar 14, 2025Updated 11 months ago
- Batch-based installer for ostris/ai-toolkit. Sets up a Python 3.12 virtual environment, installs PyTorch with CUDA 12.8, Triton, and all …☆33Feb 18, 2026Updated 2 weeks ago
- 🏆 The 1st Place Solution for AICity2022 Challenge Track2: Natural Language-Based Vehicle Retrieval.☆12Jul 25, 2022Updated 3 years ago
- Code and datasets for the salesforce AI research paper on prompt leakage and multi-turn threats against LLMs☆20Nov 10, 2025Updated 3 months ago
- Investigating Cultural Alignment of Large Language Models☆13Aug 14, 2024Updated last year
- jupyter notebooks to fine tune whisper models on Vietnamese using Colab and/or Kaggle and/or AWS EC2☆19Aug 15, 2025Updated 6 months ago
- openvino version of openai/whisper☆15Oct 8, 2024Updated last year
- Visualization tools for audio-only and multi-modal speaker diarization dataset☆13Oct 27, 2023Updated 2 years ago
- Hybrid-Anchor Rotation Detector for Oriented Object Detection (ICCV'25-SEA)☆16Aug 11, 2025Updated 6 months ago
- An AI assistant can help you with content composition right in your Microsoft Word☆17Sep 10, 2024Updated last year
- Flask-SocketIO个人翻译练习的,API部分没翻译。☆14Jan 11, 2017Updated 9 years ago
- Zalo Text-To-Speech for python☆11May 10, 2021Updated 4 years ago
- 中文停用词汇总,持续完善中,欢迎push共建☆16Jun 12, 2023Updated 2 years ago