Official repo for the Vietnam-Celeb dataset
☆26Aug 27, 2023Updated 2 years ago
Alternatives and similar repositories for Vietnam-Celeb.Interspeech
Users that are interested in Vietnam-Celeb.Interspeech are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Mar 9, 2023Updated 3 years ago
- Ichigo Whisper is a compact (22M parameters), open-source speech tokenizer for the Whisper-medium, designed to enhance performance on mul…☆17Jan 20, 2025Updated last year
- Demo of building and intergraition MCP Server☆21Apr 9, 2025Updated 11 months ago
- vntokenizer 4.1 by LE-HONG Phuong☆11Dec 13, 2016Updated 9 years ago
- Investigating Cultural Alignment of Large Language Models☆13Aug 14, 2024Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- ☆20Oct 3, 2021Updated 4 years ago
- A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022)☆22Jun 5, 2025Updated 9 months ago
- A search engine implementation using OpenAI's clip model☆10Jun 20, 2021Updated 4 years ago
- ☆46Jan 22, 2024Updated 2 years ago
- Python - NSW package for Vietnamese: Normalization system to convert numbers, abbreviations, and words that cannot be pronounced into syl…☆67Jan 1, 2025Updated last year
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset☆12Sep 29, 2025Updated 6 months ago
- 🏆 The 1st Place Solution for AICity2022 Challenge Track2: Natural Language-Based Vehicle Retrieval.☆12Jul 25, 2022Updated 3 years ago
- Official PyTorch implementation of the paper "Robust Training for Speaker Verification against Noisy Labels" in INTERSPEECH 2023.☆11Oct 23, 2023Updated 2 years ago
- Hybrid-Anchor Rotation Detector for Oriented Object Detection (ICCV'25-SEA)☆16Aug 11, 2025Updated 7 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. …☆686Dec 25, 2024Updated last year
- zero shot NER fine tuning☆14Mar 17, 2025Updated last year
- ☆15Feb 25, 2023Updated 3 years ago
- official implementation of paper ExPO: Explainable Phonetic Trait-Oriented Network for Speaker Verification☆14Mar 14, 2025Updated last year
- The pytorch implementation of CDistNet: Perceiving Multi-Domain Character Distance for Robust Text Recognition☆25Dec 16, 2021Updated 4 years ago
- [INTERSPEECH'24] Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detection☆55Dec 4, 2024Updated last year
- Code and datasets for the salesforce AI research paper on prompt leakage and multi-turn threats against LLMs☆22Nov 10, 2025Updated 4 months ago
- Modular Pluralism @ EMNLP 2024☆25Sep 20, 2024Updated last year
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆21Jun 7, 2025Updated 9 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A web based command line interface in a Docker container, based on ttyd.☆11Mar 15, 2021Updated 5 years ago
- ☆12Nov 1, 2023Updated 2 years ago
- ☆32Mar 3, 2026Updated 3 weeks ago
- An impelementation of image search engine using CLIP (Contrastive Language-Image Pre-Training☆15Aug 9, 2024Updated last year
- End-to-End Vietnamese Speech Recognition using wav2vec 2.0☆105Sep 3, 2021Updated 4 years ago
- A public repo that contains integrations for Argilla and LlamaIndex.☆17Oct 10, 2024Updated last year
- Vi_G2P or ViG2P: G2P package for Vietnamese: based on vPhon and phonology knowledge to convert Raw text - Graphoneme to IPA☆107Jun 21, 2024Updated last year
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Oct 19, 2022Updated 3 years ago
- openvino version of openai/whisper☆15Oct 8, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆11Oct 24, 2022Updated 3 years ago
- Visualization tools for audio-only and multi-modal speaker diarization dataset☆13Oct 27, 2023Updated 2 years ago
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆19Jul 16, 2024Updated last year
- An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement☆187Sep 1, 2025Updated 6 months ago
- Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"☆18Jun 24, 2022Updated 3 years ago
- jupyter notebooks to fine tune whisper models on Vietnamese using Colab and/or Kaggle and/or AWS EC2☆19Aug 15, 2025Updated 7 months ago
- This repository provides some useful snippets that you may need in some situations.☆15Mar 10, 2026Updated 2 weeks ago