Official repo for the Vietnam-Celeb dataset
☆26Aug 27, 2023Updated 2 years ago
Alternatives and similar repositories for Vietnam-Celeb.Interspeech
Users who are interested in Vietnam-Celeb.Interspeech are comparing it to the repositories listed below.
- ☆12Mar 9, 2023Updated 3 years ago
- Ichigo Whisper is a compact (22M parameters), open-source speech tokenizer for Whisper-medium, designed to enhance performance on mul…☆17Jan 20, 2025Updated last year
- Demo of building and integrating an MCP Server☆20Apr 9, 2025Updated last year
- vntokenizer 4.1 by LE-HONG Phuong☆11Dec 13, 2016Updated 9 years ago
- Conmato: A Command Line Interface (CLI) for Codeforces Management Tools that helps coaches manage Codeforces groups more easily☆10Feb 22, 2023Updated 3 years ago
- Investigating Cultural Alignment of Large Language Models☆13Aug 14, 2024Updated last year
- ☆20Oct 3, 2021Updated 4 years ago
- A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022)☆23Jun 5, 2025Updated 11 months ago
- A search engine implementation using OpenAI's CLIP model☆10Jun 20, 2021Updated 4 years ago
- ☆46Jan 22, 2024Updated 2 years ago
- Python - NSW package for Vietnamese: Normalization system to convert numbers, abbreviations, and words that cannot be pronounced into syl…☆67Jan 1, 2025Updated last year
- Implementation of the paper "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning" in PyTorch☆15Apr 13, 2026Updated 3 weeks ago
- ☆12Nov 12, 2024Updated last year
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset☆13Sep 29, 2025Updated 7 months ago
- 🏆 The 1st Place Solution for AICity2022 Challenge Track2: Natural Language-Based Vehicle Retrieval.☆12Jul 25, 2022Updated 3 years ago
- Official PyTorch implementation of the paper "Robust Training for Speaker Verification against Noisy Labels" in INTERSPEECH 2023.☆11Oct 23, 2023Updated 2 years ago
- Hybrid-Anchor Rotation Detector for Oriented Object Detection (ICCV'25-SEA)☆17Aug 11, 2025Updated 8 months ago
- INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. …☆687Dec 25, 2024Updated last year
- This is the implementation of the paper "Physiological-Physical Feature Fusion for Automatic Voice Spoofing Detection"☆13Jun 5, 2023Updated 2 years ago
- Official implementation of the paper "ExPO: Explainable Phonetic Trait-Oriented Network for Speaker Verification"☆14Mar 14, 2025Updated last year
- The PyTorch implementation of CDistNet: Perceiving Multi-Domain Character Distance for Robust Text Recognition☆25Dec 16, 2021Updated 4 years ago
- [INTERSPEECH'24] Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detection☆58Dec 4, 2024Updated last year
- Code and datasets for the Salesforce AI Research paper on prompt leakage and multi-turn threats against LLMs☆22Nov 10, 2025Updated 5 months ago
- Modular Pluralism @ EMNLP 2024☆26Sep 20, 2024Updated last year
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆22Jun 7, 2025Updated 11 months ago
- A web based command line interface in a Docker container, based on ttyd.☆11Mar 15, 2021Updated 5 years ago
- ☆12Nov 1, 2023Updated 2 years ago
- An implementation of an image search engine using CLIP (Contrastive Language-Image Pre-Training)☆15Aug 9, 2024Updated last year
- End-to-End Vietnamese Speech Recognition using wav2vec 2.0☆105Sep 3, 2021Updated 4 years ago
- A public repo that contains integrations for Argilla and LlamaIndex.☆17Oct 10, 2024Updated last year
- LightVoc: An Upsampling-Free GAN Vocoder Based on Conformer and Inverse Short-Time Fourier Transform☆18May 17, 2024Updated last year
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Oct 19, 2022Updated 3 years ago
- Vi_G2P or ViG2P: G2P package for Vietnamese: based on vPhon and phonology knowledge to convert Raw text - Graphoneme to IPA☆109Jun 21, 2024Updated last year
- OpenVINO version of openai/whisper☆15Oct 8, 2024Updated last year
- ☆11Oct 24, 2022Updated 3 years ago
- Visualization tools for audio-only and multi-modal speaker diarization datasets☆13Oct 27, 2023Updated 2 years ago
- Code for the INTERSPEECH 2024 paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆19Jul 16, 2024Updated last year
- Repository for the paper "ViHateT5: Enhancing Hate Speech Detection in Vietnamese with A Unified Text-to-Text Transformer Model" (ACL'202…☆10Aug 13, 2024Updated last year
- Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"☆18Jun 24, 2022Updated 3 years ago