cvlab-columbia / voicecamoView external linksLinks
Code for the paper Real-Time Neural Voice Camouflage
☆28Apr 13, 2022Updated 3 years ago
Alternatives and similar repositories for voicecamo
Users that are interested in voicecamo are comparing it to the libraries listed below
Sorting:
- Maze algorithms implemented in JavaScript - many maze generators and tiling patterns☆16Oct 2, 2022Updated 3 years ago
- Code of paper "AdvReverb: AdvReverb: Rethinking the Stealthiness of Audio Adversarial Examples to Human Perception"☆18Nov 26, 2023Updated 2 years ago
- This repository collects papers related to Speech Tokenizer.☆17Oct 16, 2024Updated last year
- Official repo of ICASSP 2022 paper - Don't Separate, Learn to Remix: End-to-End Neural Remixing with Joint Optimization☆20Jan 7, 2025Updated last year
- [ACM MM 24] GROOT:Generating Robust Watermark for Diffusion-Model-Based Audio Synthesis☆20Mar 24, 2025Updated 10 months ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Aug 24, 2023Updated 2 years ago
- Continual Learning Method RAWM for ICML 2023☆23Sep 26, 2024Updated last year
- ☆24Jun 25, 2025Updated 7 months ago
- Who calls the shots? Rethinking Few-Shot Learning for Audio (WASPAA 2021)☆43May 24, 2022Updated 3 years ago
- Chinese polyphone disambiguation for Text-to-Speech application☆42Jun 11, 2024Updated last year
- Read image segmentation masks fast☆13Jul 25, 2024Updated last year
- This is the official repo of our work titled "The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio".☆66Dec 13, 2024Updated last year
- When can you tell whether an image has been cropped or not?☆29Sep 19, 2021Updated 4 years ago
- End-To-End Speaker Verification based on X-vector and Neural PLDA - A PyTorch implementation☆23Feb 17, 2022Updated 4 years ago
- A lightweight library for Frechet Audio Distance calculation.☆308Updated this week
- Official repo of ISMIR-21 publication, “A Benchmarking Initiative for Audio-domain Music Generation using the FreeSound Loop Dataset”.☆83Nov 17, 2021Updated 4 years ago
- Machine learning tools and framework for automatic music transcription.☆36Jun 17, 2024Updated last year
- STARS: A Unified Framework for Singing Transcription, Alignment, and Refined Style Annotation☆71Nov 11, 2025Updated 3 months ago
- Python scripts to process EVS (Event-based vision sensor) data☆10Jan 30, 2024Updated 2 years ago
- Text Corpus of African American Fiction and Poetry, from 1853-1923☆10Aug 5, 2020Updated 5 years ago
- MetaC provides a read-eval-print loop (a REPL) and notebook interactive development environment (a NIDE) for C programming. MetaC also …☆12Feb 10, 2026Updated last week
- Assembly language (汇编语言程序设计 第三版 王爽)☆12Aug 17, 2022Updated 3 years ago
- Fully Local Push-to-Transcribe☆17Nov 6, 2025Updated 3 months ago
- Token classification using Phobert Models for Vietnamese☆13Jul 8, 2022Updated 3 years ago
- Apparel Classification for Indian Ethnic Clothes☆12Feb 10, 2023Updated 3 years ago
- BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation☆228Apr 26, 2023Updated 2 years ago
- Code for Learning to Learn Language from Narrated Video☆33Oct 3, 2023Updated 2 years ago
- SouPyX: An Audio Exploration Space.🪐☆42Nov 28, 2023Updated 2 years ago
- Dockerfile for johnsmith0031/alpaca_lora_4bit☆12Apr 10, 2023Updated 2 years ago
- springboot环境下java调用c程序生成动态链接库(.so文件),并调用(基于JNI,Ubuntu)☆11Aug 26, 2024Updated last year
- Recreating the phase functioned neural network in unreal engine 5☆15May 12, 2024Updated last year
- ☆11Feb 24, 2023Updated 2 years ago
- End-to-end OMR system based on deep learning.☆38Mar 25, 2022Updated 3 years ago
- GPT-5 and Opus 4.1 implementations of one-shot coding examples☆17Feb 6, 2026Updated last week
- Official Code for Large-vocabulary forensic pathological analyses via prototypical cross-modal contrastive learning☆15Jul 24, 2025Updated 6 months ago
- MATLAB code for solving the Euclidean Distance Matrix completion problem.☆10Nov 20, 2017Updated 8 years ago
- a new stem dataset for Music Demixing research, from the OnAir royalty-free music project☆37Mar 14, 2023Updated 2 years ago
- ☆11Apr 18, 2024Updated last year
- Speaker embedding for VI-SVC and VI-SVS, alse for VITS; Use this to replace the ID to implement voice clone.☆30Sep 16, 2022Updated 3 years ago