Audio processing using deep neural networks. Speaker identification using voice embeddings.
☆13Dec 8, 2022Updated 3 years ago
Alternatives and similar repositories for voice-embeddings
Users that are interested in voice-embeddings are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Eurorack Busboard☆13Oct 24, 2019Updated 6 years ago
- Reproduction of the paper SFSRNet: Super-resolution for single-channel Audio Source Separation by me (@arda-num) and @dritx16. Navigate P…☆11Jul 7, 2022Updated 3 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- PyTorch implementation for MRL☆23Feb 22, 2024Updated 2 years ago
- the official chroma pickle wrapper☆20Apr 3, 2023Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Permutation invariant training in PyTorch☆13Oct 2, 2020Updated 5 years ago
- The Multi-band Excited WaveNet☆16Feb 2, 2023Updated 3 years ago
- ☆12Nov 14, 2022Updated 3 years ago
- ☆30Jun 23, 2022Updated 3 years ago
- Eurorack projects☆13Jun 5, 2015Updated 10 years ago
- Documentation of the Two!Ears Auditory Model☆13Feb 14, 2019Updated 7 years ago
- Get an OpenCV video capture from an YouTube video URL☆27Aug 26, 2024Updated last year
- A deep neural network for finding text-independent speaker embedding written in tensorflow and tensorpack☆10Feb 19, 2018Updated 8 years ago
- This repository contains implementation of A2C with GAE, which is used to control robot in MuJoCo environment.☆10Jan 6, 2020Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Obsolete repo, merged into eynollah☆12Sep 29, 2025Updated 6 months ago
- target speaker verification (tSV), ts-vector, universal speaker verification for single- and multi-talker speech☆15Jan 26, 2021Updated 5 years ago
- Deep reinforcement learning in autonomous driving☆12Aug 25, 2021Updated 4 years ago
- Batch processing using joblib including tqdm progress bars☆20Dec 29, 2021Updated 4 years ago
- CORALL (COLREGs-guided Risk Aware LLM) is a novel framework that integrates Large Language Models with real-time risk assessment for auto…☆24Feb 11, 2026Updated 2 months ago
- Extended Kalman filter for attitude estimation on a multi-IMU configuration☆13Sep 2, 2022Updated 3 years ago
- ☆13Mar 31, 2026Updated 2 weeks ago
- A python library that supports all vector databases specifically for LLM apps and frameworks☆13May 3, 2023Updated 2 years ago
- ☆31Feb 28, 2021Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python☆18Jun 18, 2023Updated 2 years ago
- Voice conversion training with 109 speakers with limited training samples☆35Dec 21, 2020Updated 5 years ago
- This repo contains a demo of adversarial strings poisoning vector database and forching specific hallucinations on RAG chatbot.☆10May 2, 2024Updated last year
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…☆32Mar 20, 2021Updated 5 years ago
- Vector Database Lite (like SQLITE but for vectors)☆13Jul 10, 2022Updated 3 years ago
- ☆13Jan 8, 2024Updated 2 years ago
- A CLI tool for finding the files that count 🤠🔫☆13Feb 24, 2025Updated last year
- PyTorch Implementation of Context-Aware Sequential Model for Multi-Behaviour Recommendation https://arxiv.org/abs/2312.09684☆10May 31, 2024Updated last year
- ☆16Jul 6, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- This repository is a version of VINS-Fusion with gpu acceleration for OpenCV 4.☆17Aug 2, 2021Updated 4 years ago
- Detailed introduction of TFmini-Plus☆17Sep 27, 2019Updated 6 years ago
- Text preprocessing package for use in NLP tasks https://pypi.org/project/textcl/☆12Aug 9, 2024Updated last year
- ☆20Jan 5, 2023Updated 3 years ago
- CyberAgent AI Lab研修: "モデルコードの高速化・最適化チュートリアル"☆35Mar 13, 2025Updated last year
- A multi-stage phase shifter in Eurorack format with up to 12 stages selectable☆25May 15, 2025Updated 11 months ago
- In this repository, we deal with developing different estimators to localize Transvahan - the e-vehicle on IISc Campus using measurements…☆19Jul 2, 2020Updated 5 years ago