Audio processing using deep neural networks. Speaker identification using voice embeddings.
☆13Dec 8, 2022Updated 3 years ago
Alternatives and similar repositories for voice-embeddings
Users that are interested in voice-embeddings are comparing it to the libraries listed below.
- ☆15Mar 31, 2022Updated 4 years ago
- Steps to perform text-based speaker diarization with the Kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- PyTorch implementation for MRL☆23Feb 22, 2024Updated 2 years ago
- Permutation invariant training in PyTorch☆13Oct 2, 2020Updated 5 years ago
- ☆12Nov 14, 2022Updated 3 years ago
- ☆29Jun 23, 2022Updated 3 years ago
- Get an OpenCV video capture from a YouTube video URL☆27Aug 26, 2024Updated last year
- Code for "Mixed Cross Entropy Loss for Neural Machine Translation"☆20Jul 23, 2021Updated 4 years ago
- This library allows you to communicate with XSens module using UART.☆11Jan 14, 2020Updated 6 years ago
- The goal of this project is to use scanning lidar to create a map which will enable autonomous navigation of a simple robot☆14Jun 27, 2021Updated 4 years ago
- This repository contains an implementation of A2C with GAE, which is used to control a robot in a MuJoCo environment.☆10Jan 6, 2020Updated 6 years ago
- Underwater Communication & Navigation Laboratory documentation site☆13Apr 23, 2026Updated last week
- Deep reinforcement learning in autonomous driving☆12Aug 25, 2021Updated 4 years ago
- ofxDualShock4 is an openFrameworks addon which accesses the gyroscope and accelerometer data from a PS4 controller, and uses them to esti…☆15Dec 8, 2016Updated 9 years ago
- CORALL (COLREGs-guided Risk Aware LLM) is a novel framework that integrates Large Language Models with real-time risk assessment for auto…☆25Feb 11, 2026Updated 2 months ago
- This repository hosts code for converting the original MLP Mixer models (JAX) to TensorFlow.☆15Sep 29, 2021Updated 4 years ago
- Lightweight multiple sound source localization, based on a triangular microphone array.☆16Dec 5, 2023Updated 2 years ago
- ApertureDB Python Client☆12Apr 23, 2026Updated last week
- A Python library that supports all vector databases, specifically for LLM apps and frameworks☆13May 3, 2023Updated 3 years ago
- ☆15Sep 6, 2021Updated 4 years ago
- ☆11Jul 7, 2020Updated 5 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Apr 8, 2022Updated 4 years ago
- This repo contains a demo of adversarial strings poisoning a vector database and forcing specific hallucinations on a RAG chatbot.☆10May 2, 2024Updated 2 years ago
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…☆32Mar 20, 2021Updated 5 years ago
- Vector Database Lite (like SQLITE but for vectors)☆13Jul 10, 2022Updated 3 years ago
- A CLI tool for finding the files that count 🤠🔫☆13Feb 24, 2025Updated last year
- Given a job description, the model uses POS and Classifier to determine the skills therein.☆35Aug 11, 2020Updated 5 years ago
- Contains code for Deep Self Supervised Hierarchical Clustering for Speaker Diarization☆17Dec 16, 2021Updated 4 years ago
- PyTorch Implementation of Context-Aware Sequential Model for Multi-Behaviour Recommendation https://arxiv.org/abs/2312.09684☆10May 31, 2024Updated last year
- ☆16Jul 6, 2023Updated 2 years ago
- This repository is a version of VINS-Fusion with gpu acceleration for OpenCV 4.☆17Aug 2, 2021Updated 4 years ago
- Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting☆17Nov 30, 2021Updated 4 years ago
- Text preprocessing package for use in NLP tasks https://pypi.org/project/textcl/☆12Aug 9, 2024Updated last year
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆23Jul 30, 2024Updated last year
- Upstash Vector Python SDK☆18Oct 21, 2025Updated 6 months ago
- ☆20Jan 5, 2023Updated 3 years ago
- ☆17Mar 22, 2024Updated 2 years ago
- CLIP is an open source, multimodal computer vision model and it's awesome!☆17Dec 16, 2024Updated last year
- Regularized latent variable mixed membership modeling☆13Aug 12, 2013Updated 12 years ago