☆40Jul 19, 2018Updated 7 years ago
Alternatives and similar repositories for Looking-to-Listen
Users that are interested in Looking-to-Listen are comparing it to the libraries listed below
Sorting:
- Looking to listen at cocktail party☆36Mar 24, 2023Updated 2 years ago
- Code for the paper: Audio-Visual Scene Analysis with Self-Supervised Multisensory Features☆224Jul 17, 2019Updated 6 years ago
- Include some core functions and model to handle speech separation☆156Jun 24, 2021Updated 4 years ago
- Code for "Vid2speech: Speech Reconstruction from Silent Video" ICASSP '17☆115Feb 15, 2017Updated 9 years ago
- Executable code based on Google articles☆166Dec 8, 2022Updated 3 years ago
- Experiment in automatic insertion of timed transcript corrections☆21Oct 31, 2017Updated 8 years ago
- Coordinate-wise meta-learner for speaker adaptation of ASR models.☆20Dec 30, 2019Updated 6 years ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data☆95Jul 6, 2023Updated 2 years ago
- Robust Speech Activity Detection (SAD) in movie audio☆26Jan 27, 2021Updated 5 years ago
- It uses GMM to train a speaker identification model. The training and testing has been done on subset (34 speakers) from VoxForge data co…☆58Oct 4, 2019Updated 6 years ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments☆111Mar 19, 2024Updated last year
- SpeakerVoiceIdentifier can recognize the voice of a speaker by learning.☆35Feb 20, 2017Updated 9 years ago
- A simple javascript utility library to include partial html (iframe alternate) without a framework or jQuery.☆17Oct 21, 2022Updated 3 years ago
- AVSpeech downloader☆68Jan 30, 2019Updated 7 years ago
- Create, run, and schedule routines with Mycroft☆30Jan 16, 2022Updated 4 years ago
- CN-Celeb, a large-scale Chinese celebrities dataset published by Center for Speech and Language Technology (CSLT) at Tsinghua University.☆77Nov 9, 2019Updated 6 years ago
- A lightweight .NET Core console program to merge multiple TIFF files into one.☆12Jul 30, 2019Updated 6 years ago
- Text Independent Speaker Verification Using GE2E Loss☆84Dec 3, 2018Updated 7 years ago
- Client Side basit bir UDF ( Uyap Döküman Formatı) Okuyucu☆16Oct 23, 2018Updated 7 years ago
- Kuantum bilişim alanında Türkçe kaynak oluşturmayı amaçlayan, temel kavramlar, algoritmalar ve uygulama örnekleri içeren açık kaynaklı bi…☆16Oct 3, 2025Updated 5 months ago
- extractor chinese synonyms in large corpus☆11Jul 20, 2016Updated 9 years ago
- My first ever training of a piper tts voice☆16May 23, 2025Updated 9 months ago
- Cross-lingual Fact-to-Text Alignment and Generation for Low-Resource Languages☆11Jan 1, 2023Updated 3 years ago
- 📄Source code variable naming using a seq2seq architecture☆10Mar 19, 2020Updated 5 years ago
- Scripts to work with IRS 990 XML data☆10Jan 11, 2019Updated 7 years ago
- jQuery plugin for HID based RFid readers☆12Oct 16, 2013Updated 12 years ago
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories☆19Apr 10, 2025Updated 10 months ago
- An open source project on estimating train delays in India.☆11Oct 29, 2018Updated 7 years ago
- Examination Questions in the Dept. of Computer Science and Electronic Engineering.☆11Apr 2, 2025Updated 11 months ago
- Speech-conditioned face generation using Generative Adversarial Networks☆88Dec 8, 2022Updated 3 years ago
- Deep neural network (DNN) for noise reduction, removal of background music, and speech separation☆173Nov 21, 2022Updated 3 years ago
- Codebase for ECCV18 "The Sound of Pixels"☆391Apr 25, 2022Updated 3 years ago
- speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition☆498Jul 1, 2021Updated 4 years ago
- 🗞 Monitors data sources, alerts you when they change☆13Jul 23, 2021Updated 4 years ago
- A little web app that indexes geographic data layers available via ESRI REST endpoints so they are searchable.☆12Feb 24, 2026Updated last week
- This is the home directory to speaker diarization module being developed for Hetergeneous News data in RedHen Labs as a GSOC Project☆10Sep 11, 2015Updated 10 years ago
- A CUDA powered audio decoding framework for FLAC.☆11May 22, 2018Updated 7 years ago
- A collection of tools and guides for Empire Earth☆12Jun 24, 2022Updated 3 years ago
- webgl helper library☆11Jan 12, 2023Updated 3 years ago