shitian-ni / speech-recognition-transfer-learning
Speech command recognition with DenseNet, using transfer learning from UrbanSound8k, in Keras and TensorFlow
☆17 | Jan 19, 2018 | Updated 8 years ago
Alternatives and similar repositories for speech-recognition-transfer-learning
Users that are interested in speech-recognition-transfer-learning are comparing it to the libraries listed below
- Phoneme prediction from speech mel-spectrograms using RNN. ☆15 | Jun 4, 2019 | Updated 6 years ago
- Source code for 'Transfer Learning for Speech Recognition on a Budget' published at ACL 2017 ☆46 | May 30, 2017 | Updated 8 years ago
- Supporting code for instrumentation courses at Universidade Nova de Lisboa - Faculdade de Ciência de Lisboa ☆16 | Oct 7, 2022 | Updated 3 years ago
- Synthesizer Self-Attention is a very recent alternative to causal self-attention that has potential benefits by removing this dot product… ☆14 | Dec 29, 2024 | Updated last year
- Matlab code for learning doubly sparse dictionary on synthetic data. Details can be found in the paper "A Provable Approach for Double-Sp… ☆11 | Mar 5, 2018 | Updated 7 years ago
- Pytorch based phoneme recognition (TIMIT phoneme classification) ☆35 | Apr 25, 2018 | Updated 7 years ago
- Code for COLING 2022 accepted paper titled "MuCDN: Mutual Conversational Detachment Network for Emotion Recognition in Multi-Party Conver… ☆10 | Jul 21, 2023 | Updated 2 years ago
- Tool for slot extraction from text ☆15 | Oct 23, 2022 | Updated 3 years ago
- dotfiles for frontend-developer and python-user, including: vim(support vue files and python pylint), tmux, zsh(with oh-my-zsh) ☆11 | Jan 29, 2026 | Updated 2 weeks ago
- Speech to text transcription using RNN (Listen, Attend and Spell). ☆11 | Aug 23, 2019 | Updated 6 years ago
- This is a sample implementation of "TIMERS: Error-Bounded SVD Restart on Dynamic Networks"(AAAI 2018). ☆12 | Jul 4, 2018 | Updated 7 years ago
- Gym wrapper for Vizdoom environments ☆12 | Dec 14, 2018 | Updated 7 years ago
- A Text2Speech Engine built in Pytorch. ☆12 | Dec 9, 2018 | Updated 7 years ago
- Website for LLM360 ☆14 | Jan 22, 2026 | Updated 3 weeks ago
- Paper Review about Speech Recognition · NLP ☆10 | Mar 25, 2021 | Updated 4 years ago
- Code examples for Smaller C, O'Reilly ☆14 | Mar 22, 2021 | Updated 4 years ago
- A library of speech gadgets. ☆14 | Oct 15, 2022 | Updated 3 years ago
- Boundaries and Region Representation Fusion ☆12 | Mar 24, 2023 | Updated 2 years ago
- Experiments and tutorials with and for torchaudio ☆13 | May 7, 2021 | Updated 4 years ago
- TTS front-end ☆11 | Dec 19, 2018 | Updated 7 years ago
- [ACL2023] Source code for Dialogue Summarization with Static-Dynamic Structure Fusion Graph ☆11 | Dec 17, 2023 | Updated 2 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation ☆12 | Jan 27, 2023 | Updated 3 years ago
- Official Repo of "CIBench: Evaluation of LLMs as Code Interpreter" ☆14 | Jul 19, 2024 | Updated last year
- Tacotron2 with BERT examples ☆10 | Jul 8, 2019 | Updated 6 years ago
- Functions for creating speech features in MATLAB. ☆13 | Jul 7, 2020 | Updated 5 years ago
- This is a custom library for data processing, visualization and machine learning tools. ☆14 | Dec 28, 2025 | Updated last month
- Clustering algorithms (Mean shift and K-Means) from scratch in NumPy, PyTorch, TensorFlow, and JAX ☆11 | Oct 3, 2022 | Updated 3 years ago
- Your friendly neighbourhood image resizing API. Resize or aspect, you're cropping with me. ☆11 | Apr 29, 2021 | Updated 4 years ago
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge ☆21 | Jul 25, 2022 | Updated 3 years ago
- Download audioset data super fastly with youtube-dl, ffmpeg and python multiprocessing ☆45 | Aug 1, 2024 | Updated last year
- Edinburgh Speech Tools ☆63 | Jun 10, 2023 | Updated 2 years ago
- Audio WAV file tools for C# read and write, 8 and 16 bits, mono and stereo. ☆11 | Oct 20, 2015 | Updated 10 years ago
- A repository that is a workflow of Anthony Reis's "Writing Interpreters and Compilers for the Raspberry Pi Using Python" ☆10 | Apr 1, 2021 | Updated 4 years ago
- [ECCV2022] Revisiting the Critical Factors of Augmentation-Invariant Representation Learning ☆12 | Aug 3, 2022 | Updated 3 years ago
- Dataset ☆13 | Feb 4, 2021 | Updated 5 years ago
- Demos - approaches to sync with server. ☆25 | Mar 18, 2014 | Updated 11 years ago
- Official PyTorch implementation of (ICME2025 oral) "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-… ☆17 | Feb 1, 2026 | Updated 2 weeks ago
- Code for NAACL 2018 paper "Parsing Speech: A Neural Approach to Integrating Lexical and Acoustic-Prosodic Information" ☆13 | May 6, 2017 | Updated 8 years ago
- PyTorch variational autoencoder, with explanations ☆11 | May 31, 2017 | Updated 8 years ago