guglielmocamporese/learning_invariances_in_speech_recognition

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/guglielmocamporese/learning_invariances_in_speech_recognition)

guglielmocamporese / learning_invariances_in_speech_recognition

In this work I investigate the speech command task developing and analyzing deep learning models. The state of the art technology uses convolutional neural networks (CNN) because of their intrinsic nature of learning correlated represen- tations as is the speech. In particular I develop different CNNs trained on the Google Speech Command Dataset…

☆20

Alternatives and similar repositories for learning_invariances_in_speech_recognition

Users that are interested in learning_invariances_in_speech_recognition are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

andi611 / Conditional-SpecGAN-Tensorflow
View on GitHub
Text-to-Speech Synthesis by Generating Spectrograms using Generative Adversarial Network
☆10Dec 12, 2018Updated 7 years ago
JaesungBae / Speech-Command-Recognition-with-Capsule-Network
View on GitHub
Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.
☆25Jan 28, 2019Updated 7 years ago
wikke / AudioRecognition
View on GitHub
Google Speech Command Dataset Classification Neural Network, CNN, RNN
☆26Aug 29, 2017Updated 8 years ago
kjcodeacct / golang_flutter_demo
View on GitHub
This was for a DevICT (http://devict.org) presentation given July, 2020.
☆13Jul 9, 2020Updated 6 years ago
mariegold / NP-Attack
View on GitHub
☆10Mar 22, 2022Updated 4 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
idiap / apam
View on GitHub
APAM toolkit is built on PyTorch and provides recipes to adapt pretrained acoustic models with a variety of sequence discriminative train…
☆14Feb 15, 2021Updated 5 years ago
RanyaJumah / EDGY
View on GitHub
Privacy-preserving Voice Analysis via Disentangled Representations
☆12Aug 30, 2021Updated 4 years ago
PowerDNS / ipcipher
View on GitHub
ipcipher is a specification for encrypting IP{v4,v6} addresses 'in place'.
☆20Mar 28, 2018Updated 8 years ago
harujoh / TensorFlowLiteNet
View on GitHub
TensorFlowLiteNet allows to use TensorFlowLite from C#.
☆11Apr 14, 2021Updated 5 years ago
nealwon / go-flutter-plugin-sqlite
View on GitHub
try implement go version sqlite channel for flutter sqflite plugin[desktop only]
☆16Jul 16, 2020Updated 6 years ago
nec-research / st_tau
View on GitHub
This repository contains code for the paper "Uncertainty Estimation and Calibration with Finite-State Probabilistic RNNs" (Wang, Lawrence…
☆17Mar 8, 2021Updated 5 years ago
ankane / torchaudio-ruby
View on GitHub
An audio library for Torch.rb
☆21Jun 29, 2026Updated 3 weeks ago
ngr900 / constellation
View on GitHub
A simple JavaScript plugin that creates beautiful, interactive canvas backgrounds
☆15Jan 27, 2019Updated 7 years ago
sdhayalk / TensorFlow_Speech_Recognition_Challenge
View on GitHub
Implemented 3 neural network architectures: 1) Combination of RNN LSTM nodes and CNN, 2) CNN with residual blocks similar to ResNet, 3) D…
☆25Jan 19, 2018Updated 8 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Hamza5 / Pipeline-diacritizer
View on GitHub
Automatic Arabic diacritics restoration tool.
☆19Aug 12, 2021Updated 4 years ago
pchampio / Caddyr
View on GitHub
Directory theme for caddy caddyfile
☆17Jan 23, 2017Updated 9 years ago
colszowka / rack-fontserve
View on GitHub
Sinatra app for serving web fonts easily with proper caching and access-control headers
☆15Sep 5, 2011Updated 14 years ago
feerci / feerci
View on GitHub
FEERCI: A Package for Fast non-parametric confidence intervals for Equal Error Rates
☆12Mar 13, 2024Updated 2 years ago
uyghur-language / uyghur-language.github.io
View on GitHub
☆13Jul 12, 2026Updated last week
UyCode / uyfonts
View on GitHub
there are UKIJ and Uighursoft fonts
☆13Oct 21, 2022Updated 3 years ago
echocatzh / Demo-of-DeepComplexAEC
View on GitHub
☆11Jun 15, 2022Updated 4 years ago
azmat21 / UyghurTextResource
View on GitHub
uyghur text resource crawled from website
☆12Dec 25, 2015Updated 10 years ago
hcy71o / MB-iSTFT-VITS-with-AutoVocoder
View on GitHub
Incorporating AutoVocoder to MB-iSTFT-VITS
☆47Dec 1, 2022Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
lucadellalib / ts-asr
View on GitHub
Target speaker automatic speech recognition (TS-ASR)
☆14Oct 14, 2023Updated 2 years ago
daanzu / wenet_stt_python
View on GitHub
☆33Nov 27, 2021Updated 4 years ago
maverick0122 / QueryByHumming
View on GitHub
A query by humming system based on locality sensitive hashing indexes
☆12May 8, 2014Updated 12 years ago
fanlu / wenet
View on GitHub
Transformer based ASR Engine.
☆13Aug 23, 2021Updated 4 years ago
benshaaw / revision
View on GitHub
Study materials created by myself for revision for University of Edinburgh's School of Informatics' exams.
☆12Apr 25, 2018Updated 8 years ago
cvqluu / TDNN
View on GitHub
Time delay neural network (TDNN) implementation in Pytorch using unfold method
☆204Nov 21, 2019Updated 6 years ago
pchampio / othello-prolog
View on GitHub
A fully functional Othello (Reversi) game, with several AIs, made in prolog for swipl.
☆17Apr 26, 2024Updated 2 years ago
dotmilk / emacs-crystal-mode
View on GitHub
Maintained at https://github.com/crystal-lang-tools/emacs-crystal-mode
☆17Nov 7, 2017Updated 8 years ago
UlugbekSalaev / UzTransliterator
View on GitHub
UzTransliterator | State-of-the-art machine transliteration tool for Uzbek language
☆13Jan 6, 2026Updated 6 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
akaihola / darglint2
View on GitHub
A python documentation linter which checks that the docstring description matches the definition. Based on darglint by @terrencepreilly.
☆26Apr 1, 2024Updated 2 years ago
emoncms / dashboard
View on GitHub
dashboard module for emoncms
☆14Jun 9, 2026Updated last month
148nasuka / Vocal2lab
View on GitHub
NNSVS向けの教師データのラベル作成支援ツールです。
☆10Apr 5, 2023Updated 3 years ago
LvHang / pitch
View on GitHub
a standalone pitch extractor
☆13Oct 19, 2017Updated 8 years ago
vivianngo97 / Punctuation_Transcription
View on GitHub
A punctuation transcription model to automatically add punctuation marks in an unpunctuated sentence or sentences.
☆15Aug 6, 2020Updated 5 years ago
gheyret / UyghurNgram
View on GitHub
Make N-Gram for Uyghur language
☆15Dec 24, 2020Updated 5 years ago
x893 / ProScan
View on GitHub
OBD Scan Tool .NET 2.0
☆13Oct 5, 2015Updated 10 years ago