MLSpeech/speech_yolo

SpeechYOLO Interspeech 2019

☆45

Alternatives and similar repositories for speech_yolo

Users who are interested in speech_yolo are comparing it to the libraries listed below.

Speech-Lab-IITM / Hindi-ASR-Challenge
🎯 Speech Recognition Challenge by Speech Lab - IIT Madras
☆10 · Nov 5, 2020 · Updated 5 years ago
cat-state / clip_benchmark
clip retrieval benchmark
☆17 · May 4, 2022 · Updated 4 years ago
sarvan0506 / yolo-midas
Combine YOLOv3 with MiDaS with a single Resnext101 backbone for Autonomous Navigation
☆25 · Jan 17, 2021 · Updated 5 years ago
daanzu / wenet_stt_python
☆33 · Nov 27, 2021 · Updated 4 years ago
soskuthy / gamm_strategies
Supplementary materials for "Evaluating generalised additive mixed modelling strategies for dynamic speech analysis"
☆10 · Jan 25, 2021 · Updated 5 years ago
qcri / Arabic_speech_code_switching
The first Dialectal Arabic Code Switching - DACS corpus from broadcast speech. Annotated at the token-level, considering both the linguis…
☆15 · Apr 3, 2022 · Updated 4 years ago
dtreskunov / tiny-kaldi
Kaldi API for Android, Python and Node. Forked from vosk-api with minimal modifications.
☆16 · Nov 14, 2020 · Updated 5 years ago
netankit / AudioMLProject3
Emotion recognition of Speaker's Speech Data. Employ speaker detection classifiers for emotion recognition, a multiclass classification p…
☆16 · Jun 28, 2015 · Updated 11 years ago
asrp / python-espeak
Python C extension for the eSpeak speech synthesizer
☆12 · Jan 23, 2021 · Updated 5 years ago
MoongMoong / MRCG_python
☆10 · Mar 21, 2018 · Updated 8 years ago
VCasecnikovs / Yet-Another-YOLOv4-Pytorch
YOLOv4 Pytorch implementation with all freebies and specials and 15+ more exclusive improvements. Easy to use!
☆132 · Aug 3, 2021 · Updated 4 years ago
aiovine / converse-dataset
Natural language dataset for training a Conversational Recommender System
☆11 · Jul 9, 2019 · Updated 7 years ago
liminxian / Tracklet-Association-Unsupervised-Deep-Learning
Pytorch code for Tracklet Association Unsupervised Deep Learning (TAUDL)
☆16 · Jan 5, 2021 · Updated 5 years ago
wangyu09 / exkaldi-rt
An online speech recognition extension toolkit of Kaldi
☆55 · Jun 23, 2021 · Updated 5 years ago
THUNLP-MT / L2Copy4APE
Learning to Copy for Automatic Post-Editing (EMNLP 2019)
☆11 · May 6, 2021 · Updated 5 years ago
MiviaLab / DENet
This repository contains the code related to the paper 'DENet: a deep architecture for audio surveillance applications'.
☆42 · Jul 23, 2023 · Updated 3 years ago
nsmartinez / WERpp
Calculates the Word Error Rate between two text files
☆20 · Nov 10, 2022 · Updated 3 years ago
maverickjoy / pepper-robot-facedetection-open-domain-answering
Pepper Robot Enhanced Human Interaction
☆14 · Dec 8, 2022 · Updated 3 years ago
Kartikaggarwal98 / Indian_ParallelCorpus
Curated list of publicly available parallel corpus for Indian Languages
☆36 · Jul 15, 2021 · Updated 5 years ago
Shrutii07 / 8051-Programming
Assembly and C codes to interface various components and communication protocols for 8051-microcontroller
☆10 · Apr 27, 2021 · Updated 5 years ago
qiujiali / lattice-rescore
☆16 · Jun 13, 2022 · Updated 4 years ago
KarthikBalakrishnan11 / Object-Counter-using-Opencv-Instance-Segmentation
Object Counter using Opencv Instance Segmentation - Mask R-CNN
☆12 · Aug 3, 2019 · Updated 6 years ago
andi611 / Mockingjay-Speech-Representation
Official Implementation of Mockingjay in Pytorch
☆55 · Jul 6, 2023 · Updated 3 years ago
AppleHolic / PytorchSR
Pytorch based phoneme recognition (TIMIT phoneme classification)
☆35 · Apr 25, 2018 · Updated 8 years ago
zhuzilin / chatgpt-desktop
Desktop version of ChatGPT, support manually set cookie
☆19 · Dec 9, 2022 · Updated 3 years ago
zijin-gu / NeuroGen
Code for paper NeuroGen: activation optimized image synthesis for discovery neuroscience.
☆12 · Sep 24, 2023 · Updated 2 years ago
prmelehan / Speaker-Recognition
Recognizing a speaker using Deep Learning
☆11 · Dec 25, 2017 · Updated 8 years ago
srinivr / kaldi-long-audio-alignment
Long audio alignment using Kaldi
☆23 · Apr 22, 2021 · Updated 5 years ago
aws-samples / content-based-item-recommender
☆10 · Apr 2, 2024 · Updated 2 years ago
Tossy0423 / darknet_ros
YOLO ROS: Real-Time Object Detection for ROS
☆21 · Sep 20, 2023 · Updated 2 years ago
CVxTz / COLA_pytorch
COLA contrastive pre-training method implemented in PyTorch
☆44 · Jan 27, 2021 · Updated 5 years ago
emckiernan / electrophys
Electrophysiology practicals for undergraduate students
☆13 · Mar 8, 2021 · Updated 5 years ago
HiPatil / real-time-digit-recognition
☆10 · Jan 23, 2020 · Updated 6 years ago
ShigekiKarita / espnet-semi-supervised
ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tre…
☆38 · Feb 13, 2020 · Updated 6 years ago
axellkir / S3VAE
☆12 · Mar 23, 2021 · Updated 5 years ago
igormq / ctcdecode-pytorch
Python implementation of CTC beam search decoder + agnostic LM scorer
☆20 · Dec 16, 2020 · Updated 5 years ago
er537 / whisper_interpretability
A repo to do interpretability of pre-trained acoustic models
☆15 · Oct 15, 2023 · Updated 2 years ago
Sreyan88 / Disfluency-Detection-with-Span-Classification
This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…
☆14 · Jun 6, 2023 · Updated 3 years ago
witko0 / kaldifordummies
Simple automatic speech recognition system based on digits corpora (Polish language), created in Kaldi toolkit. Despite of the language d…
☆11 · May 29, 2016 · Updated 10 years ago