chitholian / AI-Search-Algorithms
This is an educational repository containing implementations of several search algorithms used in artificial intelligence.
☆26Updated 6 years ago
Alternatives and similar repositories for AI-Search-Algorithms
Users who are interested in AI-Search-Algorithms are comparing it to the repositories listed below
- Simple Telegram bot to annotate and verify automatic speech recognition datasets☆12Updated 4 years ago
- A neural vocoder supporting Ring Attention, Conformer, and NSF.☆21Updated last month
- Transformer based Bangla Speech Recognition | Encoder Decoder Architecture☆53Updated 2 years ago
- Official Repository of the Deep Diacritization Paper☆16Updated 4 years ago
- Dippy Synthetic Speech Subnet☆17Updated 2 weeks ago
- The Vokan Architecture (Tsukasa speech based)☆10Updated 7 months ago
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…☆32Updated 4 years ago
- This python module is an easy-to-use port of the text normalization used in the paper "Not low-resource anymore: Aligner ensembling, batc…☆34Updated last year
- A collection of all our phonemizers for dataset construction and inference☆26Updated 7 months ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆38Updated last year
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.☆18Updated 10 months ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated 2 years ago
- Dataset Release for Intent Classification from Speech☆48Updated 7 months ago
- ONNX-compatible StyleTTS2 code☆13Updated 3 months ago
- This project performs speaker diarization for the Hindi language.☆49Updated 4 years ago
- Text to Speech for Indic languages☆51Updated 3 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- Turkish Speech Recognition using Facebook's Wav2vec 2.0 models☆30Updated 3 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆28Updated last year
- A package for NeuCodec: a 50hz, 0.8kbps, 24kHz audio codec.☆67Updated this week
- ☆11Updated 3 years ago
- Github repository for ACL 2025 paper: VoxEval: Benchmarking the Knowledge Understanding Capabilities of End-to-End Spoken Language Models☆20Updated 3 months ago
- Understanding and Tackling Hallucinations in Large Audio-Language Models | ICASSP 2025, Interspeech 2024☆28Updated 6 months ago
- ☆49Updated 2 years ago
- Collection of scripts from mHuBERT-147.☆30Updated 10 months ago
- Mispronunciation detection code for jingju singing voice☆20Updated 7 years ago
- TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages☆17Updated last year
- An implementation of the paper titled "Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset" https://…☆12Updated 3 years ago
- ☆60Updated last year
- ☆17Updated 4 years ago