epk2112/fairseq_meta_mms_Google_Colab_implementation

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/epk2112/fairseq_meta_mms_Google_Colab_implementation)

epk2112 / fairseq_meta_mms_Google_Colab_implementation

The Code shows How to Transcribe Audio to text using the fairseq_meta_mms (Google Colab Version)👇

☆19

Alternatives and similar repositories for fairseq_meta_mms_Google_Colab_implementation

Users that are interested in fairseq_meta_mms_Google_Colab_implementation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

rajatbansal01 / OIC-2020
View on GitHub
AI and IoT based Smart Parking
☆10Apr 15, 2022Updated 4 years ago
aifenaike / Intent-Recognition-Using-BERT
View on GitHub
Transformer-based Model to recognize any of 7 unique intents from the Snips personal voice assistant.
☆13Mar 11, 2022Updated 4 years ago
ekapolc / Thai_commonvoice_split
View on GitHub
scripts for cleaning and creating train/validation/test splits for Thai commonvoice
☆12Sep 2, 2021Updated 4 years ago
tsunrise / everybody-compose
View on GitHub
Everybody Compose: Deep Beats To Music
☆12Apr 12, 2023Updated 3 years ago
willblaschko / ComfyUI-Unload-Models
View on GitHub
Gives the option to unload one or all models based on memory needs in your flow.
☆28Jun 30, 2024Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
wannaphong / thaigpt-next
View on GitHub
It is fine-tune the GPT-Neo model for Thai language.
☆12Jun 30, 2021Updated 5 years ago
sil-ai / tts-singlish
View on GitHub
TTS for Singlish using Tacotron2, the IMDA corpus, and Pachyderm.
☆11Jan 11, 2020Updated 6 years ago
Layer-norm / ComfyUI-Taiyi
View on GitHub
TaiYiXLCheckpointLoader: An unoffical node support Taiyi-Diffusion-XL(Taiyi-XL) Chinese-English bilingual language model
☆11Sep 1, 2024Updated last year
EnkrateiaLucca / openai_whisper
View on GitHub
Python script for my article and Youtube video on building a streamlit app to use whisper for speech-to-text transcription
☆15Mar 17, 2023Updated 3 years ago
Adibian / Persian-MultiSpeaker-Tacotron2
View on GitHub
Implementation of Transfer Learning from Speaker Verification to Multi-speaker Text-To-Speech Synthesis (SV2TTS) in Persian language.
☆13Oct 2, 2025Updated 9 months ago
mdturp / low-light-image-enhancement
View on GitHub
See what is in the dark fully functioning in the browser.
☆20Jan 29, 2024Updated 2 years ago
dioptx / InhaleSense
View on GitHub
A deep learning approach for respiratory audio discovery and classification.
☆14Sep 30, 2024Updated last year
krolaw / fst
View on GitHub
Free Subliminal Text
☆11Mar 28, 2019Updated 7 years ago
ekapolc / gowajee_corpus
View on GitHub
Thai smart home corpus with "Gowajee" hotword
☆19Jul 30, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
codehz / deno-mtproto
View on GitHub
MTProto for Deno
☆15Jul 6, 2024Updated 2 years ago
struts2spring / sql-editor
View on GitHub
SQL editor is a GUI for SQL. SQL editor is free, open source, Integrated Development Environment(IDE) for working with SQL in SQLite data…
☆12Apr 22, 2022Updated 4 years ago
hegdenaveen1 / WebAR-Demo
View on GitHub
☆10Jun 13, 2021Updated 5 years ago
Lhx94As / E2E-language-diarization
View on GitHub
Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech>
☆19Jan 23, 2022Updated 4 years ago
Lane-G-Smith / ChatGPT-Google-Sheets-Apps-Script
View on GitHub
A simple no-code solution for integrating OpenAI's GPT language models into your google sheets documents using Apps Scripts
☆13Mar 31, 2025Updated last year
rhasspy / glow-speak
View on GitHub
Neural text to speech system that uses eSpeak as a text/phoneme front-end
☆16Oct 20, 2021Updated 4 years ago
mertemin / bill-generator-turkish
View on GitHub
Bill Generator in Turkish
☆14Sep 12, 2014Updated 11 years ago
rhasspy / glow-tts-train
View on GitHub
An implementation of GlowTTS designed to work with Gruut
☆12Mar 9, 2022Updated 4 years ago
BlueSkyXN / jd-scripts-docker
View on GitHub
京东薅羊毛脚本，自动签到，做任务等docker一键启动。有使用上的问题可以加qq群644989387交流。【以上内容为原作者说明】
☆10Feb 8, 2022Updated 4 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
unreal79 / pic2wav
View on GitHub
Encode an image to sound (WAV file) and view it as a spectrogram. Optimized Python 3 version.
☆11Jan 25, 2023Updated 3 years ago
ksm26 / Prompt-Engineering-with-Llama-2
View on GitHub
The course provides guidance on best practices for prompting and building applications with the powerful open commercial license models …
☆14Mar 27, 2024Updated 2 years ago
gyataro / osuT5
View on GitHub
Automatically generate osu! beatmaps with T5 model.
☆11Sep 19, 2023Updated 2 years ago
tihu-nlp / tihudict
View on GitHub
Tihu dictionary for Persian language
☆13Sep 8, 2019Updated 6 years ago
junkka / tesseract-web-box-editor
View on GitHub
Tesseract OCR box file web editor
☆12Jun 22, 2023Updated 3 years ago
de-mh / g2p_fa
View on GitHub
A Grapheme to Phoneme model using LSTM implemented in pytorch
☆14Jul 6, 2022Updated 4 years ago
mrhan1993 / ComfyUI-Fooocus
View on GitHub
☆13Jan 15, 2025Updated last year
genexus-books / gx-super-app
View on GitHub
Design, Architecture and Documentation for conversion of Apps into Super Apps Topics
☆19May 9, 2025Updated last year
edoost / pert
View on GitHub
Persian Ezafe Recognition Using Transformers and Its Role in Part-Of-Speech Tagging
☆10Nov 15, 2021Updated 4 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
zhangp365 / ComfyUI_photomakerV2_native
View on GitHub
☆13Nov 24, 2025Updated 8 months ago
yuryleb / garmin-russian-tts-voices
View on GitHub
Дополнения и исправления для русских TTS-голосов из навигаторов Garmin
☆14Mar 22, 2026Updated 4 months ago
manga109 / panel-order-estimator
View on GitHub
A simple tool to estimate the reading order of comic panels
☆20Nov 14, 2022Updated 3 years ago
smarteasy / open-prompt
View on GitHub
☆21Mar 10, 2026Updated 4 months ago
BUTSpeechFIT / mt-asr-data-prep
View on GitHub
☆25Feb 26, 2026Updated 4 months ago
viyiviyi / comfyui-encrypt-image
View on GitHub
半个 comfyui 图片加密扩展
☆12Sep 24, 2024Updated last year
KoreTeknology / ComfyUI-Nai-Production-Nodes-Pack
View on GitHub
A set of Custom Nodes for Compositing for ComfyUI
☆16Nov 24, 2024Updated last year