The Code shows How to Transcribe Audio to text using the fairseq_meta_mms (Google Colab Version)π
β19May 25, 2023Updated 3 years ago
Alternatives and similar repositories for fairseq_meta_mms_Google_Colab_implementation
Users that are interested in fairseq_meta_mms_Google_Colab_implementation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) projectβ54Jul 9, 2023Updated 2 years ago
- Gives the option to unload one or all models based on memory needs in your flow.β27Jun 30, 2024Updated 2 years ago
- scripts for cleaning and creating train/validation/test splits for Thai commonvoiceβ12Sep 2, 2021Updated 4 years ago
- TaiYiXLCheckpointLoader: An unoffical node support Taiyi-Diffusion-XL(Taiyi-XL) Chinese-English bilingual language modelβ11Sep 1, 2024Updated last year
- Crosshair guidelines for ComfyUI to help align nodes and groups while moving or resizing.β38Apr 28, 2026Updated 2 months ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- XML-RPC version of the Stanford POS taggerβ21Aug 25, 2010Updated 15 years ago
- TTS for Singlish using Tacotron2, the IMDA corpus, and Pachyderm.β11Jan 11, 2020Updated 6 years ago
- β10Jan 10, 2024Updated 2 years ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library πβ21May 20, 2025Updated last year
- Free Subliminal Textβ11Mar 28, 2019Updated 7 years ago
- Implementation of Transfer Learning from Speaker Verification to Multi-speaker Text-To-Speech Synthesis (SV2TTS) in Persian language.β13Oct 2, 2025Updated 9 months ago
- Automated image & video captioning using Qwen-VL, Gemma4 and SAM3.β91Apr 27, 2026Updated 2 months ago
- MTProto for Denoβ15Jul 6, 2024Updated last year
- SQL editor is a GUI for SQL. SQL editor is free, open source, Integrated Development Environment(IDE) for working with SQL in SQLite dataβ¦β12Apr 22, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- β10Jun 13, 2021Updated 5 years ago
- Thai smart home corpus with "Gowajee" hotwordβ19Jul 30, 2023Updated 2 years ago
- See what is in the dark fully functioning in the browser.β20Jan 29, 2024Updated 2 years ago
- A simple no-code solution for integrating OpenAI's GPT language models into your google sheets documents using Apps Scriptsβ13Mar 31, 2025Updated last year
- β13Nov 24, 2025Updated 7 months ago
- Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech>β19Jan 23, 2022Updated 4 years ago
- Bill Generator in Turkishβ14Sep 12, 2014Updated 11 years ago
- Encode an image to sound (WAV file) and view it as a spectrogram. Optimized Python 3 version.β11Jan 25, 2023Updated 3 years ago
- εδΈͺ comfyui εΎηε ε―ζ©ε±β12Sep 24, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Search for pronuncations in different languagesβ11Nov 2, 2024Updated last year
- Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023β57May 7, 2023Updated 3 years ago
- ComfyUI ShadowR Wrapperβ17Feb 21, 2025Updated last year
- My guide to create an italian TTS with Coquiβ14Feb 2, 2022Updated 4 years ago
- Tesseract OCR box file web editorβ12Jun 22, 2023Updated 3 years ago
- A Grapheme to Phoneme model using LSTM implemented in pytorchβ14Jul 6, 2022Updated 3 years ago
- Automatic1111 to InvokeAI prompt resolverβ17Jun 29, 2024Updated 2 years ago
- This node is base on VisualCloze method, A Universal Image Generation Framework via Visual In-Context Learningβ11May 21, 2025Updated last year
- Fine-tuned KoGPT2 chatbot demo with translated PersonaChat (ongoing)β13Apr 17, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Official ComfyUI node for the Unprompted templating language.β12Nov 13, 2024Updated last year
- implementation of paint-by-example on comfyuiβ11Apr 30, 2026Updated 2 months ago
- A set of Custom Nodes for Compositing for ComfyUIβ16Nov 24, 2024Updated last year
- Image inpainting system frontendβ14Jan 31, 2023Updated 3 years ago
- Automatically generate osu! beatmaps with T5 model.β11Sep 19, 2023Updated 2 years ago
- A deep learning approach for respiratory audio discovery and classification.β14Sep 30, 2024Updated last year
- Design, Architecture and Documentation for conversion of Apps into Super Apps Topicsβ19May 9, 2025Updated last year