fabio-sim/Fast-SeamlessM4T-ONNX

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/fabio-sim/Fast-SeamlessM4T-ONNX)

fabio-sim / Fast-SeamlessM4T-ONNX

ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation

☆43

Alternatives and similar repositories for Fast-SeamlessM4T-ONNX

Users that are interested in Fast-SeamlessM4T-ONNX are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

fabio-sim / DocShadow-ONNX-TensorRT
View on GitHub
ONNX-compatible DocShadow: High-Resolution Document Shadow Removal. Supports TensorRT 🚀
☆25Sep 13, 2023Updated 2 years ago
ahmedssabir / Belief-Revision-Score
View on GitHub
Belief Revision based Caption Re-ranker with Visual Semantic Information. COLING 2022
☆11Apr 13, 2025Updated last year
funcwj / pydecoder
View on GitHub
A python wrapper for kaldi-online-decoder using Cython
☆12Sep 1, 2017Updated 8 years ago
skysbird / g2p-zh-en
View on GitHub
Chinese and English Bilinguish G2P
☆22Jul 16, 2023Updated 3 years ago
ngovinhtn / JaViCorpus
View on GitHub
☆16Aug 23, 2022Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
madhu1995-oss / Pronunciation-and-Fluency-evaluation-using-machne-learning-and-DeepLearning
View on GitHub
☆13Apr 9, 2021Updated 5 years ago
wavelandspeech / wavelandspeech.github.io
View on GitHub
https://wavelandspeech.github.io/
☆10Jan 12, 2024Updated 2 years ago
rithiksachdev / PostASR-Correction-SLT2024
View on GitHub
☆18Jul 22, 2024Updated 2 years ago
SoonSYJ / fawasr
View on GitHub
FunASR安卓端侧离线版本2pass全模式
☆15Sep 4, 2023Updated 2 years ago
npuichigo / ttsflow
View on GitHub
tensorflow speech synthesis c++ inference for voicenet
☆16Mar 29, 2019Updated 7 years ago
Yanik39 / TORNet
View on GitHub
Full featured web server for TOR Hidden Services with Vanguards, NGINX, PHP-FPM, MariaDB, NYX, Supervisor and dnsmasq. One Container for …
☆13Aug 12, 2022Updated 3 years ago
lallubharteja / KWS-Scripts
View on GitHub
Keyword Search Recipe for Subword ASR
☆30Jul 12, 2019Updated 7 years ago
helixml / chat-widget
View on GitHub
An embeddable widget for interacting with openAI api compatable LLM's
☆15Sep 18, 2024Updated last year
ZillaRU / ChatTTS-ONNX
View on GitHub
ChatTTS is a generative speech model for daily dialogue.
☆14Oct 21, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
wslyyy / LinguaDB
View on GitHub
一个类似llama_index的极简GO版本框架(Qdrant + embedding + openai + gin)，本地知识库QA应用后端
☆11Jul 12, 2024Updated 2 years ago
andrew-fennell / CogNative
View on GitHub
Translated vocal synthesis - Clone a voice and output speech in another language
☆26May 3, 2022Updated 4 years ago
zhangnn520 / digitalAvatarRealtime
View on GitHub
基于DINet的推理服务，推理视频流和视频
☆17Nov 8, 2023Updated 2 years ago
voidful / ipa2
View on GitHub
Tools for convert Text to IPA in python
☆19Feb 11, 2023Updated 3 years ago
lovemefan / paraformer.cpp
View on GitHub
Port of Funasr's Paraformer model in C/C++
☆43Jun 19, 2024Updated 2 years ago
vivek-nexus / listen-v4
View on GitHub
A text to speech web application that speaks word, sentences or even long articles in a music player like interface.
☆10Feb 15, 2025Updated last year
divvun / giellakbd-android
View on GitHub
A fork of LatinIME (by Google for Android), targeting marginalised languages that also deserve first-class status on mobile operating sys…
☆14Jul 3, 2026Updated 3 weeks ago
yuwchen / MultiPA
View on GitHub
☆21Jun 25, 2026Updated last month
mispchallenge / MISP-2023-Challenge-Baseline
View on GitHub
☆25Jan 2, 2024Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
SaltyAom / nhql
View on GitHub
Unofficial GraphQL Reverse Proxy Server for nHentai written in Rust
☆12May 8, 2022Updated 4 years ago
CoEDL / vad-sli-asr
View on GitHub
A pipeline to isolate and transcribe one language in mixed-language speech
☆20Oct 25, 2022Updated 3 years ago
doheejin / SB_loss_PA
View on GitHub
This repository is the implementation of the paper, "Score-balanced Loss for Multi-aspect Pronunciation Assessment" (Interspeech 2023).
☆22Apr 29, 2024Updated 2 years ago
guardrails-ai / validator-template
View on GitHub
A test validator repo that includes just the regex validator
☆15Mar 3, 2026Updated 4 months ago
liyunlongaaa / AD-TUNING
View on GitHub
AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th…
☆11Feb 23, 2024Updated 2 years ago
VKW2021 / kaldi-baseline
View on GitHub
kaldi cnn-tdnnf baseline
☆13Aug 31, 2021Updated 4 years ago
jecktor / kolabr
View on GitHub
Real-time collaborative kanban board web application.
☆16Mar 20, 2024Updated 2 years ago
prinshul / tensorsensor
View on GitHub
☆21Oct 7, 2020Updated 5 years ago
anqorithm / image-processing-service-async
View on GitHub
This repository contains an asynchronous image processing service built using Golang, Asynq, Redis, Fiber and Docker Compose for easy dep…
☆10Dec 13, 2023Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
spkgyk / RTFS-Net
View on GitHub
Official code release for "RTFS-Net: Recurrent time-frequency modelling for efficient audio-visual speech separation", accepted ICLR 2024
☆51Oct 14, 2025Updated 9 months ago
MarceloSancinetti / epa-gop-pykaldi
View on GitHub
☆25Jun 14, 2022Updated 4 years ago
alikaratana / SpeakerRecognition
View on GitHub
Text-Dependent Speaker Recognition System with Machine Learning Techniques
☆10Dec 31, 2017Updated 8 years ago
vadimkantorov / inferspeech
View on GitHub
PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant
☆10Aug 12, 2019Updated 6 years ago
InBrowserApp / uuid.inbrowser.app
View on GitHub
🆔 UUID InBrowser.App is a tool to generate and decode UUIDs. Fully runs in your browser, no data is sent to the server. Fast, secure, an…
☆10Nov 13, 2023Updated 2 years ago
carl03q / AudioClassifier
View on GitHub
A CNN audio classifier via spectrogram images.
☆10Jul 21, 2017Updated 9 years ago
atomicoo / chn_text_norm
View on GitHub
Chinese text normalization. 中文文本规范化。
☆60May 3, 2021Updated 5 years ago