TransWithAI/slam-asr

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/TransWithAI/slam-asr)

TransWithAI / slam-asr

A pytoch lightning training implementation of SLAM-ASR

☆11

Alternatives and similar repositories for slam-asr

Users that are interested in slam-asr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jiamingkong / slam_asr_pytorch
View on GitHub
Re-implementation of SLAM-ASR paper's experiment, using Phi-2 and Hubert
☆22Jun 14, 2024Updated 2 years ago
fclearner / Personal-vad-2.0
View on GitHub
Implementation of "Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition"
☆16Jun 9, 2026Updated last month
Cryolite / mjai
View on GitHub
Standardization Project for mjai Format Specification
☆14Aug 28, 2024Updated last year
yuhangear / kaldi-android
View on GitHub
☆15Nov 5, 2021Updated 4 years ago
stdKonjac / DeepComplexCRN
View on GitHub
☆13Mar 22, 2021Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
fema-ffrd / rashdf
View on GitHub
Read data from HEC-RAS HDF files.
☆18Mar 25, 2026Updated 4 months ago
hfellerhoff / openipa-old
View on GitHub
A free, fast, community-focused transcription tool to transcribe texts in Latin, French, German, and Italian into IPA.
☆11Feb 10, 2022Updated 4 years ago
fireredchat-submodules / livekit-plugins-fireredchat-pvad
View on GitHub
FireRedChat pVAD plugin for LiveKit Agents
☆22Sep 16, 2025Updated 10 months ago
neocl / jamdict-web
View on GitHub
Japanese Reading Assistant with morphological analyser, Japanese-English dictionary, Kanji dictionary, and Japanese Names dictionary
☆13Jun 5, 2021Updated 5 years ago
jo2lxq / wafl
View on GitHub
Code Space for Wireless Ad Hoc Federated Learning (WAFL) -- A Fully Autonomous Collaborative Learning with Device-to-Device Communication
☆20Apr 10, 2026Updated 3 months ago
HolgerBovbjerg / SSL-PVAD
View on GitHub
A repository for code used to produce the results the ICASSP 2024 paper: "SELF-SUPERVISED PRETRAINING FOR ROBUST PERSONALIZED VOICE ACTIV…
☆25Nov 25, 2024Updated last year
JarodMica / chatterbox
View on GitHub
SoTA open-source TTS
☆26Jul 8, 2025Updated last year
FlyA-official / FlyA-Agent
View on GitHub
☆16Apr 30, 2026Updated 2 months ago
OpenTSLab / TimeOmni
View on GitHub
[ICLR 2026] Official implementation of SciTS: Scientific Time Series Understanding and Generation with LLMs
☆17Mar 3, 2026Updated 4 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
Fat-pig-Cui / misc-code
View on GitHub
个人与雀魂有关的杂项文件与专栏 (另见: https://github.com/Fat-pig-Cui/majsoul-replay-editor)
☆17Jan 14, 2026Updated 6 months ago
LlmKira / wd14-tagger-server
View on GitHub
✨ waifu-diffusion tagger server / onnx | wd-tagger as api service
☆21Feb 20, 2025Updated last year
VijayLingam95 / SVFT
View on GitHub
☆35Feb 10, 2025Updated last year
ttlabtuat / SingLEM
View on GitHub
Implementation and pretrained model for the SingLEM paper.
☆15Jul 15, 2026Updated 2 weeks ago
benmaier / python-fisheye
View on GitHub
Transform single points or arrays of points using several fisheye functions.
☆11Aug 31, 2018Updated 7 years ago
Wolfda95 / MIRP_Benchmark
View on GitHub
MICCAI 25 Publication: Your other Left! Vision-Language Models Fail to Identify Relative Positions in Medical Images
☆15May 11, 2026Updated 2 months ago
pprablanc / ppsrt
View on GitHub
A python algorithm to change the pitch of the voice in real time
☆13Dec 13, 2020Updated 5 years ago
zhong-yy / volpick
View on GitHub
This repository contains the final models and the code to reproduce the model (downloading waveforms, formatting data into seisbench form…
☆16Jul 1, 2026Updated 3 weeks ago
DragonMeow1012 / DragonMeow-MangaTranslator
View on GitHub
漫畫圖片一鍵翻譯：偵測→OCR→LLM翻譯→抹字→嵌字，內建 localhost 網頁端，解壓即用。支援 Gemini/ChatGPT/Claude/Grok/DeepSeek 等多家 API。
☆67Jul 20, 2026Updated last week
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
winktzhong / PocketMahjongClient
View on GitHub
口袋麻将全集客户端源码
☆21Mar 25, 2024Updated 2 years ago
voidful / asr-trainer
View on GitHub
one script for xls-r/xlsr/whisper fine-tuning
☆42Jun 29, 2023Updated 3 years ago
kvablack / nlg-gan
View on GitHub
An attempt to use a Generative Adversarial Network (GAN) for natural language generation.
☆15Jul 24, 2018Updated 8 years ago
eipm / bridge2ai-redcap
View on GitHub
Bridge2AI Voice | REDCap
☆16Jul 21, 2026Updated last week
msalhab96 / MultiSpeech
View on GitHub
pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper
☆21Jun 23, 2022Updated 4 years ago
chatdesk / grouphug
View on GitHub
Multi-task modelling extensions for huggingface transformers
☆21Mar 3, 2023Updated 3 years ago
xinyebei / 2024_finvcup_baseline
View on GitHub
2024 FinVolution Global Data Science Competition-9th baseline
☆20May 17, 2024Updated 2 years ago
amitness / ollama-remote
View on GitHub
Access Ollama via remote servers with tunneling
☆28Feb 16, 2025Updated last year
futz12 / uie_ncnn_windows
View on GitHub
UIE(Universal Information Extraction) infer by ncnn
☆15Sep 22, 2024Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
markusressel / fan2go-tui
View on GitHub
Terminal UI for fan2go.
☆37Jul 20, 2026Updated last week
speechLabBcCuny / EDANSA
View on GitHub
The Ecoacoustic Dataset from Arctic North Slope Alaska
☆12Jul 22, 2026Updated last week
Equim-chan / tensoul
View on GitHub
Convert MahjongSoul log into tenhou.net/6 format
☆36Nov 11, 2024Updated last year
PINTO0309 / onnx-aec
View on GitHub
A playground for experimenting with acoustic echo cancellation using a microphone, speaker, and ONNX.
☆13Oct 22, 2024Updated last year
ChristianFeldmann / LibFFmpeg
View on GitHub
LibFFmpeg++ is a C++ wrapper that can load the shared FFmpeg libraries in almost all versions on different platforms.
☆13Feb 20, 2026Updated 5 months ago
PINTO0309 / tflite-input-output-rewriter
View on GitHub
This tool displays tflite signatures and rewrites the input/output OP name to the name of the signature. There is no need to install Tens…
☆14Dec 13, 2023Updated 2 years ago
xiaoyangdu22 / QiandaoEar22
View on GitHub
☆20Mar 21, 2024Updated 2 years ago