MeloSphere/VoiceSubtitle

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/MeloSphere/VoiceSubtitle)

MeloSphere / VoiceSubtitle

这是一个基于 Python 开发的实时语音字幕显示程序，可以将用户的语音实时转换为屏幕上的字幕文本。支持中文和英文识别，适用于 macOS 和 Windows 系统

☆29

Alternatives and similar repositories for VoiceSubtitle

Users that are interested in VoiceSubtitle are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yoongi43 / music_audio_enhancement_conformer
View on GitHub
Implementation of the paper "Exploiting Time-Frequency Conformers for Music Audio Enhancement"
☆14Mar 21, 2025Updated last year
stvlynn / ffmpeg-Dify-Plugin
View on GitHub
an ffmpeg plugin for Dify
☆19Dec 1, 2025Updated 7 months ago
nobmaste / QH_Learning_Resources
View on GitHub
☆12Aug 15, 2025Updated 11 months ago
Vanka0051 / speech_enhancement
View on GitHub
speech enhancement using DNN: [1] Xu, Y., Du, J., Dai, L.R. and Lee, C.H., 2015. A regression approach to speech enhancement based on dee…
☆14Sep 17, 2019Updated 6 years ago
TEDddr / Adap-WTD
View on GitHub
自适应的小波阈值降噪
☆14Aug 11, 2023Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Gloridust / whisper_streaming_CN
View on GitHub
Whisper realtime streaming for long speech-to-text transcription and translation
☆61Apr 9, 2024Updated 2 years ago
shaharpit809 / Speech-Denoising-using-DNN-CNN-and-RNN
View on GitHub
This repository consists of application of Speech Denoising using DNN, CNN (1D and 2D) and RNN (LSTM) in tensorflow.
☆16Jun 15, 2019Updated 7 years ago
Kuaizr / whisperDemo
View on GitHub
录制麦克风或者系统扬声器的声音，并实时翻译，自动纠错
☆16Jan 22, 2023Updated 3 years ago
mukyuuhate / SoundLocation
View on GitHub
基于pynq-z2的声源定位系统
☆14Nov 15, 2020Updated 5 years ago
nglehuy / sasegan
View on GitHub
Self-Attention Generative Adversarial Network for Speech Enhancement using Tensorflow 2
☆16Jan 30, 2021Updated 5 years ago
phecda-xu / FullyCNNSpeechEnhancement
View on GitHub
全卷积网络进行语音降噪
☆18Dec 8, 2021Updated 4 years ago
TechyNilesh / Speech-Enhancement-Noise-Suppression-Using-DTLN
View on GitHub
Speech Enhancement: Tensorflow 2.x implementation of the stacked dual-signal transformation LSTM network (DTLN) for Noise Suppression.
☆18May 1, 2021Updated 5 years ago
z-xr / SSL
View on GitHub
基于matlab的声源定位广义互相关算法的实现
☆16May 6, 2022Updated 4 years ago
bvhari / ComfyUI_SUNoise
View on GitHub
Scaled Uniform Noise for Ancestral & Stochastic samplers and Noisy latent image
☆17Mar 30, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
deependra227 / Real-Time-Audio-Filtering-using-Python
View on GitHub
Platform for Audio Filtering (Digital Filters) in Real-Time using Convolution Theorem and Fast Fourier Transform.
☆13Aug 16, 2021Updated 4 years ago
greengerong / ComfyUI-JanusPro-PL
View on GitHub
JanusPro ComfyUI plugin
☆12Feb 8, 2025Updated last year
YangangCao / SpeechSignalProcessing
View on GitHub
☆20Jun 10, 2019Updated 7 years ago
leeguandong / ComfyUI_FluxLayerDiffuse
View on GitHub
小红书的flux版本的透明图生成（layerdiffuse），支持文生图和图生图
☆18Mar 17, 2025Updated last year
PandoraLS / SpeechEnhancement
View on GitHub
语音增强
☆18Apr 19, 2021Updated 5 years ago
MongooseOrion / Audio_Time_Freq_Process_and_Trans
View on GitHub
FPGA based, Real-time processing of audio, including voiceprint recognition, adaptive noise suppression, et al.
☆17May 8, 2025Updated last year
andrewdalpino / MewZoom
View on GitHub
A family of image super-resolution models with purrfect pixels.
☆15Apr 29, 2026Updated 3 months ago
asagi4 / ComfyUI-NPNet
View on GitHub
https://github.com/xie-lab-ml/Golden-Noise-for-Diffusion-Models for ComfyUI
☆18Dec 10, 2024Updated last year
skeskinen / resemble-denoise-onnx-inference
View on GitHub
Inference of resemble denoiser
☆30Mar 11, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Reithan / negative_rejection_steering
View on GitHub
Extension for Forge-based UIs (Forge, reForge, etc) and ComfyUI to replace CFG with Negative Rejection Steering
☆16May 16, 2026Updated 2 months ago
noisereduce / TorchSpectralGating
View on GitHub
TorchSpectralGate is a PyTorch-based implementation of Spectral Gating, an algorithm for denoising audio signals.
☆27Feb 3, 2024Updated 2 years ago
Slickytail / ComfyUI-RegionalAdaptiveSampling
View on GitHub
Comfyui implementation of Regional Adaptive Sampling, for Flux and HunYuanVideo
☆23Jun 30, 2026Updated 3 weeks ago
AkenoSyuRi / DTLNPytorch
View on GitHub
This is an unofficial Pytorch implementation of the DTLN model repository, which contains denoising and inference code for the DTLN model…
☆23Jun 18, 2023Updated 3 years ago
ooshyun / Speech-Enhancement-Pytorch
View on GitHub
Pytorch Models for Speech Enhancement
☆23Mar 31, 2023Updated 3 years ago
RodrigoSKohl / comfyui-tryoff-anyone
View on GitHub
Node to tryoff clothes
☆23Apr 14, 2025Updated last year
muuda / MUSIC-algorithm-for-circular-microphone-array
View on GitHub
通过单层圆形麦克风阵列采集音频，实现MUSIC算法的声源定位。
☆23Mar 16, 2023Updated 3 years ago
AFun9 / Omnivoice-onnx
View on GitHub
☆18May 13, 2026Updated 2 months ago
feeling-cold / Noise-suppression-and-speech-recognition-systems
View on GitHub
基于傅里叶变换的降噪与基于深度学习的语音识别的多功能系统
☆15Jun 2, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
XiaoyuBIE1994 / DVAE_SE
View on GitHub
(TASLP 2022) Unsupervised speech enhancement using DVAEs
☆23Dec 16, 2024Updated last year
HJH-AILab / ComfyUI_StableAnimator
View on GitHub
ComfyUI nodes for StableAnimator
☆17Apr 24, 2025Updated last year
RahulSajnani / GeoDiffuser
View on GitHub
[WACV 2025, Best Student Paper, Oral] GeoDiffuser: Geometry-Based Image Editing with Diffusion Models
☆22Mar 22, 2025Updated last year
lihaoyun6 / ComfyUI-SegmentAnything3
View on GitHub
ComfyUI SAM3 node based on transformers / 基于 transformers 框架的 SAM3分割节点
☆21Jan 29, 2026Updated 6 months ago
KimDonghwan06 / PARTE_RELEASE
View on GitHub
[ICCV 2025] This repo is an official PyTorch implementation of PARTE: Part-Guided Texturing for 3D Human Reconstruction from a Single Ima…
☆17Sep 19, 2025Updated 10 months ago
Scorpinaus / ComfyUI-DiffusersLoader
View on GitHub
☆21Aug 26, 2024Updated last year
the-hexer / ComfyUI_Auto_Caption
View on GitHub
Using LLM and Joy tag pipeline to tag your image(s folder), it's suitable for train FLUX LoRA and also sdxl. Load images in order!
☆19Oct 24, 2025Updated 9 months ago