kehanlu / Prompt-Whisper

☆10

Alternatives and similar repositories for Prompt-Whisper

Users who are interested in Prompt-Whisper are comparing it to the repositories listed below.


Alfred0622 / HypR
A benchmark corpus for ASR hypothesis revising task
☆21 Sep 26, 2023 Updated 2 years ago
kehanlu / server-monitor
A light webserver for monitoring RAM and GPU usage on multiple servers.
☆21 Mar 31, 2021 Updated 4 years ago
kehanlu / python
NTUST Programming Club, spring 2019
☆25 May 28, 2019 Updated 6 years ago
kehanlu / University
NTUST university-merger helper 🍡
☆13 Apr 21, 2023 Updated 2 years ago
rithiksachdev / PostASR-Correction-SLT2024
☆17 Jul 22, 2024 Updated last year
kehanlu / Mandarin-Wav2Vec2
Pre-trained Wav2vec2.0 for Mandarin
☆43 Oct 30, 2022 Updated 3 years ago
jasonppy / PromptingWhisper
Prompting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation
☆150 Jan 16, 2024 Updated 2 years ago
AVGRadmin / Deep-Live-Cam-Multi-Language
Real-time face swap and one-click video deepfake with only a single image
☆11 Sep 13, 2024 Updated last year
nexuslrf / GestureWave
A hand-gesture recognition system using the Doppler effect of ultrasound.
☆11 Mar 2, 2019 Updated 6 years ago
CiscoDevNet / g2p_seq2seq_pytorch
Grapheme to phoneme model for PyTorch
☆43 Jul 21, 2022 Updated 3 years ago
leke-adewa / short-video-maker
Create short vertical videos for TikTok, YouTube Shorts, and Instagram Reels using AI. Fully automated pipeline with traceability. 🚀🎥
☆16 Updated this week
sakusaku3939 / YoloLSTM
[ACM MobiSys 2024 Demo] Image-based Indoor Localization using Object Detection and LSTM
☆11 Jun 18, 2025 Updated 7 months ago
carlosabalde / mobiledetect2vcl
Python script to transform the Mobile Detect JSON database into a UA-based mobile detection VCL subroutine easily integrable in any Varn…
☆14 Nov 13, 2023 Updated 2 years ago
Handwritten-Equation-Solver / Handwritten-Equation-Solver
An application to solve handwritten mathematical equations using deep learning algorithms.
☆13 Apr 8, 2018 Updated 7 years ago
EnVision-Research / FractFlow
☆25 Jul 28, 2025 Updated 6 months ago
Dslab-NLP / Tibetan-PLM
☆17 Oct 8, 2023 Updated 2 years ago
felixbur / Speechalyzer
Label and annotate large numbers of speech data files
☆12 May 5, 2021 Updated 4 years ago
fbinput / Tibetan-transliteration
Tibetan Wylie transliteration
☆11 Jul 19, 2016 Updated 9 years ago
yousufkalim / mediapipe-pose-smooth
This library removes the jitter and smooths the landmarks coming from Mediapipe
☆13 Jan 16, 2023 Updated 3 years ago
muhd-umer / deep-suppressor
DeepSuppressor: A deep learning-based approach to speech denoising
☆12 Dec 23, 2023 Updated 2 years ago
iamcam / ai-wordpress-rag-demo
This small project demonstrates how to integrate WordPress blog entries into queries for a RAG-based (Retrieval-Augmented Generation) lan…
☆11 Apr 2, 2024 Updated last year
ncclabsustech / BI-AI-2022-Lecture-Notes
SUSTech graduate course BME5012: Human Brain Intelligence and Machine Intelligence, fall 2022
☆10 Dec 12, 2022 Updated 3 years ago
linh-nh / gstreamer-jitsi-meet
Jitsi Meet video call with GStreamer
☆11 Nov 25, 2021 Updated 4 years ago
riaqn / python-nvidia-codec
Pythonic Nvidia Codec Library
☆17 Aug 3, 2022 Updated 3 years ago
sky24h / Free-View_Expressive_Talking_Head_Video_Editing
Code for the paper "Free-View Expressive Talking Head Video Editing" (ICASSP 2023)
☆12 May 26, 2024 Updated last year
nervjack2 / Speech2Unit
☆13 Sep 25, 2024 Updated last year
jaimevera1107 / tempdisagg
A Python library for temporal disaggregation of time series data
☆21 May 1, 2025 Updated 9 months ago
kehanlu / DeSTA2
Code and model for ICASSP 2025 Paper "Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data"
☆120 Jul 15, 2025 Updated 6 months ago
Musawer1214 / Fight-Violence-detection-yolov8
A trained YOLOv8 model that detects fights or violence versus non-violence in videos
☆12 Sep 20, 2024 Updated last year
andrewrproper / pandoc-folder
Run pandoc on all matching files in a folder, to create one output document.
☆11 Aug 29, 2022 Updated 3 years ago
s920128 / NAR-BERT-ASR
NAR-BERT-ASR
☆10 Sep 27, 2021 Updated 4 years ago
jjmlovesgit / SadTalker2
Gradio_demo.py with Blinking on Still Mode Video Creation
☆12 Jun 21, 2023 Updated 2 years ago
yingrui / my-own.fun
Build your own frontend AI agent with Chrome
☆13 Jan 16, 2026 Updated 3 weeks ago
g-farrow / soft_phone
Python library for automated phone call testing using PJSIP
☆10 Aug 24, 2017 Updated 8 years ago
harsh19 / TRUCE
Truth-Conditional Captions for Time Series Data. EMNLP 2021. Harsh Jhamtani, Taylor Berg-Kirkpatrick
☆13 Feb 9, 2022 Updated 4 years ago
homer6 / url
C++17 URL Parser (RFC 3986 compliant)
☆11 Jan 21, 2022 Updated 4 years ago
helios-h2020 / h.extension-MediaStreaming-WebTorrent
Showcase of P2P HLS streaming using WebTorrent
☆12 May 5, 2021 Updated 4 years ago
azzaouiyazid / Adaptive-Comb-Filtering-Algorithm-for-Harmonic-Signal-Enhancement
An adaptive comb filtering algorithm for the enhancement of harmonic signals in the presence of additive white noise. The algorithm impro…
☆14 Jan 10, 2023 Updated 3 years ago
Berkeley-Speech-Group / DysfluentWFST
DysfluentWFST
☆17 Nov 13, 2025 Updated 3 months ago
