skrbnv/javad

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/skrbnv/javad)

skrbnv / javad

☆66

Alternatives and similar repositories for javad

Users that are interested in javad are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

iwaterxt / voiceprint
View on GitHub
text-independent speaker identification
☆12Apr 9, 2018Updated 8 years ago
Music-Mix / Mix-Net
View on GitHub
AI cover model for your own voice.
☆34Aug 14, 2024Updated last year
Innovative-Digitale-Medizin-IDM / voxtral-finetune
View on GitHub
This repository contains a fine-tuning script for the transcription task of Mistral's Voxtral model.
☆28Jul 31, 2025Updated 11 months ago
actionpower / google_cloud_storage
View on GitHub
Deno Library to upload files to GCS and obtain signed url
☆11Jan 16, 2024Updated 2 years ago
gaspardpetit / verbatim
View on GitHub
High accuracy code-switching whisper / qwen3 transcription
☆39Jun 17, 2026Updated last month
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Gyeongmin47 / KoCHET-A-Korean-Cultural-Heritage-corpus-for-Entity-related-Tasks
View on GitHub
☆13Nov 30, 2022Updated 3 years ago
Adel-Moumen / fast_sligru
View on GitHub
☆12Mar 24, 2024Updated 2 years ago
adrianlyjak / kokoro-onnx-export
View on GitHub
☆22Apr 29, 2025Updated last year
AmbiqAI / nnse
View on GitHub
NNSE (Neural Network Speech Enhancement) is a speech-denoiser optimized to run on Ambiq's low power platform
☆44Nov 13, 2025Updated 8 months ago
triplet02 / KoNPron
View on GitHub
Convert Numerical Representations to Korean Pronunciation
☆14Apr 20, 2020Updated 6 years ago
tal-z / SoundsLike
View on GitHub
A python package for finding words that sound like other words. Useful for entity resolution and poetry, among other things.
☆15Oct 26, 2022Updated 3 years ago
stevenhillis / awesome-asr-contextualization
View on GitHub
A curated list of awesome papers on contextualizing E2E ASR outputs
☆81May 10, 2023Updated 3 years ago
mogwai / nanodrz
View on GitHub
Speaker Diarization with Transformers
☆70Jun 8, 2025Updated last year
JamesMcGuigan / elasticsearch-faiss-cosine-similarity-search
View on GitHub
Cosine Similary Search in ElasticSearch + FAISS GPU
☆12Mar 24, 2022Updated 4 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
EtienneAb3d / WhisperTimeSync
View on GitHub
Synchronize Whisper's timestamps over an existing accurate transcription
☆165May 28, 2024Updated 2 years ago
thomeou / SALSA-Lite
View on GitHub
This is the public repository for SALSA-Lite features for polyphonic sound event localization and detection using microphone arrays.
☆15Dec 3, 2021Updated 4 years ago
ElJaviLuki / NetLogger
View on GitHub
Xposed module that hooks into various HTTP libraries to log network calls.
☆13Jan 4, 2023Updated 3 years ago
ignaciocervino / ClothingTaggerAI-POC
View on GitHub
On-device iOS clothing tagger powered by MLX-Swift.
☆17Mar 11, 2025Updated last year
i4Ds / whisper-finetune
View on GitHub
This repository contains code for fine-tuning the Whisper speech-to-text model.
☆24Jul 9, 2026Updated 2 weeks ago
i4Ds / whisper-prep
View on GitHub
Data preparation utility for the finetuning of OpenAI's Whisper model.
☆16Jun 18, 2026Updated last month
srepsa / launchr
View on GitHub
☆17Aug 8, 2021Updated 4 years ago
jim-schwoebel / nala_assistant
View on GitHub
🔊😊 A fastapi voice-assistant framework to quickly prototype LLM-powered voice assistants in <5 minutes.
☆31Jan 15, 2024Updated 2 years ago
YunusEmreAlps / Icarus
View on GitHub
Local Action, Global Impact (Selected as Top 50 in the 2022 Solution Challenge.)
☆17Jan 18, 2024Updated 2 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
ina-foss / InaGVAD
View on GitHub
Voice activity detection and speaker gender segmentation audiovisual corpus
☆16Jan 20, 2025Updated last year
ighodgao / mamba-speech-synthesis
View on GitHub
Jupyter Notebook running Mamba speech synthesis example on Determined AI. Based on https://2084.substack.com/p/2084-marcrandbot-speech-sy…
☆23Feb 8, 2024Updated 2 years ago
mipuc / hts-engine-world
View on GitHub
☆17Nov 17, 2020Updated 5 years ago
SSTDV-Project / HF-GAN
View on GitHub
☆11Jan 12, 2026Updated 6 months ago
rfalcon100 / seld_dcase2022_ric
View on GitHub
My system for the DCASE 2022 Task 3 Sound Event Localizaiton and Detection.
☆12Nov 12, 2022Updated 3 years ago
xNul / codestral-mamba-for-vscode
View on GitHub
Use Codestral Mamba with Visual Studio Code and the Continue extension. A local LLM alternative to GitHub Copilot.
☆31Jul 18, 2024Updated 2 years ago
otsaloma / docsets
View on GitHub
Scripts to generate Dash docsets
☆17Jun 21, 2026Updated last month
meyskens / registry-usb
View on GitHub
Hosting a local Docker registry on a USB drive
☆15Aug 19, 2017Updated 8 years ago
tommyscodebase / gemini_chatbot_javascript
View on GitHub
A Javascript Chatbot built with the Gemini AI
☆10Jan 26, 2024Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
noritsuna / JPEGEncoder4Cortex-M
View on GitHub
JPEG Encoder for Cortex-M
☆14Sep 2, 2016Updated 9 years ago
lovemefan / campplus
View on GitHub
A open-source toolkit for single and multi-modal speaker verification from modelscope and funasr with onnx
☆15Dec 16, 2023Updated 2 years ago
NeVerTools / pyNeVer
View on GitHub
A Python library for learning and verification of neural networks and other machine learning models
☆14Sep 18, 2025Updated 10 months ago
iamlemec / bert.cpp
View on GitHub
GGML implementation of BERT model with Python bindings and quantization.
☆57Feb 19, 2024Updated 2 years ago
rfcx / arbimon
View on GitHub
Ecoacoustic analysis platform empowering conservationists to analyze acoustic data and to derive insights about the ecosystem at scale
☆19Jul 20, 2026Updated last week
bghani / xcapi
View on GitHub
xcapi: A Python package for downloading animal sound recordings from xeno-canto API.
☆20May 29, 2026Updated 2 months ago
hodefoting / gedl
View on GitHub
a GEGL based video editor
☆20Aug 14, 2017Updated 8 years ago