sidhantls/lexpod-speaker-prediction

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sidhantls/lexpod-speaker-prediction)

sidhantls / lexpod-speaker-prediction

Speaker prediction for captions on the Lex Fridman podcast

☆26

Alternatives and similar repositories for lexpod-speaker-prediction

Users that are interested in lexpod-speaker-prediction are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

MrEdwards007 / WhisperTaskAcceleration
View on GitHub
Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization
☆25Oct 29, 2022Updated 3 years ago
LinguisticAnomalies / harmonized-toolkit
View on GitHub
Toolkit for Reproducible Execution of Speech, Text and Language Experiments
☆10Mar 24, 2026Updated 4 months ago
ATLTVHEAD / Atltvhead-Gesture-Recognition-Bracer
View on GitHub
Atltvhead Gesture Recognition Bracer - A TensorflowLite gesture detector for the atltvhead project and for exploration into Data Science
☆18Feb 2, 2021Updated 5 years ago
calhounpaul / GPT-NeoX-20B-8bit-inference
View on GitHub
☆13Jun 12, 2024Updated 2 years ago
yassinchabeb / voice-IT
View on GitHub
Playing around drones with Android's Speech-to-text & Text-to-Speech; Setting up a Wake-up-word other than OK Google, and trying to match…
☆12Apr 4, 2019Updated 7 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
mave5 / podalize
View on GitHub
Podalize: Podcast Transcription and Analysis
☆157Sep 8, 2024Updated last year
jumon / zac
View on GitHub
Zero-shot Audio Classification using Whisper
☆79Dec 12, 2022Updated 3 years ago
egorsmkv / asr-corpus-creator
View on GitHub
This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.
☆27Feb 15, 2024Updated 2 years ago
cybertronai / bflm
View on GitHub
☆17Jun 8, 2019Updated 7 years ago
bjnortier / whisper-tflite-ios
View on GitHub
☆19Nov 4, 2022Updated 3 years ago
nii-yamagishilab / speaker_sex_attribute_privacy
View on GitHub
Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE
☆15Nov 30, 2022Updated 3 years ago
Dom3442 / leafonlysam
View on GitHub
☆11Dec 6, 2024Updated last year
skakouros / s3prl_attentive_correlation
View on GitHub
Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit
☆13Nov 18, 2022Updated 3 years ago
Hannes1 / react-native-wenet
View on GitHub
Wenet speech to text for react native
☆10Nov 1, 2022Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
awni / future_speech
View on GitHub
The History of Speech Recognition to the Year 2030
☆13Aug 14, 2021Updated 4 years ago
ltillmann / pdf-redactor
View on GitHub
CLI tool to easily redact sensitive information from PDF files.
☆16Mar 12, 2026Updated 4 months ago
miccio-dk / NISQA
View on GitHub
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Apr 13, 2022Updated 4 years ago
FantSun / Speechflow
View on GitHub
Speechflow for emotion recognition related information decomposition
☆10Jul 27, 2021Updated 5 years ago
katzurik / Knowledge_Navigator
View on GitHub
☆19Mar 4, 2025Updated last year
vatsalsaglani / local-diagramgpt
View on GitHub
A narrow implementation of DiagramGPT for generating system architecture diagrams with local LLM models and Llama.cpp
☆27May 28, 2024Updated 2 years ago
raminnakhli / AMIGO
View on GitHub
This is the pytorch\DGL implementation of the AMIGO paper.
☆10Feb 6, 2024Updated 2 years ago
mallorbc / GPTNeoX20B_HuggingFace
View on GitHub
☆22Apr 6, 2023Updated 3 years ago
ryanrudes / YTTTS
View on GitHub
The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions
☆53Apr 1, 2021Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Archivoice / nnsvs-chinese-support
View on GitHub
Hed and supporting files for Chinese NNSVS Dataset Creation
☆13Oct 14, 2025Updated 9 months ago
chrisluedtke / divvy-data
View on GitHub
Python API and analysis of Chicago's bikeshare
☆10Dec 8, 2022Updated 3 years ago
bagustris / s3prl-ser
View on GitHub
S3PRL for Speech Emotion Recognition (see s3prl > downstream)
☆15Feb 28, 2026Updated 5 months ago
golemfactory / sp-wasm
View on GitHub
SpiderMonkey-based Wasm sandbox
☆22May 29, 2020Updated 6 years ago
ZinggJM / GxEPD2_4G
View on GitHub
☆52Mar 18, 2025Updated last year
cheenwe / qingting_api
View on GitHub
蜻蜓FM API
☆13Jan 8, 2017Updated 9 years ago
chinawilon / fcm_game_go
View on GitHub
网络游戏防沉迷实名认证，使用测试码通过所有的测试案例，以及正式接口的调用。
☆13Jun 10, 2021Updated 5 years ago
yellbuy / go-ec-openapi
View on GitHub
电商平台API
☆12Feb 6, 2026Updated 5 months ago
aiplaybookin / gradio-demo
View on GitHub
Examples of demo deployment using Gradio. Image Classification, Live Webcam Segmentation, APIs , Tunneling etc.
☆17Oct 17, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
tiro-is / tiro-speech-core
View on GitHub
This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core
☆15Jun 19, 2023Updated 3 years ago
devilesk / dota-map-coordinates
View on GitHub
Custom game for dumping dota map entity coordinate data
☆17Nov 29, 2019Updated 6 years ago
Biano-AI / serving-compare-middleware
View on GitHub
FastAPI middleware for comparing different ML model serving approaches
☆15Jul 5, 2023Updated 3 years ago
solarmist / apple-peeler
View on GitHub
Extract XML from the OS X dictionaries.
☆36Sep 24, 2021Updated 4 years ago
mwouts / papermill_jupytext
View on GitHub
Parametrize and run scripts as notebooks with jupytext and papermill
☆18Sep 29, 2019Updated 6 years ago
victorGPT / Transcriptify
View on GitHub
One script that uses OpenAI to transcribe audio into text.
☆15May 13, 2023Updated 3 years ago
daveshap / AutoMuse
View on GitHub
☆17Nov 10, 2021Updated 4 years ago