Speaker prediction for captions on the Lex Fridman podcast
☆27Feb 14, 2024Updated 2 years ago
Alternatives and similar repositories for lexpod-speaker-prediction
Users that are interested in lexpod-speaker-prediction are comparing it to the libraries listed below
Sorting:
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization☆25Oct 29, 2022Updated 3 years ago
- ☆19Nov 4, 2022Updated 3 years ago
- ☆20Mar 4, 2025Updated last year
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Feb 15, 2024Updated 2 years ago
- PDF parser powered by grobid☆28Jul 26, 2024Updated last year
- Clustering-based methods for overlapping diarization☆82Jan 12, 2024Updated 2 years ago
- Code repository for Qlik Sense Cookbook, published by Packt☆12Jan 18, 2023Updated 3 years ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit.☆10Feb 29, 2024Updated 2 years ago
- ☆32Dec 4, 2022Updated 3 years ago
- Podalize: Podcast Transcription and Analysis☆160Sep 8, 2024Updated last year
- LLM Building Blocks for Python Course☆16Nov 17, 2025Updated 3 months ago
- Completely free Text-to-Speech (TTS) models with excellent Turkish support and multilingual capabilities. No development, just a comprehe…☆15Jul 2, 2025Updated 8 months ago
- A simple booking system, developed in screenful-sized steps☆13Oct 1, 2020Updated 5 years ago
- Incognito Proxy chrome extension☆10Sep 27, 2023Updated 2 years ago
- ☆18Jun 25, 2025Updated 8 months ago
- Residual Quantization Autoencoder, used for interpreting LLMs☆14Jan 1, 2025Updated last year
- a blog starter project☆11Oct 29, 2018Updated 7 years ago
- Conversion of audio files to text using whisper from OpenAI with a simple tkinter GUI☆10Apr 13, 2023Updated 2 years ago
- Fine-tuning GPT-2 to generate research paper abstracts☆12Apr 28, 2021Updated 4 years ago
- Main Panax Documentation☆11Feb 12, 2016Updated 10 years ago
- Arabic Grapheme-to-Phoneme (G2P) Conversion☆13Mar 15, 2025Updated 11 months ago
- ☆12Mar 3, 2023Updated 3 years ago
- Wenet speech to text for react native☆10Nov 1, 2022Updated 3 years ago
- ☆11Feb 25, 2025Updated last year
- Desktop Widget Manager. Think of conky, but with Python instead of Lua.☆13Jun 10, 2020Updated 5 years ago
- Open Source Speech Inferencing Libary for Indic Languages☆13Apr 11, 2022Updated 3 years ago
- ☆10Jun 12, 2023Updated 2 years ago
- ☆10Oct 20, 2022Updated 3 years ago
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- extending laughbot project to encoder-based transformer model finetuned on same dataset for humor classification☆10Jan 4, 2023Updated 3 years ago
- A list of various eye- and head-tracking software, products, etc. ℹ️ This is just a push-mirror. We develop here: https://codeberg.org/ey…☆18Aug 28, 2025Updated 6 months ago
- A javascript library for trigram indexing and finding. If you want to know more about trigrams and how to use them try the example, and r…☆12Dec 1, 2019Updated 6 years ago
- ☆11Jul 19, 2018Updated 7 years ago
- ☆10Apr 3, 2024Updated last year
- Radix Primitives Cheatsheet☆12Mar 11, 2022Updated 3 years ago
- ☆12Oct 21, 2023Updated 2 years ago
- DEPRECATED: Tool for checking data leaks of social media platforms☆10Feb 20, 2022Updated 4 years ago
- Seamless Voice Interactions with LLMs☆12Oct 28, 2023Updated 2 years ago
- Simulating the fractional quantum Hall effect with neural network variational Monte Carlo☆20Sep 12, 2025Updated 5 months ago