Speaker prediction for captions on the Lex Fridman podcast
☆27Feb 14, 2024Updated 2 years ago
Alternatives and similar repositories for lexpod-speaker-prediction
Users who are interested in lexpod-speaker-prediction are comparing it to the libraries listed below.
- Transcription and Diarization based on OpenAI's Whisper☆25Sep 9, 2025Updated 8 months ago
- Accelerate Whisper tasks such as transcription, by multiprocessing through parallelization☆25Oct 29, 2022Updated 3 years ago
- This is the repository containing the solution of the homework for the CS224W course at Stanford: Machine Learning with Graphs☆11Jul 19, 2020Updated 5 years ago
- ☆13Jun 12, 2024Updated last year
- Podalize: Podcast Transcription and Analysis☆159Sep 8, 2024Updated last year
- Example plugin for Rivet, showing how to execute a python script in a node☆11Nov 15, 2023Updated 2 years ago
- Zero-shot Audio Classification using Whisper☆79Dec 12, 2022Updated 3 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Feb 15, 2024Updated 2 years ago
- ☆17Jun 8, 2019Updated 6 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Jun 5, 2020Updated 5 years ago
- This project aims to make the Apache Jena Framework usable on Android☆16Apr 15, 2015Updated 11 years ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- Playing around drones with Android's Speech-to-text & Text-to-Speech; Setting up a Wake-up-word other than OK Google, and trying to match…☆12Apr 4, 2019Updated 7 years ago
- Provides hooks for various user-triggered events.☆52Aug 31, 2013Updated 12 years ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit.☆10Feb 29, 2024Updated 2 years ago
- Code repository for Qlik Sense Cookbook, published by Packt☆12Jan 18, 2023Updated 3 years ago
- Templates for musical textual inversion for riffusion☆11Apr 14, 2023Updated 3 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 4 years ago
- ☆20Mar 4, 2025Updated last year
- A narrow implementation of DiagramGPT for generating system architecture diagrams with local LLM models and Llama.cpp☆27May 28, 2024Updated last year
- Speechflow for emotion recognition related information decomposition☆10Jul 27, 2021Updated 4 years ago
- Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23]☆17Aug 16, 2023Updated 2 years ago
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆52Apr 1, 2021Updated 5 years ago
- Hed and supporting files for Chinese NNSVS Dataset Creation☆13Oct 14, 2025Updated 6 months ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Feb 28, 2026Updated 2 months ago
- Tunneling service for Hyperswarm☆24Apr 22, 2020Updated 6 years ago
- Qingting FM (蜻蜓FM) API☆12Jan 8, 2017Updated 9 years ago
- An edge agent framework built in pure Python☆23Dec 2, 2025Updated 5 months ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆19Dec 1, 2022Updated 3 years ago
- ☆13Dec 7, 2018Updated 7 years ago
- Extract XML from the OS X dictionaries.☆36Sep 24, 2021Updated 4 years ago
- Ember Admin with a Twitter Bootstrap Theme☆25May 20, 2020Updated 5 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Jun 19, 2023Updated 2 years ago
- FastAPI middleware for comparing different ML model serving approaches☆15Jul 5, 2023Updated 2 years ago
- ☆17Nov 10, 2021Updated 4 years ago
- Parametrize and run scripts as notebooks with jupytext and papermill☆18Sep 29, 2019Updated 6 years ago
- ☆16Sep 11, 2023Updated 2 years ago
- One script that uses OpenAI to transcribe audio into text.☆15May 13, 2023Updated 2 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Mar 28, 2023Updated 3 years ago