Speaker prediction for captions on the Lex Fridman podcast
☆27Feb 14, 2024Updated 2 years ago
Alternatives and similar repositories for lexpod-speaker-prediction
Users that are interested in lexpod-speaker-prediction are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization☆25Oct 29, 2022Updated 3 years ago
- An iOS swift app that detects objects using machine learning (CoreML, Vision)☆13Feb 26, 2023Updated 3 years ago
- ☆13Jun 12, 2024Updated 2 years ago
- Podalize: Podcast Transcription and Analysis☆158Sep 8, 2024Updated last year
- Zero-shot Audio Classification using Whisper☆79Dec 12, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Simple dim overlay on iOS MKMapView, covering entire map using MKOverlay and MKOverlayView with customisable colour and alpha values.☆12May 24, 2017Updated 9 years ago
- ☆19Nov 4, 2022Updated 3 years ago
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- ☆32Dec 4, 2022Updated 3 years ago
- This project aims to make the Apache Jena Framework usable on Android☆16Apr 15, 2015Updated 11 years ago
- Wenet speech to text for react native☆10Nov 1, 2022Updated 3 years ago
- The History of Speech Recognition to the Year 2030☆13Aug 14, 2021Updated 4 years ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit.☆10Feb 29, 2024Updated 2 years ago
- WebRTC based voice activity detection☆22Jul 15, 2019Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 10 Bands Parametric Equalizer. Swift 4.☆15Oct 22, 2017Updated 8 years ago
- ☆17Feb 28, 2026Updated 3 months ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 4 years ago
- A Swift script for detecting silence in audio files made with reactive programming in RxSwift☆13Apr 7, 2018Updated 8 years ago
- ☆20Mar 4, 2025Updated last year
- 语音切割,python ,webrtc☆11Sep 28, 2018Updated 7 years ago
- Speechflow for emotion recognition related information decomposition☆10Jul 27, 2021Updated 4 years ago
- ☆22Apr 6, 2023Updated 3 years ago
- This is the pytorch\DGL implementation of the AMIGO paper.☆10Feb 6, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Hed and supporting files for Chinese NNSVS Dataset Creation☆13Oct 14, 2025Updated 8 months ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Feb 28, 2026Updated 3 months ago
- 北京工业大学上网网关登录脚本☆13Jan 3, 2024Updated 2 years ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆19Dec 1, 2022Updated 3 years ago
- Incognito Proxy chrome extension☆10Sep 27, 2023Updated 2 years ago
- ☆13Dec 7, 2018Updated 7 years ago
- Capture and compress video into H.264 with AVFoundation/VideoToolbox written in Swift☆25Aug 24, 2017Updated 8 years ago
- This is the project page for paper `CAM Back Again: Large Kernel CNNs from a Weakly Supervised Object Localization Perspective`, in CVPR2…☆13Mar 19, 2024Updated 2 years ago
- Examples of demo deployment using Gradio. Image Classification, Live Webcam Segmentation, APIs , Tunneling etc.☆17Oct 17, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆11Feb 14, 2025Updated last year
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Jun 19, 2023Updated 3 years ago
- FastAPI middleware for comparing different ML model serving approaches☆15Jul 5, 2023Updated 2 years ago
- One script that uses OpenAI to transcribe audio into text.☆15May 13, 2023Updated 3 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Mar 28, 2023Updated 3 years ago
- Clustering-based methods for overlapping diarization☆85Jan 12, 2024Updated 2 years ago
- Front's developer resources☆28Aug 31, 2021Updated 4 years ago