openai/whisper + extra features
☆89Oct 26, 2022Updated 3 years ago
Alternatives and similar repositories for pywhisper
Users that are interested in pywhisper are comparing it to the libraries listed below
Sorting:
- Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion☆13Nov 6, 2022Updated 3 years ago
- Object recognition with Pepper using a deep learning model☆10Sep 16, 2021Updated 4 years ago
- Course repository for the Spring 2023 COMP664 course "Deep Learning" at UNC☆14Apr 17, 2023Updated 2 years ago
- Seamless Voice Interactions with LLMs☆12Oct 28, 2023Updated 2 years ago
- Extension for stable diffusion webui to add advance prompt tuning☆10Nov 13, 2022Updated 3 years ago
- Summaries of machine learning papers☆12Aug 19, 2022Updated 3 years ago
- unofficial pytorch implementation of HiFi-GAN with fast MISR.☆15Mar 21, 2023Updated 2 years ago
- Welcome to the Real-Time Voice Activity Detection (VAD) program, powered by Silero-VAD model! 🚀 This program allows you to perform live …☆12Jul 9, 2023Updated 2 years ago
- 📃 A curated list of all possible resources (tools, tutorials, platforms, etc) an andrew email can get you☆13Nov 15, 2024Updated last year
- [WIP] Transformer to embed Danbooru labelsets☆13Mar 31, 2024Updated last year
- 4G GPU & 10 Minutes for train☆12Aug 9, 2023Updated 2 years ago
- Automated av content transcription search for your website☆15Dec 17, 2025Updated 2 months ago
- Label images with LabelImg; Object detection with detectron2☆13Aug 20, 2021Updated 4 years ago
- Voice activity detection and speaker gender segmentation audiovisual corpus☆16Jan 20, 2025Updated last year
- GE2E Speaker Encoder - Generalized End-To-End Loss for Speaker Verification☆13May 17, 2020Updated 5 years ago
- Listen, transcribe, reply - Voice Assistant using OpenAI & ElevenLabs API's☆14Jun 24, 2023Updated 2 years ago
- Stable Diffusion in TensorRT 8.5+☆15Mar 19, 2023Updated 2 years ago
- Turkish Vision Language Model Development And Research☆16Aug 9, 2024Updated last year
- Experimental LLM interface exploring new ways to use AI to improve human thinking☆19Feb 27, 2026Updated last week
- a version of fast_Dreambooth by TheLastBen for kaggle notebook☆17Jun 1, 2023Updated 2 years ago
- Converts stable diffusion embeddings to loadable pngs☆40Dec 6, 2022Updated 3 years ago
- Streaming transcriber with whisper☆696May 1, 2023Updated 2 years ago
- Llama causal LM fully recreated in LibTorch. Designed to be used in Unreal Engine 5☆16Sep 19, 2024Updated last year
- Visualising Losses in Deep Neural Networks☆16Jul 17, 2024Updated last year
- ☆45May 4, 2025Updated 10 months ago
- An environment where you can try out faster-whisper immediately.☆38Nov 21, 2024Updated last year
- PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor☆17Apr 13, 2023Updated 2 years ago
- Testing paligemma2 finetuning on reasoning dataset☆18Dec 28, 2024Updated last year
- ☆40Dec 25, 2022Updated 3 years ago
- ☆16Jun 4, 2016Updated 9 years ago
- Code for the note "NF4 Isn't Information Theoretically Optimal (and that's Good)☆21Jun 22, 2023Updated 2 years ago
- ☆17Apr 7, 2022Updated 3 years ago
- ☆20Jul 13, 2022Updated 3 years ago
- Code for the paper "Function-Space Learning Rates"☆25Jun 3, 2025Updated 9 months ago
- Code for generating colinraffel.com and my CV☆16Feb 19, 2026Updated 2 weeks ago
- ☆20Mar 4, 2025Updated last year
- ☆20Sep 20, 2022Updated 3 years ago
- Cog wrapper for collabora/WhisperSpeech☆25Mar 5, 2024Updated 2 years ago
- An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker …☆20Sep 17, 2024Updated last year