Perform SOTA Speech2Text on Long Audio Files with/without diarization Using Google Cloud Speech API
☆14Feb 21, 2022Updated 4 years ago
Alternatives and similar repositories for Speech2Text-for-Long-Audio-Files
Users that are interested in Speech2Text-for-Long-Audio-Files are comparing it to the libraries listed below.
- Fontawesome for sciter.js☆10Feb 28, 2025Updated last year
- Trading algorithm for Bitcoins in USD on quantconnect.com☆13Jan 12, 2018Updated 8 years ago
- ☆13Mar 23, 2026Updated 2 months ago
- This GUI Program helps you download songs from Spotify.☆10Dec 16, 2021Updated 4 years ago
- ☆11Nov 11, 2022Updated 3 years ago
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆12Mar 14, 2025Updated last year
- The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"☆11May 16, 2023Updated 3 years ago
- YOURLS plugin that allows you to change the default behaviour of YOURLS to send 302 redirects instead of 301.☆12Nov 22, 2021Updated 4 years ago
- A Go-based web application for sending newsletters and allowing subscribers to manage their preferences. Designed for ease of deployment …☆23Jul 22, 2025Updated 10 months ago
- Code for the Adzuna Salary Prediction Kaggle competition - http://www.kaggle.com/c/job-salary-prediction Placed 10th out of approximately…☆11Apr 10, 2013Updated 13 years ago
- Lightweight Cryptocurrency Monitor☆15Feb 14, 2019Updated 7 years ago
- Twitter text processing library (auto linking and extraction of usernames, lists and hashtags). Based on the Java implementation by Matt …☆88Jul 28, 2014Updated 11 years ago
- Code for paper "Cross-Domain Slot Filling as Machine Reading Comprehension" in IJCAI 2021☆11Aug 24, 2021Updated 4 years ago
- Download every skillshare class you want. (thanks to HeckerNoHecking's api)☆13Aug 5, 2023Updated 2 years ago
- NuNER is the family of SOTA Foundation and Zero-shot for Entity Recognition☆15Jun 11, 2024Updated 2 years ago
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning☆15Jun 23, 2024Updated last year
- A short dirty python script to ping a Flight Comparison website and send an email notifying if a maximum price (or below) is available.☆11Oct 17, 2012Updated 13 years ago
- Minimalist Speech-to-Text toolkit for educational purposes☆13Feb 1, 2024Updated 2 years ago
- Use Forgejo with Coolify just like GitHub! Native integration for browsing repos, automatic webhooks, and push-to-deploy☆19Jul 16, 2025Updated 10 months ago
- ☆11Oct 24, 2022Updated 3 years ago
- Substitute alternative spellings of special characters (e.g. German umlauts [ae, oe, ue] and [ss]) with their correct versions (ä, ö, ü, …☆11Nov 24, 2024Updated last year
- Converts Youtube URLs to Text with Speech Recognition☆27Oct 18, 2022Updated 3 years ago
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆19Jul 16, 2024Updated last year
- ☆11Aug 10, 2022Updated 3 years ago
- Check Luxmed doctor appointment availability.☆12Dec 9, 2018Updated 7 years ago
- Code recipe for "Multimodal One-Shot Learning of Speech and Images"☆11Nov 21, 2018Updated 7 years ago
- This setup allows to train end-to-end neural models for spoken language understanding (SLU).☆11Jun 12, 2023Updated 3 years ago
- Source code to "SliTraNet: Automatic Detection of Slide Transitions in Lecture Videos using Convolutional Neural Networks"☆10Dec 17, 2023Updated 2 years ago
- Generate embeddings for audio files (music, speech, sounds) and text using CLAP with llm☆22May 15, 2025Updated last year
- trailing stop loss daemon that tracks performance via Philips Hue☆11Apr 24, 2018Updated 8 years ago
- ☆18Jul 25, 2022Updated 3 years ago
- CTE: Contextualized Table Extraction Dataset☆17Feb 23, 2023Updated 3 years ago
- Speech understanding system training toolkit, including tasks of ASR, SSL, LM, etc.☆11Feb 12, 2026Updated 4 months ago
- opencv python script for face detection and recognition, people detection and more☆14Dec 20, 2016Updated 9 years ago
- a python implementation of the strategy as described in "Street Smarts: High Probability Short-Term Trading Strategies" by Linda Raschke☆14Oct 21, 2019Updated 6 years ago
- Tracking activity and sentiment in the crypto markets using the Twitter API, Reddit API, Google Trends, and other sources☆10Sep 23, 2019Updated 6 years ago
- ☆11Jul 16, 2024Updated last year
- Grazyna the polish irc bot☆11Jul 6, 2022Updated 3 years ago
- S3Zilla... an S3 File Transfer Client with a GUI developed using Tkinter☆12Feb 7, 2026Updated 4 months ago