IBM / watson-stt-wer-python
Utilities for transcribing a set of audio files with IBM Watson Speech to Text (STT), then analyzing the error rate of the STT transcription against a known-good transcription
☆26Updated 2 months ago
Alternatives and similar repositories for watson-stt-wer-python:
Users that are interested in watson-stt-wer-python are comparing it to the libraries listed below
- Scripts that run against Watson Assistant for K fold validation on training set, testing on blind test, and draw precision curves for com…☆79Updated 3 weeks ago
- Assistant Improve notebooks for Watson Assistant☆68Updated last year
- Dialog Flow Analysis Notebook for Watson Assistant☆28Updated 2 years ago
- Create a custom Watson Speech to Text model using specialized domain data☆59Updated 3 years ago
- Dialog Skill Analysis framework for Watson Assistant☆41Updated 2 weeks ago
- Watson Machine Learning sample models, notebooks and apps.☆117Updated this week
- Hands-on Python + IBM Watson lab being presented at IBM Think2018 conference in March 2018☆18Updated 6 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆81Updated last year
- AI Agents, LLM Fine-tuning, Developer Productivity, Governance, IBM watsonx☆22Updated last month
- Various speech datasets made available to the public☆113Updated 2 months ago
- Traditional ASR (Signal & Cepstral Analysis, DTW, HMM) & DNNs (Custom Models + DeepSpeech) on Indian Accent Speech☆91Updated last year
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Updated 2 years ago
- ☆43Updated 2 years ago
- Zero-shot Audio Classification using Whisper☆77Updated 2 years ago
- This GitHub repository is used for storing assets developed for analyzing Watson Assistant logs.☆11Updated 2 years ago
- Finetune VITS and MMS using HuggingFace's tools☆132Updated 10 months ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆96Updated this week
- Create an LJSpeech structured voice dataset on wave input☆25Updated 4 months ago
- This will hold the data pipeline to convert raw audio data to speech which will act as input dataset for speech-to-text pipeline☆32Updated 2 years ago
- A spaCy custom component that extracts and normalizes temporal expressions☆54Updated 2 years ago
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.☆15Updated 5 years ago
- ☆43Updated 2 years ago
- Repository containing experimentation platform on how to train, infer on wav2vec2 models.☆86Updated 2 years ago
- Sample applications that use IBM embeddable AI libraries and linked from https://dsce.ibm.com☆30Updated this week
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…☆31Updated 3 years ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆101Updated 9 months ago
- The Gridspace-Stanford Harper Valley speech dataset. Created in support of CS224S.☆43Updated 3 years ago
- IBM-Generative-AI is a Python library built on IBM's large language model REST interface to seamlessly integrate and extend this service …☆255Updated 2 months ago
- ☆11Updated 3 years ago
- ☆66Updated 2 months ago