guangkun0818/speech2text

Speech understanding system training toolkit, including tasks of ASR, SSL, LM, etc.

☆12

Alternatives and similar repositories for speech2text

Users who are interested in speech2text are comparing it to the libraries listed below.

yangjingyuan / ConstDecoder
☆11 · Oct 24, 2022 · Updated 3 years ago
asindel / SliTraNet
Source code to "SliTraNet: Automatic Detection of Slide Transitions in Lecture Videos using Convolutional Neural Networks"
☆10 · Dec 17, 2023 · Updated 2 years ago
lisjin / hct
Hierarchical Context Tagger for utterance rewriting
☆13 · Mar 27, 2022 · Updated 4 years ago
sahanbull / context-agnostic-engagement
This repository contains the VLEngagement dataset and the helper functions/tools required to work with the dataset.
☆16 · Dec 3, 2021 · Updated 4 years ago
skit-ai / N-Best-ASR-Transformer
Code for ACL-IJCNLP 2021 paper "N-Best-ASR-Transformer: Enhancing SLU Performance using Multiple ASR Hypotheses."
☆17 · Nov 30, 2021 · Updated 4 years ago
tatHi / optok
☆10 · Aug 26, 2021 · Updated 4 years ago
Observeai-Research / Phoneme-BERT
☆34 · Jun 15, 2021 · Updated 5 years ago
AI-Eden / eden-skills
Deterministic & Blazing-Fast Skills Manager for AI Agents (Claude Code, Cursor, Codex & More).
☆29 · Apr 15, 2026 · Updated 3 months ago
LC044 / MiniC
MiniC language compiler front end: builds the abstract syntax tree, produces linear IR, and generates the control flow graph
☆25 · Mar 25, 2024 · Updated 2 years ago
GarryLau / DataAugmentation
Caffe Image Data Augmentation
☆15 · May 11, 2018 · Updated 8 years ago
tarun-bisht / wav2vec2-asr
wav2vec2 asr with transformers
☆16 · Oct 26, 2021 · Updated 4 years ago
nlp-tlp / mwo2kg-and-echidna
Source code for MWO2KG and Echidna: Constructing and Exploring Knowledge Graphs from Maintenance Data
☆10 · Feb 13, 2023 · Updated 3 years ago
Web3KeyTalking / web3book
☆14 · Jan 9, 2025 · Updated last year
KeiKinn / ParaCLAP
Towards a general language-audio model for computational paralinguistic tasks
☆30 · Dec 14, 2024 · Updated last year
s-serenity / Deep-learning-for-channel-encoding-and-decoding
My graduation project.
☆13 · Oct 12, 2023 · Updated 2 years ago
mx-strk / InformationBottleneckDecodingLDPC
Decoding of LDPC Codes Using the Information Bottleneck Method in Python
☆17 · Dec 11, 2018 · Updated 7 years ago
jiwidi / las-pytorch
Listen, Attend and Spell model for E2E ASR. Implementation in PyTorch
☆42 · Jun 22, 2022 · Updated 4 years ago
AmeanAsad / Syndrome-Error-Decoding
A class that is able to develop any (n, k) linear code. Includes an implementation of an ASCII correcting linear code along with a simula…
☆12 · Apr 20, 2025 · Updated last year
h-munakata / Lighthouse-Wrapper-for-Audio-Moment-Retrieval
☆13 · Mar 23, 2026 · Updated 3 months ago
PhanHuyThong / Image-Reconstruction-by-CNN-based-PGD
Framework to train CNN and use it in Relaxed Projected Gradient Descent (RPGD) to reconstruct images
☆13 · Nov 25, 2019 · Updated 6 years ago
KMCS-NII / PDFNLT-1.0
Tools for Natural Language Text aware PDF structure analysis
☆15 · Mar 11, 2022 · Updated 4 years ago
fabriziocarpi / RLdecoding
Reinforcement Learning for Bit Flipping decoding of linear codes
☆14 · Sep 12, 2020 · Updated 5 years ago
wutong8023 / SpeechRE
☆11 · Nov 11, 2022 · Updated 3 years ago
kjw11 / CSEnet-ASR
Cross-Speaker Encoding Network for Multi-talker Speech Recognition
☆12 · Mar 14, 2025 · Updated last year
haimengzhao / CAE-ADMM
CAE-ADMM: Implicit Bitrate Optimization via ADMM-Based Pruning in Compressive Autoencoders
☆47 · Jul 22, 2020 · Updated 5 years ago
ewwink / wikipedia-wordlists-extractor
Extract Unique Word Lists From Wikipedia Database
☆13 · May 27, 2020 · Updated 6 years ago
XL2248 / SOV-MAS
The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"
☆11 · May 16, 2023 · Updated 3 years ago
MengboLi / MS-SENet
☆11 · Jul 16, 2024 · Updated 2 years ago
wadayama / overloaded_MIMO
Deep learning aided iterative detection algorithm for massive overloaded MIMO channels
☆13 · Mar 5, 2019 · Updated 7 years ago
mengshiY / RCSF
Code for paper "Cross-Domain Slot Filling as Machine Reading Comprehension" in IJCAI 2021
☆11 · Aug 24, 2021 · Updated 4 years ago
Serega6678 / NuNER
NuNER is the family of SOTA Foundation and Zero-shot for Entity Recognition
☆15 · Jun 11, 2024 · Updated 2 years ago
jungm2018 / communications_neural_net
Implementation of Neural Nets for Communications Channel Decoding using Log Likelihood Ratios
☆16 · Nov 19, 2020 · Updated 5 years ago
Sreyan88 / RECAP
Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning
☆16 · Jun 23, 2024 · Updated 2 years ago
may- / joeys2t
Minimalist Speech-to-Text toolkit for educational purposes
☆13 · Feb 1, 2024 · Updated 2 years ago
alexpovel / betterletter
Substitute alternative spellings of special characters (e.g. German umlauts [ae, oe, ue] and [ss]) with their correct versions (ä, ö, ü, …
☆11 · Nov 24, 2024 · Updated last year
dksanyal / SpERT.PL
Joint Neural Model for Entity & Relation Extraction
☆16 · Oct 18, 2021 · Updated 4 years ago
Sreyan88 / LipGER
Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition
☆19 · Jul 16, 2024 · Updated 2 years ago
DongPoLI / Mul-BERT
mul-BERT, the official score on the SemEval 2010 Task 8 dataset is up to 90.72 (Macro-F1).
☆16 · Jan 11, 2021 · Updated 5 years ago
DianboWork / M3T-CNERTA
☆11 · Aug 10, 2022 · Updated 3 years ago