Cosmos-Break/asr

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Cosmos-Break/asr)

Cosmos-Break / asr

沪语（上海话）ASR（语音识别）模型

☆30

Alternatives and similar repositories for asr

Users that are interested in asr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

michaelneri / unsupervised-audio-anomaly-detection
View on GitHub
Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …
☆11Nov 6, 2024Updated last year
lmaxwell / McHuo
View on GitHub
A chinese singing voice dataset, professional male singer, 105 songs, 132 minutes
☆12Oct 19, 2023Updated 2 years ago
FlorenceZh / BilibiliAudioDownloader-rebuild
View on GitHub
b站视频音轨下载器（支持多P） Rebuild from https://github.com/Quandong-Zhang/bilibiliAudioDownloader.ps1 with python
☆11Jul 31, 2025Updated 11 months ago
AI-Hobbyist / StarRail_Voice_Sorting_Scripts
View on GitHub
☆13Apr 26, 2026Updated 2 months ago
AnuoF / asr_example_csharp
View on GitHub
封装了百度、捷通华声和讯飞语音识别的库，以及捷通华声、民族语文翻译、小牛翻译的封装。
☆15Sep 10, 2019Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
johnson7788 / label-studio
View on GitHub
Label Studio is a multi-type data labeling and annotation tool with standardized output format
☆10Nov 17, 2021Updated 4 years ago
ecoxial2007 / FGRW_MedVQA
View on GitHub
Fine-Grained Knowledge Fusion for Retrieval-Augmented Medical Visual Question
☆11Jul 18, 2024Updated 2 years ago
manyeyes / K2TransducerAsr
View on GitHub
c# library for decoding K2 transducer Models，used in speech recognition (ASR)
☆13Aug 20, 2025Updated 11 months ago
JeongHun0716 / e-mvsr
View on GitHub
Efficient Training for Multilingual Visual Speech Recognition: Pre-training with Discretized Visual Speech Representation (ACM MM 2024)
☆20Mar 17, 2025Updated last year
HKUST-KnowComp / WDDC
View on GitHub
Source code for NAACL 2022 paper Weakly Supervised Text Classification using Supervision Signals from a Language Mode
☆10Jun 13, 2022Updated 4 years ago
manyeyes / AliCTTransformerPunc
View on GitHub
c# library for decoding CTTransformer punc models, which can add punctuation to Chinese and English texts
☆14Aug 18, 2025Updated 11 months ago
IreneZihuiLi / TopicAttentionMedicalAD
View on GitHub
This repo is the implementation of "A Neural Topic-Attention Model for Medical Term Abbreviation Disambiguation".
☆15Dec 3, 2019Updated 6 years ago
ZygoteCode / VadSharp
View on GitHub
Enterprise VAD (Voice Activity Detection) in C#.NET (.NET 6.0+) with Microsoft.ML.Net, ONNXRuntime and DirectML. The easiest, efficient, …
☆10Apr 20, 2025Updated last year
zacksleo / pcs-alfred-workflow
View on GitHub
百度网盘 Alfred workflow
☆11Apr 23, 2021Updated 5 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
MenglingD / mandarin_speech_recognition
View on GitHub
基于深度学习的普通话语音识别
☆18Apr 23, 2019Updated 7 years ago
manyeyes / KaldiNativeFbankSharp
View on GitHub
c# wrapper for kaldi-native-fbank，used to extract audio features in speech recognition (ASR) task
☆10Jul 26, 2025Updated 11 months ago
murufeng / Awesome-NLP-Resources
View on GitHub
自然语言处理方面资料集
☆10May 8, 2020Updated 6 years ago
yic20 / CoMC
View on GitHub
[ICML2024] Official PyTorch implementation of CoMC: Language-Driven Cross-Modal Classifier for Zero-Shot Multi-Label Image Recognition
☆17Jul 9, 2024Updated 2 years ago
cpii-cai / PunCantonese
View on GitHub
A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts
☆15Dec 3, 2024Updated last year
wahr0411 / PTADisc
View on GitHub
☆15Sep 20, 2025Updated 10 months ago
z-soulx / CodeReadingMarkNotePro
View on GitHub
CodeReadingNote pro supports jetbrains22.1.4+, code remark, custom tags, tags grouping topic, ongoing maintenance
☆13Apr 12, 2026Updated 3 months ago
zhangdddong / beautifulNLP
View on GitHub
美丽东自然语言处理百宝箱~命名实体识别，文本分类，语言模型，文本摘要。
☆10Nov 28, 2022Updated 3 years ago
YingyWang / NLPCC_2018_TASK2_GEC
View on GitHub
☆12Jul 2, 2018Updated 8 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
LAION-AI / emotional-speech-annotations
View on GitHub
This repository contains prompts & best practices to annotate audio clips with a very high degree of details using Audio-Language-Models
☆35Oct 13, 2024Updated last year
xueyouluo / biaffine-bert-relation-extract
View on GitHub
基于BERT+Biaffine结构的关系抽取模型
☆12Feb 23, 2022Updated 4 years ago
melanietosik / maxent-ner-tagger
View on GitHub
Maximum entropy named-entity recognition (NER)
☆13Dec 8, 2022Updated 3 years ago
joshchang0111 / EMNLP2023-RumorDAS
View on GitHub
Original PyTorch Implementation for the EMNLP 2023 Paper "Beyond Detection: A Defend-and-Summarize Strategy for Robust and Interpretable …
☆16Dec 14, 2023Updated 2 years ago
haoheliu / ontology-aware-audio-tagging
View on GitHub
☆14Nov 22, 2022Updated 3 years ago
ICTMCG / MTM
View on GitHub
Official repository to release the code and datasets in the paper, "Article Reranking by Memory-enhanced Key Sentence Matching for Detect…
☆19Dec 15, 2021Updated 4 years ago
TIGER-AI-Lab / VisCoder
View on GitHub
The official code of "VisCoder: Fine-Tuning LLMs for Executable Python Visualization Code Generation" [EMNLP25]
☆19Sep 21, 2025Updated 10 months ago
JianmingS / Natural-language-understanding
View on GitHub
自然语言理解【词频统计 + 汉语自动分析】
☆10Aug 14, 2015Updated 10 years ago
Koziev / StressModel
View on GitHub
Neural model for prediction of stress position in Russian words
☆13Jun 22, 2025Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
PaddlePaddle / paddle_upgrade_tool
View on GitHub
upgrade paddle-1.x to paddle-2.0
☆12Mar 9, 2021Updated 5 years ago
frozentoad9 / CMST
View on GitHub
Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages
☆13Oct 12, 2022Updated 3 years ago
TTS-Research / PEL-TTS
View on GitHub
☆14Aug 16, 2023Updated 2 years ago
RiddleMa / cluster_zf
View on GitHub
方剂聚类
☆17Jul 18, 2018Updated 8 years ago
nlp-tlp / quickgraph
View on GitHub
An annotation tool for rapid multi-task collaborative information extraction for knowledge graph construction.
☆21Jun 12, 2025Updated last year
Hayeonbang / PIAST
View on GitHub
A piano music dataset with Audio, Symbolic and Text labels
☆36Mar 6, 2025Updated last year
YangHan-Morningstar / Bert-Chinese-ShortText-Classification
View on GitHub
基于Bert、Pytorch的中文短文本分类任务
☆13Nov 2, 2022Updated 3 years ago