Shanghainese (Shanghai dialect) ASR (speech recognition) model
☆28May 13, 2024Updated last year
Alternatives and similar repositories for asr
Users who are interested in asr are comparing it to the libraries listed below.
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …☆11Nov 6, 2024Updated last year
- A Chinese singing voice dataset, professional male singer, 105 songs, 132 minutes☆11Oct 19, 2023Updated 2 years ago
- Web video surveillance and object detection built with esp32-cam + micropython + flask + yolo☆11Jan 13, 2023Updated 3 years ago
- Bilibili video audio-track downloader (supports multi-part videos). Rebuild from https://github.com/Quandong-Zhang/bilibiliAudioDownloader.ps1 with python☆11Jul 31, 2025Updated 7 months ago
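- A library wrapping the Baidu, SinoVoice (捷通华声), and iFLYTEK speech recognition services, along with wrappers for the SinoVoice, 民族语文翻译 (ethnic-minority language translation), and NiuTrans (小牛翻译) translation services.☆15Sep 10, 2019Updated 6 years ago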
- Label Studio is a multi-type data labeling and annotation tool with standardized output format☆10Nov 17, 2021Updated 4 years ago
- Unsupervised Cross-lingual Sentiment Analysis (CoNLL 2019)☆10Nov 4, 2019Updated 6 years ago
- Developing a chatbot step by step☆10Oct 9, 2018Updated 7 years ago
- A PyTorch implementation of a punctuation prediction system using (B)LSTM, which automatically adds suitable punctuation into text withou…☆63May 13, 2020Updated 5 years ago
- Send ESP32 video content to a server over the public internet☆10Apr 26, 2023Updated 2 years ago
- C# library for decoding K2 transducer models, used in speech recognition (ASR)☆13Aug 20, 2025Updated 7 months ago
- Code & data for IJCAI'22 paper "Recipe2Vec: Multi-modal Recipe Representation Learning with Graph Neural Networks".☆14Jul 24, 2022Updated 3 years ago
- ☆10Oct 14, 2020Updated 5 years ago
- super-resolution☆12Aug 2, 2019Updated 6 years ago
- This repo is the implementation of "A Neural Topic-Attention Model for Medical Term Abbreviation Disambiguation".☆15Dec 3, 2019Updated 6 years ago
- Efficient Training for Multilingual Visual Speech Recognition: Pre-training with Discretized Visual Speech Representation (ACM MM 2024)☆20Mar 17, 2025Updated last year
- An annotation tool for rapid multi-task collaborative information extraction for knowledge graph construction.☆21Jun 12, 2025Updated 9 months ago
- Deep-learning-based Mandarin speech recognition☆18Apr 23, 2019Updated 6 years ago
- Code of the paper Graph Convolutions over Constituent Trees for Syntax-Aware Semantic Role Labeling☆15Nov 15, 2020Updated 5 years ago
- C# wrapper for kaldi-native-fbank, used to extract audio features in speech recognition (ASR) tasks☆10Jul 26, 2025Updated 7 months ago
- Airtest + Poco game test automation framework☆16Feb 7, 2022Updated 4 years ago
- Creating Time-lapse video using esp32 camera module with the help of micropython☆12Mar 13, 2021Updated 5 years ago
- Coursework for the 深蓝学院 (Shenlan Academy) speech course "Speech Recognition: From Beginner to Mastery"☆22Apr 2, 2020Updated 5 years ago
- Chatbot, full version☆15Feb 5, 2021Updated 5 years ago
- [NAACL 2025] Beyond End-to-End VLMs: Leveraging Intermediate Text Representations for Superior Flowchart Understanding☆20Aug 23, 2025Updated 7 months ago
- An efficient WebQQ bot built on a multiplexed I/O model; its main features are a Python shell, Python code execution, code pasting, and English-Chinese translation☆76Sep 29, 2014Updated 11 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- A collection of natural language processing resources☆10May 8, 2020Updated 5 years ago
- [ICML2024] Official PyTorch implementation of CoMC: Language-Driven Cross-Modal Classifier for Zero-Shot Multi-Label Image Recognition☆16Jul 9, 2024Updated last year
- ☆12Jul 2, 2018Updated 7 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- 美丽东's natural language processing toolbox: named entity recognition, text classification, language models, text summarization.☆10Nov 28, 2022Updated 3 years ago
- Maximum entropy named-entity recognition (NER)☆13Dec 8, 2022Updated 3 years ago
- This repository contains prompts & best practices to annotate audio clips with a very high degree of details using Audio-Language-Models☆35Oct 13, 2024Updated last year
- A tool for self-hosting a video website, with support for Baidu Netdisk as the data backend☆12May 14, 2023Updated 2 years ago
- A relation extraction model based on a BERT + Biaffine architecture☆12Feb 23, 2022Updated 4 years ago
- A Baidu Netdisk clone built with the vue framework☆10Oct 9, 2018Updated 7 years ago
- Official repository to release the code and datasets in the paper, "Article Reranking by Memory-enhanced Key Sentence Matching for Detect…☆19Dec 15, 2021Updated 4 years ago
- ☆14Nov 22, 2022Updated 3 years ago