Shanghainese (Shanghai dialect) ASR (automatic speech recognition) model
☆30 · May 13, 2024 · Updated 2 years ago
Alternatives and similar repositories for asr
Users that are interested in asr are comparing it to the libraries listed below.
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" … ☆11 · Nov 6, 2024 · Updated last year
- A Chinese singing voice dataset, professional male singer, 105 songs, 132 minutes ☆11 · Oct 19, 2023 · Updated 2 years ago
- Bilibili video audio-track downloader (supports multi-part videos). Rebuilt from https://github.com/Quandong-Zhang/bilibiliAudioDownloader.ps1 with Python ☆11 · Jul 31, 2025 · Updated 10 months ago
- A library wrapping the Baidu, SinoVoice (捷通华声), and iFLYTEK speech recognition services, plus wrappers for the SinoVoice, ethnic-minority language (民族语文翻译), and NiuTrans (小牛翻译) translation services. ☆15 · Sep 10, 2019 · Updated 6 years ago
- ☆13 · Apr 26, 2026 · Updated last month
- Unsupervised Cross-lingual Sentiment Analysis (CoNLL 2019) ☆10 · Nov 4, 2019 · Updated 6 years ago
- Sequence alignment methods with helpers for PyTorch. ☆24 · Nov 30, 2022 · Updated 3 years ago
- A PyTorch implementation of a punctuation prediction system using (B)LSTM, which automatically adds suitable punctuation into text withou… ☆63 · May 13, 2020 · Updated 6 years ago
- Code & data for IJCAI'22 paper "Recipe2Vec: Multi-modal Recipe Representation Learning with Graph Neural Networks". ☆14 · Jul 24, 2022 · Updated 3 years ago
- Source code for NAACL 2022 paper Weakly Supervised Text Classification using Supervision Signals from a Language Model ☆10 · Jun 13, 2022 · Updated 4 years ago
- super-resolution ☆12 · Aug 2, 2019 · Updated 6 years ago
- C# library for decoding CTTransformer punc models, which can add punctuation to Chinese and English texts ☆14 · Aug 18, 2025 · Updated 9 months ago
- This repo is the implementation of "A Neural Topic-Attention Model for Medical Term Abbreviation Disambiguation". ☆15 · Dec 3, 2019 · Updated 6 years ago
- Efficient Training for Multilingual Visual Speech Recognition: Pre-training with Discretized Visual Speech Representation (ACM MM 2024) ☆20 · Mar 17, 2025 · Updated last year
- An annotation tool for rapid multi-task collaborative information extraction for knowledge graph construction. ☆21 · Jun 12, 2025 · Updated last year
- Deep-learning-based Mandarin speech recognition ☆18 · Apr 23, 2019 · Updated 7 years ago
- Code of the paper Graph Convolutions over Constituent Trees for Syntax-Aware Semantic Role Labeling ☆15 · Nov 15, 2020 · Updated 5 years ago
- Baidu Netdisk (百度网盘) Alfred workflow ☆11 · Apr 23, 2021 · Updated 5 years ago
- A music visualizer for Rainmeter ☆14 · Jul 3, 2019 · Updated 6 years ago
- Music Visualizer made using openFrameworks (C++) and Essentia library. Creates 2D and 3D Perlin Noise based visualizations using frequenc… ☆12 · May 23, 2018 · Updated 8 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation ☆12 · Jan 27, 2023 · Updated 3 years ago
- [NAACL 2025] Beyond End-to-End VLMs: Leveraging Intermediate Text Representations for Superior Flowchart Understanding ☆21 · Aug 23, 2025 · Updated 9 months ago
- A collection of natural language processing resources ☆10 · May 8, 2020 · Updated 6 years ago
- A music visualizer written in C++. ☆13 · Jun 5, 2017 · Updated 9 years ago
- ☆12 · Jul 2, 2018 · Updated 7 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts ☆16 · Dec 3, 2024 · Updated last year
- 美丽东 natural language processing toolbox: named entity recognition, text classification, language models, text summarization. ☆10 · Nov 28, 2022 · Updated 3 years ago
- CodeReadingNote pro supports jetbrains22.1.4+, code remark, custom tags, tags grouping topic, ongoing maintenance ☆13 · Apr 12, 2026 · Updated 2 months ago
- Maximum entropy named-entity recognition (NER) ☆13 · Dec 8, 2022 · Updated 3 years ago
- This repository contains prompts & best practices to annotate audio clips with a very high degree of details using Audio-Language-Models ☆35 · Oct 13, 2024 · Updated last year
- A tool for self-hosting a video website, with support for Baidu Netdisk as the data backend ☆12 · May 14, 2023 · Updated 3 years ago
- A Baidu Netdisk clone built with the Vue framework ☆10 · Oct 9, 2018 · Updated 7 years ago
- Official repository to release the code and datasets in the paper, "Article Reranking by Memory-enhanced Key Sentence Matching for Detect… ☆19 · Dec 15, 2021 · Updated 4 years ago
- ☆14 · Nov 22, 2022 · Updated 3 years ago
- 亿寻 (Yixun): source code for a high-speed Baidu Netdisk file downloader ☆11 · Dec 19, 2020 · Updated 5 years ago
- A deep-learning framework for Chinese conversational chatbots, compatible with PyTorch language models such as GPT2 and Bloom, Artificial Intelligence Markup Language (AIML), and task-oriented dialogue systems (Task) ☆25 · Jun 12, 2023 · Updated 3 years ago
- Upgrade paddle-1.x to paddle-2.0 ☆12 · Mar 9, 2021 · Updated 5 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages ☆13 · Oct 12, 2022 · Updated 3 years ago
- Neural model for prediction of stress position in Russian words ☆13 · Jun 22, 2025 · Updated 11 months ago