SpeechBrain中文文档
☆12Mar 20, 2021Updated 5 years ago
Alternatives and similar repositories for speechbrain-docs-zh-cn
Users that are interested in speechbrain-docs-zh-cn are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Arxiv automatically obtains the latest article service.☆11Apr 29, 2020Updated 6 years ago
- This is a complete online exam system☆10Dec 27, 2019Updated 6 years ago
- Voice Music Separation competing for 6th Huawei Cup in ZJU☆11Jun 2, 2015Updated 10 years ago
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining☆12Mar 23, 2021Updated 5 years ago
- Adversarial Unsupervised Domain Adaptation for Acoustic Scene Classification☆37Aug 23, 2018Updated 7 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A PyTorch implementation of " AN EMPIRICAL STUDY OF CONV-TASNET "☆52Apr 20, 2020Updated 6 years ago
- Pytorch implement of DANet For Speech Separation☆21Jan 9, 2020Updated 6 years ago
- CTR Prediction on PyTorch☆14Sep 2, 2019Updated 6 years ago
- ☆12Dec 23, 2022Updated 3 years ago
- Combines the SSL Method MixMatch with a pre-trained model (EfficientNet) on a chest x-ray dataset.☆11Jun 22, 2019Updated 6 years ago
- Code for https://arxiv.org/abs/1712.00254☆16Dec 6, 2017Updated 8 years ago
- Speech/Music discrimination using SampleCNN☆18May 30, 2025Updated 11 months ago
- Using spectral features for Instrument Classification in polyphonic musical clips☆15Apr 24, 2020Updated 6 years ago
- ☆14Jun 9, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.☆21Dec 8, 2022Updated 3 years ago
- Addressing the confounds of accompaniments in singer identification☆18Mar 24, 2020Updated 6 years ago
- speaker diarization system using an LSTM☆23Jan 4, 2023Updated 3 years ago
- MTGAN: Speaker Verification through Multitasking Triplet Generative Adversarial Networks☆19Feb 29, 2020Updated 6 years ago
- code for EMNLP2018 paper 'Associative-multichannel-autoencoder for multimodal word representation'☆13Aug 24, 2018Updated 7 years ago
- [CVPRW 2021] DUVE network for NTIRE 2021 Quality enhancement of heavily compressed videos - Track 3 Fixed bit-rate☆10Oct 17, 2024Updated last year
- Anonymous ICLR Submission☆14Sep 25, 2019Updated 6 years ago
- Algo Trade Multicharts Repo☆11Aug 23, 2020Updated 5 years ago
- Can audio-visual integration strengthen robustness under multimodal attacks?☆29Mar 31, 2022Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Develop speaker recognition model based on i-vector using TIMIT database☆16Jul 4, 2019Updated 6 years ago
- ☆13Oct 23, 2017Updated 8 years ago
- Listen, Attend and Spell - PyTorch Implementation☆17Dec 28, 2018Updated 7 years ago
- Unsupervised Representation Learning for Singing Voice Separation☆22Feb 10, 2023Updated 3 years ago
- AI-powered video object removal (diffusion inpainting under the hood).☆23Mar 9, 2026Updated 2 months ago
- Official PyTorch implementation of CVPR2022 paper “Learning to Imagine: Diversify Memory for Incremental Learning using Unlabeled Data”☆13Jul 25, 2022Updated 3 years ago
- 使用HMM、N-Gram、BiLSTM、Bert等模型对中文语料分词并比较结果☆16Jul 2, 2022Updated 3 years ago
- ☆12Aug 14, 2018Updated 7 years ago
- NASH 2021 project... this may or may not end up working 🤷♂️☆12Dec 19, 2021Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Using Kaldi x-vector method to train speaker recognition model on aishell database.☆17Aug 19, 2021Updated 4 years ago
- ☆18Jun 14, 2025Updated 10 months ago
- Transformer-based online speech recognition system with TensorFlow 2☆26Jan 22, 2021Updated 5 years ago
- ☆34Nov 18, 2025Updated 5 months ago
- [AAAI'26] PyTorch code for our paper "QuantVSR: Low-Bit Post-Training Quantization for Real-World Video Super-Resolution"☆32Jan 29, 2026Updated 3 months ago
- Speech Enhancement Generative Adversarial Network☆21May 26, 2020Updated 5 years ago
- Kaggle histopathologic cancer detection (playground) competition solution☆11Apr 25, 2019Updated 7 years ago