CS224S / LINGUIST285 - Spoken Language Processing
☆24Feb 13, 2020Updated 6 years ago
Alternatives and similar repositories for cs224s
Users that are interested in cs224s are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆27Jul 5, 2018Updated 7 years ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆16Jun 16, 2024Updated last year
- ESLTTS dataset☆16Feb 6, 2025Updated last year
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …☆23May 19, 2026Updated 3 weeks ago
- Forced alignment decoder for Whisper.☆16Mar 13, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Finetuning VITS Efficiently☆33Nov 6, 2023Updated 2 years ago
- it's ASR decoder and make graph project☆33May 26, 2022Updated 4 years ago
- Neural Turing machine for source separation in Tensorflow☆18Aug 16, 2017Updated 8 years ago
- Chu-Lui-Edmonds decoding extracted from TurboParser☆14May 16, 2017Updated 9 years ago
- An implementation of the Wav2Letter Speech-to-Text model using PyTorch.☆14Mar 8, 2023Updated 3 years ago
- 这个工程的目的是从视频中获取语音识别的训练数据,用于训练字幕自动生成☆53Aug 5, 2018Updated 7 years ago
- Improving Perceptual Quality by Phone-Fortified Perceptual Loss using Wasserstein Distance for Speech Enhancement☆82Jun 28, 2021Updated 4 years ago
- ☆47Apr 16, 2023Updated 3 years ago
- ☆10Feb 2, 2019Updated 7 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 👀 VITRina: VIsual Token Representations☆11Jun 15, 2023Updated 2 years ago
- A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis☆45Jul 24, 2023Updated 2 years ago
- 🎵 muse: Music Separation☆11Feb 14, 2024Updated 2 years ago
- 语音识别 语音前端处理 语音合成 语音转换等等语音技术的资料汇总☆23Nov 8, 2019Updated 6 years ago
- CTC decoder with hotwords for ASR.☆36Apr 13, 2025Updated last year
- Notebook from my blog☆15Apr 9, 2017Updated 9 years ago
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models☆11Jan 19, 2024Updated 2 years ago
- Multi-Scale Attention for Audio Question Answering☆28Jul 19, 2023Updated 2 years ago
- This repository contains prompts & best practices to annotate audio clips with a very high degree of details using Audio-Language-Models☆35Oct 13, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- CARMA Streets is a component of CARMA ecosystem, which enables such a coordination among different transportation users. This component p…☆11May 14, 2026Updated 3 weeks ago
- DataFountain 疫情政务问答助手解决方案分享☆16May 2, 2020Updated 6 years ago
- Bilsem Python Dersleri☆10Sep 25, 2024Updated last year
- The repository of the ACCV 2024 paper "FG-CXR: A Radiologist-Aligned Gaze Dataset for Enhancing Interpretability in Chest X-Ray Report Ge…☆11Jul 28, 2025Updated 10 months ago
- Follow Me: Conversation Planning for Target-driven Recommendation Dialogue Systems☆12Aug 1, 2023Updated 2 years ago
- My solutions to coding problems sent by dailyinterviewpro.com implemented in Python☆16Jan 31, 2020Updated 6 years ago
- drone controls☆16Oct 30, 2020Updated 5 years ago
- ☆13Mar 26, 2019Updated 7 years ago
- Official Implementation of MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models☆14Nov 1, 2025Updated 7 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Thompson Sampling based Monte Carlo Tree Search for MDPs and POMDPs☆15Jun 20, 2016Updated 9 years ago
- This is the source code for HDNO: a hierarchical model for task-oriented dialogue system.☆18Dec 7, 2022Updated 3 years ago
- A full Python implementation of the ROUGE metric, especially for Chinese texts processing.☆16Nov 21, 2019Updated 6 years ago
- Vehicle (car) shapes for use with the Tikz LaTeX package.☆18Dec 16, 2023Updated 2 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- PythonRoboticsPaper☆18Sep 3, 2018Updated 7 years ago
- Code for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models☆23Jul 27, 2024Updated last year