robd003/sph2pipe

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/robd003/sph2pipe)

robd003 / sph2pipe

provide SPHERE-formatted output as well as RIFF, AU, AIFF and raw

☆14

Alternatives and similar repositories for sph2pipe

Users that are interested in sph2pipe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

amazon-science / contextual-attention-nlm
View on GitHub
Accompanying code for paper "Attention-Based Contextual Language Model Adaptation for Speech Recognition", submitted to ACL 2021.
☆14Jul 25, 2023Updated 2 years ago
lingjzhu / spoken_sent_embedding
View on GitHub
Unsupervised spoken sentence embeddings
☆14Dec 14, 2022Updated 3 years ago
IPS-LMU / soundChangeR
View on GitHub
soundChangeR: an agent-based model for simulating sound change
☆15Sep 3, 2025Updated 10 months ago
midas-research / speechmix
View on GitHub
☆12Oct 2, 2020Updated 5 years ago
Splend1d / T5lephone
View on GitHub
Code for T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5
☆19Nov 29, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
skit-ai / slu-prosody
View on GitHub
Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…
☆27May 17, 2023Updated 3 years ago
jasonppy / FaST-VGS-Family
View on GitHub
Transformer-based visually grounded speech models
☆19Sep 22, 2022Updated 3 years ago
MashiMaroLjc / TimeExtractor
View on GitHub
针对口语进行时间抽取并标准化
☆13Mar 2, 2020Updated 6 years ago
jasonppy / word-discovery
View on GitHub
Word Discovery in Visually Grounded, Self-Supervised Speech Models
☆27Dec 4, 2023Updated 2 years ago
sony / mmaudiosep
View on GitHub
☆16Apr 30, 2026Updated 2 months ago
KleberMotta / FakeFocus
View on GitHub
Unofficial fork from Universal Split Screen to fake focus on games running in the background
☆25Jan 5, 2021Updated 5 years ago
go-nlp / bm25
View on GitHub
bm25 is a scoring function that helps with information retrieval
☆14Sep 17, 2020Updated 5 years ago
syedecryptr / audio-spectogram-transformer
View on GitHub
Torch implementation of ViT based classifier for Audio classification
☆12May 22, 2022Updated 4 years ago
RicherMans / Dcase2018_pooling
View on GitHub
Repo for our pooling approach on the DCASE2018 task4
☆16Jul 6, 2023Updated 3 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
the-bird-F / GLM-Voice-RAG
View on GitHub
[EMNLP 2025 Findings] A complete cross-modal RAG system for end-to-end speech-to-speech large models, including ASR-based Retrieval and E…
☆31Jul 11, 2025Updated last year
VisualAIKHU / NoPrior_MultiSSL
View on GitHub
Official Repository for "Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge" (CVPR 2024)
☆16Sep 1, 2024Updated last year
kaistmm / TalkNCE
View on GitHub
Official implementation of TalkNCE (ICASSP 2024).
☆18Apr 30, 2025Updated last year
Capino512 / pinyin2hanzi_python
View on GitHub
词、句拼音转汉字、拼音分割、拼音补全、pygame输入中文
☆15Mar 21, 2020Updated 6 years ago
raotnameh / End-to-end-E2E-Named-Entity-Recognition-from-English-Speech
View on GitHub
☆32Dec 2, 2020Updated 5 years ago
jackandsnow / craw_government_files
View on GitHub
crawl the public files of different governments through python 3.
☆15Aug 29, 2019Updated 6 years ago
xhzhao / PyTorch-MPI-DDP-example
View on GitHub
PyTorch-MPI-DDP-example
☆18Mar 21, 2018Updated 8 years ago
kaistmm / V2SFlow
View on GitHub
[ICASSP 2025] V2SFlow: Video-to-Speech Generation with Speech Decomposition and Rectified Flow
☆21Jun 3, 2025Updated last year
wndvlf96 / HAD-ANC
View on GitHub
☆12May 22, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
kaistmm / voxsim_trainer
View on GitHub
[INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset
☆24Sep 29, 2025Updated 9 months ago
h-munakata / Lighthouse-Wrapper-for-Audio-Moment-Retrieval
View on GitHub
☆13Mar 23, 2026Updated 3 months ago
yinruiqing / fsmn
View on GitHub
Feedforward Sequential Memory Networks
☆18Aug 2, 2022Updated 3 years ago
kevin-keraudren / randomferns-python
View on GitHub
Why Random Ferns? Because 10 lines of code [1].
☆22Feb 25, 2016Updated 10 years ago
wutong8023 / SpeechRE
View on GitHub
☆11Nov 11, 2022Updated 3 years ago
kjw11 / CSEnet-ASR
View on GitHub
Cross-Speaker Encoding Network for Multi-talker Speech Recognition
☆12Mar 14, 2025Updated last year
jiangjyjy / Yue-Benchmark
View on GitHub
[NAACL 2025] How Well Do LLMs Handle Cantonese? Benchmarking Cantonese Capabilities of Large Language Models
☆13Jul 2, 2025Updated last year
NICE-FUTURE / tfidf-cosine-text-recommendation
View on GitHub
【Demo】对新闻标题使用TF-IDF向量化和cosine相似度计算完成相似标题推荐
☆14Mar 2, 2020Updated 6 years ago
yuhogun0908 / MISOnet
View on GitHub
Unofficial Multi-microphone complex spectral mapping for utterance-wise and continuous speech separation(MISO-BF-MISO)
☆52Jan 13, 2022Updated 4 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
ewwink / wikipedia-wordlists-extractor
View on GitHub
Extract Unique Word Lists From Wikipedia Database
☆13May 27, 2020Updated 6 years ago
hbredin / DomainAdversarialVoiceActivityDetection
View on GitHub
Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"
☆23Mar 3, 2020Updated 6 years ago
XL2248 / SOV-MAS
View on GitHub
The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"
☆11May 16, 2023Updated 3 years ago
Effort-Hepan / Deep-learning-experiment
View on GitHub
基于LSTM+CNN的自然语言处理，基于单维LSTM、多维LSTM时序预测算法和多元线性回归算法的预测模型
☆11May 8, 2025Updated last year
MengboLi / MS-SENet
View on GitHub
☆11Jul 16, 2024Updated 2 years ago
mengshiY / RCSF
View on GitHub
Code for paper "Cross-Domain Slot Filling as Machine Reading Comprehension" in IJCAI 2021
☆11Aug 24, 2021Updated 4 years ago
MorenoLaQuatra / audioset-download
View on GitHub
This package aims at simplifying the download of the AudioSet dataset.
☆60Jul 17, 2025Updated last year