ga642381 / SpeechPrompt-v2Links

《SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks》Speech processing with prompting paradigm

☆81

Alternatives and similar repositories for SpeechPrompt-v2

Users that are interested in SpeechPrompt-v2 are comparing it to the libraries listed below

Sorting:

WangHelin1997 / SpeechTasks
This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent spee…
☆76Updated last year
Hertin / WavPrompt
☆37Updated 3 years ago
ga642381 / SpeechPrompt
**Interspeech 2022** 《SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks》Speec…
☆102Updated 3 months ago
mutiann / speech_rankings
A CSRankings-like index for speech researchers
☆34Updated 9 months ago
HarunoriKawano / BEST-RQ
Implementation of the paper "Self-supervised Learning with Random-projection Quantizer for Speech Recognition" in Pytorch.
☆82Updated 2 years ago
sinhat98 / adapter-wavlm
☆43Updated 2 years ago
yangdongchao / LLM-Codec
The open source code for LLM-Codec
☆137Updated 11 months ago
jishengpeng / WavReward
WavReward: Spoken Dialogue Models With Generalist Reward Evaluators
☆50Updated 2 months ago
ga642381 / SpeechGen
《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》
☆74Updated 2 years ago
DanielLin94144 / Full-Duplex-Bench
A benchmark to evaluate full-duplex spoken dialogue models on pause handling, backchanneling, turn-taking, and user interruptions.
☆54Updated last month
ga642381 / Speech-Prompts-Adapters
This Repository surveys the paper focusing on Prompting and Adapters for Speech Processing.
☆111Updated 2 years ago
yanghaha0908 / FastHuBERT
Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning
☆94Updated 8 months ago
HuangZiliAndy / SSL_for_multitalker
ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS
☆30Updated 2 years ago
richardbaihe / a3t
Code for paper A3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing
☆88Updated 11 months ago
tomasJwYU / AutoPrepDemo
AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data
☆31Updated last year
mct10 / RepCodec
Models and code for RepCodec: A Speech Representation Codec for Speech Tokenization
☆181Updated last year
Liangzheng-ZL / BEdit-TTS
Speech samples and code of BEdit-TTS
☆33Updated last year
jzmzhong / Automatic-Prosody-Annotator-with-SSWP-CLAP
An automatic prosodic boundary annotation tool for Text-to-Speech Synthesis (TTS).
☆49Updated last year
karthikbhamidipati / multi-task-speech-classification
Multi-Task Speech classification of accent and gender of an english speaker on Mozilla's common voice dataset
☆27Updated 2 months ago
walker-hyf / NCSSD
Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)
☆61Updated 9 months ago
xinjli / alqalign
multilingual speech aligner
☆75Updated last year
ajd12342 / paraspeechcaps
Codebase for 'Scaling Rich Style-Prompted Text-to-Speech Datasets'
☆127Updated 4 months ago
sky1456723 / Pytorch-MBNet
A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK
☆60Updated 3 years ago
Mikxox / EnCodec_Trainer
☆61Updated 2 years ago
mct10 / CoBERT
Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning
☆47Updated last year
nii-yamagishilab / VCC2020-database
☆52Updated 4 years ago
haoxiangsnr / llm-tse
Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)
☆42Updated last year
cpdu / unicats
☆63Updated last year
AI-Unicamp / TTS-Objective-Metrics
Objective metrics used in several text-to-speech (TTS) papers.
☆49Updated last month
archiki / Robust-E2E-ASR
This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…
☆48Updated 7 months ago