bzhangGo/st_from_scratch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/bzhangGo/st_from_scratch)

bzhangGo / st_from_scratch

Revisiting End-to-End Speech-to-Text Translation From Scratch

☆13

Alternatives and similar repositories for st_from_scratch

Users that are interested in st_from_scratch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lucidrains / insertion-deletion-ddpm
View on GitHub
Implementation of Insertion-deletion Denoising Diffusion Probabilistic Models
☆30May 31, 2022Updated 4 years ago
babe269 / performant
View on GitHub
A toolset for easy formant extraction and visualization from wav files and TTS models
☆33Sep 2, 2022Updated 3 years ago
lakmalz / ColouringImageFloodFill
View on GitHub
Android - colouring images using android native development kit (NDK) c++.using algorithm is floodfill algorithm
☆24Oct 18, 2023Updated 2 years ago
shuo-git / VecConstNMT
View on GitHub
☆25Oct 22, 2022Updated 3 years ago
raj-sutariya / gujarati_speech_recognition
View on GitHub
Offline speech recognition for Gujarati Language.
☆22Dec 20, 2022Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
huybery / GDPnet
View on GitHub
GDPnet: "Geometry-guided Dense Perspective Network for Speech-Driven Facial Animation." (TVCG 2021)
☆11Nov 21, 2021Updated 4 years ago
zjumml / DiffSinger
View on GitHub
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
☆10Mar 8, 2022Updated 4 years ago
salu133445 / bach-violin-dataset
View on GitHub
A collection of high-quality public recordings of Bach's sonatas and partitas for solo violin (BWV 1001–1006)
☆39Feb 19, 2022Updated 4 years ago
fengpeng-yue / speech-to-speech-translation
View on GitHub
☆25Feb 12, 2023Updated 3 years ago
Shivam0712 / End-to-End_Speech-to-Text_Translation
View on GitHub
An end-to-end system which makes use of a recurrent encoder-decoder deep neural network to translate speech from the Hindi (Fourth most s…
☆17Jul 14, 2019Updated 7 years ago
inspire-group / proxy-distributions
View on GitHub
[ICLR 2022 official code] Robust Learning Meets Generative Models: Can Proxy Distributions Improve Adversarial Robustness?
☆29Mar 15, 2022Updated 4 years ago
salu133445 / deepperformer
View on GitHub
Deep Performer: Score-to-audio music performance synthesis
☆47Jun 26, 2023Updated 3 years ago
CodeChefMUST / Algorithms
View on GitHub
A repository by Codechef@MUST for data structures and algorithms
☆20Oct 22, 2022Updated 3 years ago
eastonYi / Unsupervised-ASR
View on GitHub
unsupervised ASR (mainly phone classifier) using EODM and GAN
☆12Oct 22, 2020Updated 5 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Allen-lz / audio2face_pytorch
View on GitHub
☆12Aug 15, 2022Updated 3 years ago
joshua-decoder / fisher-callhome-corpus
View on GitHub
The Fisher and CALLHOME Spanish–English Speech Translation Corpus
☆41Feb 10, 2022Updated 4 years ago
facebookresearch / evaluation-of-nmt-bt
View on GitHub
This repository contains additional reference translations for the WMT'14 En-De (newstest2014) and WMT'19 En-Ru (newstest2019) test sets …
☆15Aug 31, 2021Updated 4 years ago
megagonlabs / holobench
View on GitHub
🫧 Code for Holistic Reasoning with Long-Context LMs: A Benchmark for Database Operations on Massive Textual Data (Maekawa*, Iso* et al.…
☆12Feb 25, 2025Updated last year
Glaciohound / Chimera-ST
View on GitHub
A PyTorch implementation of paper "Learning Shared Semantic Space for Speech-to-Text Translation", ACL (Findings) 2021
☆47Feb 21, 2022Updated 4 years ago
tts-tutorial / ijcai2021
View on GitHub
☆13Apr 4, 2023Updated 3 years ago
allenai / natural-perturbations
View on GitHub
Natural Perturbation for Robust Question Answering
☆12Apr 7, 2020Updated 6 years ago
ketranm / sa-nmt
View on GitHub
structured attention encoder
☆13Jun 6, 2018Updated 8 years ago
LouisBearing / UnconditionalHeadMotion
View on GitHub
Code & demo for the animation of still facial landmarks from an initial pose.
☆15Jan 19, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
thunlp / VERNet
View on GitHub
Source codes of Neural Quality Estimation with Multiple Hypotheses for Grammatical Error Correction
☆42Jul 2, 2021Updated 5 years ago
sellerskyle / audio-effect-suite
View on GitHub
☆10Jul 22, 2021Updated 4 years ago
maum-ai / phaseaug
View on GitHub
ICASSP 2023 Accepted
☆191May 6, 2024Updated 2 years ago
yerfor / SyntaSpeech
View on GitHub
SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code
☆201Sep 4, 2022Updated 3 years ago
sofiaherrero / lime-ner
View on GitHub
lime-ner: extending LIME for Named Entity Recognition
☆10Aug 15, 2018Updated 7 years ago
Dianezzy / ParaLip
View on GitHub
Parallel and High-Fidelity Text-to-Lip Generation; AAAI 2022 ; Official code
☆109May 1, 2022Updated 4 years ago
shubham16g / SimpleWallpaperApp-Android
View on GitHub
A simple demo wallpaper app which contains wallpaper links (json file) in assets folder.
☆32Jul 30, 2023Updated 2 years ago
SpeechResearch / speechresearch.github.io
View on GitHub
☆44Jun 10, 2024Updated 2 years ago
beat2022dataset / beat
View on GitHub
☆13Mar 30, 2022Updated 4 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
THUNLP-MT / DirectQuote
View on GitHub
A Dataset for Direct Quotation Extraction and Attribution in News Articles.
☆14Sep 28, 2021Updated 4 years ago
CPJKU / composer_concept
View on GitHub
Supervised and unsupervised Concept-based explanation of pretrained music classifiers
☆12Jul 27, 2023Updated 2 years ago
shingt / FuzzyCMeans
View on GitHub
Fuzzy C-Means Clustering implementation using C++ and OpenCV interface.
☆15Feb 6, 2016Updated 10 years ago
EricLee0224 / McADTR
View on GitHub
[AIR-DISCOVER Summer Research] Multi-class Anomaly Detection Transformer with Heterogenous Knowledge Distillation
☆14Nov 11, 2024Updated last year
AwalkZY / CPN
View on GitHub
Code for CVPR2021 Paper “Cascaded Prediction Network via Segment Tree for Temporal Video Grounding”
☆10Apr 3, 2022Updated 4 years ago
hallogameboy / QDS-Transformer
View on GitHub
☆16Sep 28, 2020Updated 5 years ago
Rahulsinghcreator / extraxted
View on GitHub
☆32Feb 29, 2024Updated 2 years ago