LeeYongHyeok / DCM_vgg_transformerView external linksLinks
Dual cross modality attention audio-visual speech recognition model based on vgg transformer with hybrid CTC/attention architecture using fairseq
☆14Jul 2, 2020Updated 5 years ago
Alternatives and similar repositories for DCM_vgg_transformer
Users that are interested in DCM_vgg_transformer are comparing it to the libraries listed below
Sorting:
- Conformer encoder + Transformer decoder with Hybrid CTC/attention☆12Nov 11, 2021Updated 4 years ago
- Multimodal Speech Recognition for phoneme level prediction using Audio-Visual data from TCDTIMIT dataset implementing RNNs with LSTMs for…☆15Jul 27, 2023Updated 2 years ago
- PyTorch implementation of "Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video" (ICCV2021)☆20Apr 11, 2022Updated 3 years ago
- End to End Multiview Lip Reading☆10Jan 26, 2018Updated 8 years ago
- Official implementation of Transpotter, published in BMVC 2021☆16Aug 6, 2022Updated 3 years ago
- Audio-Visual Speech Recognition using Sequence to Sequence Models☆83Jul 10, 2020Updated 5 years ago
- Joint Modelling Histology and Molecular Markers for Glioma Classification☆12Jun 4, 2025Updated 8 months ago
- Audio Visual Speech Recognition☆23Aug 9, 2017Updated 8 years ago
- Transformer-based online speech recognition system with TensorFlow 2☆26Jan 22, 2021Updated 5 years ago
- Implements of CTC, Speech-Transformer and CIF for end-to-end speech recognition with pytorch☆23Jul 28, 2020Updated 5 years ago
- Official codes for the paper "Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech"☆27Feb 22, 2022Updated 3 years ago
- 🎮 Use a Raspberry Pi to control a LoPy over UART☆12Mar 9, 2017Updated 8 years ago
- featselector是一个基于统计分析和模型选择的特征选择器.☆14Mar 4, 2019Updated 6 years ago
- Implementation of Hybrid CTC/Attention Architecture for End-to-End Speech Recognition in pure python and PyTorch☆26Jul 25, 2024Updated last year
- Official code for the paper: MAR: Masked Autoencoders for Efficient Action Recognition☆32Dec 7, 2022Updated 3 years ago
- Philo: uniting modalities☆26Mar 16, 2025Updated 11 months ago
- ☆36Sep 4, 2024Updated last year
- ☆28Feb 2, 2026Updated 2 weeks ago
- 给定一张身份证正、反面,识别身份证上的所有文字信息。☆10Sep 4, 2019Updated 6 years ago
- Comparison of Transfer Learning and CNN Architectures for Medical Image Classification☆13Dec 11, 2024Updated last year
- A Deepfake detector based on hybrid EfficientNet CNN and Vision Transformer archietcture. The model is explainable by rendering a heatma…☆15Mar 16, 2022Updated 3 years ago
- [IEEE TNSRE] Mixture of Experts for EEG-Based Seizure Subtype Classification☆12Aug 20, 2024Updated last year
- This is the codebase for MD-Dose: A diffusion model based on the Mamba for radiation dose prediction☆35Jun 22, 2025Updated 7 months ago
- (WWW'20) Official codes of paper "multimodal deep variational information bottleneck for micro-video popularity prediction".☆46Dec 9, 2021Updated 4 years ago
- Python toolkit for Visual Speech Recognition☆38Jun 10, 2020Updated 5 years ago
- Anki add-on that adds Pinyin and Zhuyin readings above Chinese characters in any field.☆12Sep 23, 2025Updated 4 months ago
- Official repository for "Pre- to Post-Contrast Breast MRI Synthesis for Enhanced Tumour Segmentation"☆12Jan 31, 2024Updated 2 years ago
- HyFormer: Hybrid Transformer and CNN For Pixel-level Multispectral Image Classification☆15Feb 15, 2023Updated 3 years ago
- [ICASSP 2020] Speech Emotion Recognition with Dual-Sequence LSTM Architecture☆12Jan 17, 2025Updated last year
- A Generative Adversarial Network Model Alternative to Animal Studies for Clinical Pathology Assessment☆14Jan 10, 2024Updated 2 years ago
- The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation”☆15Jan 3, 2025Updated last year
- Official code for AL-PINNS: Augmented Lagrangian relaxation method for Physics-Informed Neural Networks☆12Jul 29, 2023Updated 2 years ago
- TransientViT: A novel CNN - Vision Transformer hybrid real/bogus transient classifier for the Kilodegree Automatic Transient Survey☆10Nov 7, 2024Updated last year
- This is the official GDSC repo with all of the source code presented in the video tutorials☆14Jun 27, 2023Updated 2 years ago
- Color Coherence Vector is a powerful color-based image retrieval (Matlab)☆11Feb 27, 2015Updated 10 years ago
- Demo for METSC: A microstructure estimation Transformer inspired by sparse representation for diffusion MRI (MedIA 2023).☆12Nov 13, 2023Updated 2 years ago
- The code of 《M4: Multi-Proxy Multi-Gate Mixture of Experts Network for Multiple Instance Learning in Histopathology Image Analysis》☆14Mar 31, 2025Updated 10 months ago
- Deep Learning Breast MRI Segmentation and Classification☆10Sep 11, 2025Updated 5 months ago
- Implementation of a simple linear regression algorithm in MAMBA☆10Feb 12, 2020Updated 6 years ago