PyTorch Implementation of SimulLR
☆11 · Dec 30, 2021 · Updated 4 years ago
Alternatives and similar repositories for SimulLR
Users who are interested in SimulLR are comparing it to the libraries listed below.
- Project for ZJU-Game-2021 · ☆10 · Sep 20, 2021 · Updated 4 years ago
- ☆11 · Jan 3, 2023 · Updated 3 years ago
- A PyTorch implementation of D3Net. · ☆11 · Aug 8, 2021 · Updated 4 years ago
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge · ☆21 · Jul 25, 2022 · Updated 3 years ago
- Poet: Product-oriented Video Captioner for E-commerce · ☆12 · Sep 21, 2020 · Updated 5 years ago
- ☆135 · Feb 4, 2023 · Updated 3 years ago
- Comprehensive Information Integration Modeling Framework for Video Titling · ☆11 · Aug 27, 2020 · Updated 5 years ago
- A simple app for recording speech datasets. · ☆26 · Jun 27, 2022 · Updated 3 years ago
- PyTorch Implementation of ViT-TTS (EMNLP'23) · ☆11 · Oct 20, 2023 · Updated 2 years ago
- [CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos' · ☆13 · Jun 16, 2024 · Updated last year
- [CVPR2021] Towards Rolling Shutter Correction and Deblurring in Dynamic Scenes · ☆92 · Jun 25, 2023 · Updated 2 years ago
- EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System · ☆15 · Mar 31, 2019 · Updated 6 years ago
- TPSE-GST Tacotron2 · ☆14 · May 1, 2019 · Updated 6 years ago
- DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code · ☆10 · Mar 8, 2022 · Updated 4 years ago
- The source code of the paper "Semantic Enhanced Text-to-SQL Parsing via Iteratively Learning Schema Linking Graph" in KDD2022. · ☆15 · Jan 9, 2023 · Updated 3 years ago
- Multimodal Speech Recognition for phoneme level prediction using Audio-Visual data from TCDTIMIT dataset implementing RNNs with LSTMs for… · ☆15 · Jul 27, 2023 · Updated 2 years ago
- The code repository for "Cross-Modal and Hierarchical Modeling of Video and Text" in PyTorch · ☆16 · Apr 22, 2019 · Updated 6 years ago
- Convert 3D Human Pose to VMD file · ☆14 · Apr 21, 2019 · Updated 6 years ago
- ☆15 · Dec 11, 2021 · Updated 4 years ago
- Official implementation of [PRIMAL: Physically Reactive and Interactive Motor Model for Avatar Learning, ICCV'25] · ☆35 · Oct 31, 2025 · Updated 4 months ago
- Source code and study data for the TOG 2021 paper: Mid-Air Drawing of Curves on 3D Surfaces in Virtual Reality. · ☆23 · Mar 22, 2022 · Updated 4 years ago
- [ECCV2022] Animation from Blur: Multi-modal Blur Decomposition with Motion Guidance · ☆68 · Mar 28, 2024 · Updated last year
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration · ☆34 · Sep 24, 2021 · Updated 4 years ago
- Ego4DSounds: A diverse egocentric dataset with high action-audio correspondence · ☆19 · Jun 14, 2024 · Updated last year
- MICCAI 2013 code - Segmenting Multiple Overlapping Cervical Cells by Joint Level Set · ☆12 · Jun 19, 2013 · Updated 12 years ago
- Repository for the paper: "Birds of a Feather: Capturing Avian Shape Models from Images" · ☆20 · Dec 2, 2022 · Updated 3 years ago
- A lightweight network with a MobileNet backbone for face detection and facial landmark detection. On Windows 10, run the bat batch script directly to perform face detection and facial landmark detection on images, videos, and camera input. · ☆12 · Feb 24, 2020 · Updated 6 years ago
- DeVLBert: Learning Deconfounded Visio-Linguistic Representations · ☆27 · Nov 27, 2022 · Updated 3 years ago
- Official implementation of "EG4D: Explicit Generation of 4D Object without Score Distillation" (ICLR 2025) · ☆36 · Feb 14, 2025 · Updated last year
- An unofficial implementation of the autoregressive vocoder Multiband-WaveRNN. Audio samples at https://rongjiehuang.github.io/Multiband-WaveRNN/ · ☆28 · Feb 12, 2021 · Updated 5 years ago
- ☆13 · Jun 22, 2024 · Updated last year
- The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign) · ☆80 · Apr 23, 2025 · Updated 11 months ago
- Repository for the PyOpenGL Project (LaunchPad Mirror) · ☆16 · Jul 9, 2019 · Updated 6 years ago
- A simple character input method based on HMM · ☆22 · Apr 22, 2018 · Updated 7 years ago
- A repository for organizing papers, code, and other resources related to Domain Generalization · ☆27 · Apr 13, 2023 · Updated 2 years ago
- 🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX. · ☆13 · Mar 16, 2023 · Updated 3 years ago
- PyTorch toolkit for streaming speech recognition, speech translation and simultaneous translation based on fairseq. · ☆25 · Oct 3, 2022 · Updated 3 years ago
- List of direct speech-to-speech translation papers. · ☆38 · Jan 31, 2023 · Updated 3 years ago
- The MusicYOLO framework uses the object detection model YOLOX to locate notes in the spectrogram. · ☆16 · Jan 29, 2022 · Updated 4 years ago