Code for the ACL 2024 Findings paper "wav2vec-S: Adapting Pre-trained Speech Models for Streaming"
☆12Apr 21, 2026Updated 2 weeks ago
Alternatives and similar repositories for wav2vec-S
Users that are interested in wav2vec-S are comparing it to the libraries listed below.
- End-to-end Speech Translation with Stacked Acoustic-and-Textual Encoding☆26Aug 12, 2021Updated 4 years ago
- Source code for ACL 2023 paper "End-to-End Simultaneous Speech Translation with Differentiable Segmentation"☆37Dec 6, 2023Updated 2 years ago
- ☆13Jan 9, 2024Updated 2 years ago
- AI for generals.io in Typescript☆13Feb 1, 2017Updated 9 years ago
- [ICML2025] The official implementation of "C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Gene…☆44May 3, 2025Updated last year
- Code for EMNLP 2022 main conference paper "Information-Transport-based Policy for Simultaneous Translation"☆13Nov 3, 2022Updated 3 years ago
- ☆18Jun 3, 2024Updated last year
- ☆14Apr 16, 2024Updated 2 years ago
- A fast CPU-first video/audio transcriber for generating caption files with Whisper and CTranslate2, hosted on Hugging Face Spaces.☆11Updated this week
- This is a repository dedicated to pre-trained acoustic models of Hong Kong Cantonese and Cantonese forced alignment.☆27Nov 14, 2024Updated last year
- PyTorch implementation of paper "MT-ORL: Multi-Task Occlusion Relationship Learning" (ICCV 2021)☆19Oct 17, 2021Updated 4 years ago
- [ICCV2023] PyTorch implementation of "Spatial-Aware Token for Weakly Supervised Object Localization".☆23Oct 24, 2023Updated 2 years ago
- [IJCV] PyTorch implementation of "Background Activation Suppression for Weakly Supervised Object Localization and Semantic Segmentation"☆19Oct 25, 2023Updated 2 years ago
- [ACL 2024] An easily extensible framework for simultaneous, text-to-text neural machine translation (SimulMT) for LLMs.☆18Apr 21, 2025Updated last year
- ☆19Jul 2, 2022Updated 3 years ago
- [CVPR 2026] Thinking with Programming Vision: Towards a Unified View for Thinking with Images☆69Jan 23, 2026Updated 3 months ago
- YOLOv5 model pruning☆21Mar 6, 2021Updated 5 years ago
- An image caption dataset about images from www.dpchallenge.com.☆20Dec 12, 2019Updated 6 years ago
- ☆34Mar 25, 2023Updated 3 years ago
- [FCS'24] LVLM Safety paper☆19Jan 4, 2025Updated last year
- This repository implements our EMNLP 2022 research paper A Dataset for Hyper-Relational Extraction and a Cube-Filling Approach.☆28Dec 13, 2022Updated 3 years ago
- [NeurIPS 2022]MorphTE: Injecting Morphology in Tensorized Embeddings☆17Oct 29, 2022Updated 3 years ago
- ☆14Nov 16, 2022Updated 3 years ago
- A mod that injects MGL and patches Minecraft to work with it.☆12Apr 10, 2024Updated 2 years ago
- Scripts for generating Darknet YOLO training data using blender animations☆22Mar 2, 2019Updated 7 years ago
- A Cantonese-English translator based on prompt engineering☆12Sep 19, 2023Updated 2 years ago
- PITS (Chinese, Japanese, English, Korean)☆12Mar 14, 2023Updated 3 years ago
- Implementation of "SpecRNet: Towards Faster and More Accessible Audio DeepFake Detection" paper☆39Mar 21, 2023Updated 3 years ago
- Speech synthesis from scratch☆11Nov 28, 2023Updated 2 years ago
- Code and datasets of TPAMI 2022 paper《OPOM: Customized Invisible Cloak towards Face Privacy Protection》☆22May 13, 2022Updated 3 years ago
- Source code for ICLR 2023 spotlight paper "Hidden Markov Transformer for Simultaneous Machine Translation"☆24Dec 11, 2023Updated 2 years ago
- Implementation of Baseline for Scene Text-to-Scene Text Translation☆19Mar 30, 2025Updated last year
- Turn any Windows precision touchpad into a touchscreen.☆12Oct 21, 2018Updated 7 years ago
- This is a shader that can run on the Minecraft Java Edition For Phone project, which uses GL4ES. This repository contains source code for iOS/i…☆14Aug 13, 2023Updated 2 years ago
- SimulEval: A General Evaluation Toolkit for Simultaneous Translation☆123Sep 13, 2024Updated last year
- A Blender pipeline for generating synthetic images of production lines☆30Aug 13, 2023Updated 2 years ago
- Manjaro ISO for Apple T2-based devices☆15Dec 25, 2022Updated 3 years ago
- Translator made fully in vanilla Python that is able to translate in: Simplified Mandarin Chinese, Traditional Mandarin Chinese, Chinese …☆15May 28, 2023Updated 2 years ago
- speaker-disentangled speech linguistic content quantizer☆25Mar 19, 2025Updated last year