Code for ACL 2024 findings paper "wav2vec-S: Adapting Pre-trained Speech Models for Streaming"
☆11Apr 20, 2025Updated 11 months ago
Alternatives and similar repositories for wav2vec-S
Users that are interested in wav2vec-S are comparing it to the libraries listed below.
- End-to-end Speech Translation with Stacked Acoustic-and-Textual Encoding☆26Aug 12, 2021Updated 4 years ago
- Source code for ACL 2023 paper "End-to-End Simultaneous Speech Translation with Differentiable Segmentation"☆37Dec 6, 2023Updated 2 years ago
- ☆12Jan 9, 2024Updated 2 years ago
- AI for generals.io in Typescript☆14Feb 1, 2017Updated 9 years ago
- [ICML2025] The official implementation of "C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Gene…☆43May 3, 2025Updated 10 months ago
- Code for EMNLP 2022 main conference paper "Information-Transport-based Policy for Simultaneous Translation"☆13Nov 3, 2022Updated 3 years ago
- ☆18Jun 3, 2024Updated last year
- ☆14Apr 16, 2024Updated last year
- A fast CPU-first video/audio transcriber for generating caption files with Whisper and CTranslate2, hosted on Hugging Face Spaces.☆11Updated this week
- A repository dedicated to pre-trained acoustic models of Hong Kong Cantonese and Cantonese forced alignment.☆25Nov 14, 2024Updated last year
- PyTorch implementation of paper "MT-ORL: Multi-Task Occlusion Relationship Learning" (ICCV 2021)☆19Oct 17, 2021Updated 4 years ago
- [ICCV2023] PyTorch implementation of "Spatial-Aware Token for Weakly Supervised Object Localization".☆23Oct 24, 2023Updated 2 years ago
- [IJCV] PyTorch implementation of "Background Activation Suppression for Weakly Supervised Object Localization and Semantic Segmentation"☆19Oct 25, 2023Updated 2 years ago
- [ACL 2024] An easily extensible framework for simultaneous, text-to-text neural machine translation (SimulMT) for LLMs.☆18Apr 21, 2025Updated 11 months ago
- ☆19Jul 2, 2022Updated 3 years ago
- [CVPR 2026] Thinking with Programming Vision: Towards a Unified View for Thinking with Images☆68Jan 23, 2026Updated 2 months ago
- YOLOv5 model pruning☆21Mar 6, 2021Updated 5 years ago
- An image caption dataset built from images on www.dpchallenge.com.☆20Dec 12, 2019Updated 6 years ago
- ☆34Mar 25, 2023Updated 3 years ago
- [FCS'24] LVLM Safety paper☆19Jan 4, 2025Updated last year
- This repository implements our EMNLP 2022 research paper A Dataset for Hyper-Relational Extraction and a Cube-Filling Approach.☆28Dec 13, 2022Updated 3 years ago
- [NeurIPS 2022] MorphTE: Injecting Morphology in Tensorized Embeddings☆17Oct 29, 2022Updated 3 years ago
- ☆14Nov 16, 2022Updated 3 years ago
- A mod that injects MGL and patches Minecraft to work with it.☆12Apr 10, 2024Updated last year
- Scripts for generating Darknet YOLO training data using blender animations☆22Mar 2, 2019Updated 7 years ago
- A Cantonese-English translator based on prompt engineering☆12Sep 19, 2023Updated 2 years ago
- PITS (Chinese, Japanese, English, Korean)☆12Mar 14, 2023Updated 3 years ago
- Implementation of "SpecRNet: Towards Faster and More Accessible Audio DeepFake Detection" paper☆39Mar 21, 2023Updated 3 years ago
- Speech synthesis from scratch☆11Nov 28, 2023Updated 2 years ago
- Code and datasets of TPAMI 2022 paper "OPOM: Customized Invisible Cloak towards Face Privacy Protection"☆22May 13, 2022Updated 3 years ago
- Source code for ICLR 2023 spotlight paper "Hidden Markov Transformer for Simultaneous Machine Translation"☆24Dec 11, 2023Updated 2 years ago
- Implementation of Baseline for Scene Text-to-Scene Text Translation☆19Mar 30, 2025Updated last year
- Turn any Windows precision touchpad into a touchscreen.☆12Oct 21, 2018Updated 7 years ago
- A shader that can run on the Minecraft Java Edition For Phone project, which uses GL4ES. This repository contains source code for iOS/i…☆14Aug 13, 2023Updated 2 years ago
- SimulEval: A General Evaluation Toolkit for Simultaneous Translation☆124Sep 13, 2024Updated last year
- A Blender pipeline for generating synthetic images of production lines☆29Aug 13, 2023Updated 2 years ago
- Manjaro ISO for Apple T2-based devices☆14Dec 25, 2022Updated 3 years ago
- Translator written entirely in vanilla Python that can translate into: Simplified Mandarin Chinese, Traditional Mandarin Chinese, Chinese …☆15May 28, 2023Updated 2 years ago
- speaker-disentangled speech linguistic content quantizer☆24Mar 19, 2025Updated last year