A Silent Speech Recognizer Augmented with an Independent Repair Model
☆21Oct 17, 2023Updated 2 years ago
Alternatives and similar repositories for LipType
Users that are interested in LipType are comparing it to the libraries listed below.
- The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation”☆15Jan 3, 2025Updated last year
- Companion code for Awe the Audience: How the Narrative Trajectories Affect Audience Perception in Public Speaking☆14Jan 6, 2018Updated 8 years ago
- ☆103Jun 30, 2020Updated 5 years ago
- Fast instruction tuning with Llama2☆11Apr 8, 2024Updated last year
- An unofficial (PyTorch) implementation for the paper Deep Lip Reading: A comparison of models and an online application.☆10May 13, 2020Updated 5 years ago
- A PyTorch implementation (with batch and channel support) of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech…☆12Jul 24, 2024Updated last year
- ☆14Jul 27, 2022Updated 3 years ago
- Audio-Visual Speech Recognition☆21Jul 7, 2025Updated 8 months ago
- The speaker-labeled information of LRW dataset, which is the outcome of the paper "Speaker-adaptive Lip Reading with User-dependent Paddi…☆10Oct 12, 2023Updated 2 years ago
- Implementation for the Paper 'Self-Attention with Relative Position Representations' Alongside a Comprehensive Demonstration.☆10Mar 12, 2026Updated 2 weeks ago
- This repository contains the speaker labeled information of VoxCeleb2 and LRS3 audio-visual datasets. (AAAI 2025)☆13Sep 6, 2024Updated last year
- Transformer-based autoregressive variational autoencoder☆12Feb 10, 2020Updated 6 years ago
- Anki add-on that adds Pinyin and Zhuyin readings above Chinese characters in any field.☆12Sep 23, 2025Updated 6 months ago
- Dual cross modality attention audio-visual speech recognition model based on vgg transformer with hybrid CTC/attention architecture using…☆14Jul 2, 2020Updated 5 years ago
- Comparing performance of different InfoNCE type losses used in contrastive learning.☆14Jun 12, 2024Updated last year
- (SLT 2024) Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition☆13Oct 22, 2024Updated last year
- (ICLR 2025) Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation☆15Apr 29, 2025Updated 11 months ago
- ☆13Mar 10, 2026Updated 2 weeks ago
- Automatic Speech Recognition (ASR) model QuartzNet trained on English CommonVoice. In PyTorch with CTC loss and beam search.☆16Nov 5, 2020Updated 5 years ago
- WildVSR☆21Dec 13, 2023Updated 2 years ago
- ☆26Aug 7, 2025Updated 7 months ago
- A program that performs simple mouse control with hand gestures using MediaPipe.☆12Mar 17, 2021Updated 5 years ago
- ☆14Jan 28, 2019Updated 7 years ago
- A sample that converts the MobileSAM encoder/decoder to ONNX and runs inference☆12Apr 11, 2024Updated last year
- Word Error Rate Estimation☆16Aug 25, 2020Updated 5 years ago
- Converts CLIP models to ONNX☆11Jan 17, 2023Updated 3 years ago
- A YouTube input node that runs in Image-Processing-Node-Editor☆13Jul 12, 2025Updated 8 months ago
- A program that applies a perspective transform to map coordinates specified by mouse clicks onto a rectangle.☆10Jul 9, 2020Updated 5 years ago
- ☆12Jun 22, 2020Updated 5 years ago
- TensorFlowLiteNet allows to use TensorFlowLite from C#.☆11Apr 14, 2021Updated 4 years ago
- ☆10May 12, 2018Updated 7 years ago
- A sample of line-art extraction using the XDoG (Extended Difference of Gaussians) algorithm.☆15Jan 28, 2021Updated 5 years ago
- Segment Anything Model (SAM) interactive demo with OpenVINO☆13Jun 5, 2024Updated last year
- OpenCV Sample Projects in Rust☆12Nov 27, 2021Updated 4 years ago
- Chinese inverse text normalization (Chinese ITN), i.e., converting Chinese numerals in text to Arabic numerals.☆26Jan 8, 2026Updated 2 months ago
- Sample programs for QR code detection with OpenCV. Four samples: QRCodeDetector (detectAndDecode, detectAndDecodeMulti, detectAndDecodeCurved) and WeChatQRCode (detectAndDecode)…☆10Jun 16, 2022Updated 3 years ago
- A node-editor-based image processing app intended for verifying and comparing processing steps☆11Mar 5, 2023Updated 3 years ago
- A sample that trains NanoDet on Google Colaboratory and exports an ONNX-format file…☆13Aug 4, 2022Updated 3 years ago
- A collection of samples presented in the PyCon mini Tokai 2024 talk "Trying Out VLMs on Google Colaboratory"☆12Nov 15, 2024Updated last year