alyssabedard / Hanzi2Pinyin
Anki add-on that adds Pinyin and Zhuyin readings above Chinese characters in any field.
☆12Sep 23, 2025Updated 4 months ago
Alternatives and similar repositories for Hanzi2Pinyin
Users who are interested in Hanzi2Pinyin are comparing it to the repositories listed below.
- The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation”☆15Jan 3, 2025Updated last year
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"☆14Feb 24, 2025Updated 11 months ago
- ☆16Apr 15, 2025Updated 10 months ago
- This repository contains the speaker labeled information of VoxCeleb2 and LRS3 audio-visual datasets. (AAAI 2025)☆12Sep 6, 2024Updated last year
- Companion code for Awe the Audience: How the Narrative Trajectories Affect Audience Perception in Public Speaking☆14Jan 6, 2018Updated 8 years ago
- ☆11Sep 1, 2024Updated last year
- Learning materials for large language models☆36Oct 11, 2025Updated 4 months ago
- Audio-Visual Speech Recognition☆19Jul 7, 2025Updated 7 months ago
- The speaker-labeled information of LRW dataset, which is the outcome of the paper "Speaker-adaptive Lip Reading with User-dependent Paddi…☆10Oct 12, 2023Updated 2 years ago
- ☆14Jul 27, 2022Updated 3 years ago
- Comparing performance of different InfoNCE type losses used in contrastive learning.☆14Jun 12, 2024Updated last year
- An unofficial (PyTorch) implementation for the paper Deep Lip Reading: A comparison of models and an online application.☆10May 13, 2020Updated 5 years ago
- (ICLR 2025) Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation☆15Apr 29, 2025Updated 9 months ago
- A structured parsing technique for NER☆15May 26, 2023Updated 2 years ago
- Dual cross modality attention audio-visual speech recognition model based on vgg transformer with hybrid CTC/attention architecture using…☆14Jul 2, 2020Updated 5 years ago
- Deep Variational Information Bottleneck (DVIB) in PyTorch.☆10Apr 25, 2020Updated 5 years ago
- A PyTorch implementation (supports batch and channel) of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech…☆12Jul 24, 2024Updated last year
- Huggingface Implementation of AV-HuBERT on the MuAViC Dataset☆17Mar 6, 2025Updated 11 months ago
- (SLT 2024) Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition☆13Oct 22, 2024Updated last year
- ☆25Aug 7, 2025Updated 6 months ago
- Fast instruction tuning with Llama2☆11Apr 8, 2024Updated last year
- WildVSR☆21Dec 13, 2023Updated 2 years ago
- Curating a collection of Mandarin Chinese vocabulary, idioms (成语), and characters (汉字). HSK 3.0, RSH, and other frequency lists.☆17Jan 18, 2024Updated 2 years ago
- Transformer-based autoregressive variational autoencoder☆12Feb 10, 2020Updated 6 years ago
- ☆19Jun 4, 2024Updated last year
- ☆14Oct 7, 2021Updated 4 years ago
- A Silent Speech Recognizer Augmented with an Independent Repair Model☆20Oct 17, 2023Updated 2 years ago
- Word Error Rate Estimation☆16Aug 25, 2020Updated 5 years ago
- E-commerce review sentiment analysis platform☆15Jan 16, 2024Updated 2 years ago
- Automatic Speech Recognition (ASR) model QuartzNet trained on English CommonVoice. In PyTorch with CTC loss and beam search.☆16Nov 5, 2020Updated 5 years ago
- ☆14Jan 28, 2019Updated 7 years ago
- ☆17Jun 11, 2024Updated last year
- Efficient Training for Multilingual Visual Speech Recognition: Pre-training with Discretized Visual Speech Representation (ACM MM 2024)☆20Mar 17, 2025Updated 10 months ago
- FreeGrep is a BSD-licensed implementation of grep(1)☆18Nov 10, 2019Updated 6 years ago
- A Jekyll version of the "Strata" theme by HTML5 UP.☆15Dec 10, 2022Updated 3 years ago
- Chinese inverse text normalization (Chinese ITN), i.e., converting Chinese numerals in text to Arabic numerals.☆24Jan 8, 2026Updated last month
- Joint CTC-Attention End-to-end Speech Recognition - PyTorch Implementation (Deep Learning for Human Language Processing Special Project)☆16Nov 22, 2020Updated 5 years ago
- PyTorch implementation of "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scorin…☆20Apr 3, 2024Updated last year
- ☆17Jan 1, 2024Updated 2 years ago