List of Large Lanugage Model Papers
☆59Jun 5, 2023Updated 2 years ago
Alternatives and similar repositories for llm-papers
Users that are interested in llm-papers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- List of NN based singal processing papers☆22Jun 5, 2023Updated 2 years ago
- Production first, nn-based on-device signal processing toolkit.☆65May 30, 2023Updated 2 years ago
- ☆29Aug 8, 2024Updated last year
- A streaming audio reader, processor, and writer built on top of soundfile, and PyAV (bindings for FFmpeg)☆38Mar 16, 2026Updated last week
- Whisper in TensorRT-LLM☆17Sep 21, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Experiment with JNI access to some Kaldi functions.☆12Dec 31, 2018Updated 7 years ago
- The baseline system for the ICASSP2024 ICMC-ASR Challenge.☆55Dec 6, 2023Updated 2 years ago
- 基于Android平台以及Mina框架实现图片面对面快传的APP☆11Jan 24, 2017Updated 9 years ago
- This repo augments the scripts in CVTE model (http://kaldi-asr.org/models/m2)☆15May 30, 2019Updated 6 years ago
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated 11 months ago
- AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th…☆11Feb 23, 2024Updated 2 years ago
- Simple implement dilated LSTM, residual LSTM and Attention LSTM (follow the corresponding papers).☆17Dec 26, 2019Updated 6 years ago
- ☆25Nov 23, 2021Updated 4 years ago
- A Weakly Supervised Forced Alignment for disluent speech☆15Nov 12, 2023Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆15Jul 4, 2024Updated last year
- CMU multilingual speech repository☆30Apr 15, 2022Updated 3 years ago
- Official repository for U-SAM (Interspeech 2025)☆26Jun 3, 2025Updated 9 months ago
- ☆13Mar 30, 2023Updated 2 years ago
- it's a train acoustics model code lib☆27May 20, 2020Updated 5 years ago
- it's ASR decoder and make graph project☆33May 26, 2022Updated 3 years ago
- We Speech Transcript based on LLM, in 300 lines of code.☆185Jun 20, 2025Updated 9 months ago
- video cut powered by AI☆24Nov 15, 2022Updated 3 years ago
- Implementation of the contextual biasing for ASR decoding on GPUs without lattice generation. The code supports submission to Interspeech…☆21Sep 25, 2023Updated 2 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- ☆22Jul 8, 2019Updated 6 years ago
- Minimize kaldi nnet3 chain decoder☆45Jan 10, 2020Updated 6 years ago
- Convert WSJ sphere format to waveform and do data simulation.☆16Feb 20, 2020Updated 6 years ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆129Apr 26, 2023Updated 2 years ago
- ☆32Oct 28, 2022Updated 3 years ago
- ☆25Jan 2, 2024Updated 2 years ago
- Streaming Audiotransformers for online Audio tagging☆53Jun 14, 2024Updated last year
- Framework for Detection Evaluation (F4DE) : set of evaluation tools for detection evaluations and for specific NIST-coordinated evaluatio…☆25Jul 6, 2017Updated 8 years ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files☆57Sep 1, 2025Updated 6 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- ☆25Feb 12, 2023Updated 3 years ago
- Conversion of recurrent neural network language models to weighted finite state transducers☆58Jun 1, 2018Updated 7 years ago
- ☆61Jan 31, 2023Updated 3 years ago
- Apply https://github.com/k2-fsa/sherpa-ncnn in live streaming and WebRTC☆20Apr 16, 2023Updated 2 years ago
- Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale☆28Aug 4, 2023Updated 2 years ago
- a guide to grapheme-to-phoneme conversion and phoneme list for ace singing voice synthesis engine☆42Jan 17, 2025Updated last year
- How loud is that file?☆12Sep 3, 2019Updated 6 years ago