AI-Hypercomputer / torchprime
torchprime is a reference model implementation for PyTorch on TPU.
☆15Updated this week
Alternatives and similar repositories for torchprime:
Users that are interested in torchprime are comparing it to the libraries listed below
- (WIP)long form speech generatoins☆31Updated 3 weeks ago
- ☆27Updated this week
- ☆13Updated last week
- A native-PyTorch library for large scale M-LLM (text/audio) training with tp/cp/dp/pp.☆52Updated this week
- We introduce the LLAMA1 Test Set, a comprehensive open-domain world knowledge QA dataset for evaluating question-answering systems. We pr…☆18Updated last year
- This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the …☆49Updated 4 months ago
- ☆9Updated 9 months ago
- Simple voice activity detection (VAD) algorithm in Python☆12Updated last year
- Forced alignment decoder for Whisper.☆14Updated last year
- ☆20Updated 6 months ago
- ☆26Updated 2 months ago
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization☆48Updated this week
- A trainer for SNAC (Multi-Scale Neural Audio Codec) has replaced the decoder with Vocos.☆48Updated 5 months ago
- faster inference☆28Updated 3 months ago
- Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models☆51Updated last month
- RWKV-SpeechChat is a real-time dialogue script based on a frozen 3B RWKV model with trained adapters and initial states. Various trained …☆27Updated 3 months ago
- Text-To-Speech for NotebookLM☆29Updated 4 months ago
- PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference"☆59Updated 3 weeks ago
- Wenet speech to text for react native☆10Updated 2 years ago
- Repository for "TESS-2: A Large-Scale, Generalist Diffusion Language Model"☆34Updated 2 months ago
- Implementation of Google's USM speech model in Pytorch☆31Updated 2 weeks ago
- Official implementation of the APSIPA 2022 paper: Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Updated 2 years ago
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆27Updated 9 months ago
- A low-bitrate single-codebook 16 kHz speech codec based on focal modulation☆85Updated 2 months ago
- Streaming Text to Speech Web UI☆18Updated 11 months ago
- WarpRNNT loss ported in Numba CPU/CUDA for Pytorch☆16Updated 3 years ago
- This repository contains prompts & best practices to annotate audio clips with a very high degree of details using Audio-Language-Models☆33Updated 6 months ago
- ☆48Updated 3 weeks ago
- silero-vad pytorch implement☆17Updated 5 months ago
- A spoken version of the textual story cloze benchmark☆16Updated last year