salma71 / text_speechLinks
Using Gradio interface to build UI for converting text to speech
☆13Updated 4 years ago
Alternatives and similar repositories for text_speech
Users that are interested in text_speech are comparing it to the libraries listed below
Sorting:
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆24Updated 2 weeks ago
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…☆33Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆43Updated last year
- ☆13Updated 3 years ago
- ☆14Updated 10 months ago
- bumble bee transformer☆14Updated 4 years ago
- Benchmarks for Business Document Foundation Models☆10Updated last year
- ☆26Updated 6 months ago
- SCREWS: A Modular Framework for Reasoning with Revisions☆27Updated last year
- ☆24Updated 4 years ago
- Multi-Modal Language Modeling with Image, Audio and Text Integration, included multi-images and multi-audio in a single multiturn.☆18Updated last year
- A sample pattern for running CI tests on Modal☆18Updated 4 months ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆15Updated last year
- Implementation of SoundtStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆12Updated 7 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆34Updated last year
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated last year
- [COLM 2024] Early Weight Averaging meets High Learning Rates for LLM Pre-training☆17Updated 10 months ago
- A dashboard for exploring timm learning rate schedulers☆19Updated 9 months ago
- PyTorch reimplementation of the paper "HyperMixer: An MLP-based Green AI Alternative to Transformers" [arXiv 2022].☆17Updated 3 years ago
- Entailment self-training☆25Updated 2 years ago
- Convert any image into a Region Adjacency Graph (RAG)☆12Updated 5 years ago
- Benchmarking algorithms for assessing quality of data labeled by multiple annotators☆32Updated 2 years ago
- ☆29Updated 3 weeks ago
- 🐜🔧 A minimalistic tool to fine-tune your LLMs☆18Updated 2 years ago
- A Data Source for Reasoning Embodied Agents☆19Updated last year
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll…☆27Updated last year
- Load any clip model with a standardized interface☆22Updated last week
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- Rust bindings for CTranslate2☆14Updated 2 years ago
- ☆49Updated 11 months ago