salma71 / text_speech
Using Gradio interface to build UI for converting text to speech
☆12Updated 4 years ago
Alternatives and similar repositories for text_speech:
Users that are interested in text_speech are comparing it to the libraries listed below
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated this week
- bumble bee transformer☆14Updated 3 years ago
- Companion Repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborations☆14Updated 2 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Updated last year
- NLG Best Practices for Data-Efficient Modeling How to Train Production-Ready Models with Little Data☆10Updated 3 years ago
- Describe the format of image/text datasets☆11Updated 2 years ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated 7 months ago
- The collection of bulding blocks building fine-tunable metric learning models☆32Updated 3 weeks ago
- A package for fine tuning of pretrained NLP transformers using Semi Supervised Learning☆15Updated 3 years ago
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…☆12Updated last year
- Projects completed under LinuxWorld Informatics Ltd. - MLOps Training.☆12Updated 4 years ago
- ☆11Updated 2 years ago
- ☆12Updated 6 months ago
- Simple script to re-rank images using OpenAI's CLIP https://github.com/openai/CLIP.☆15Updated 3 years ago
- Implementation of Metaformer, but in an autoregressive manner☆23Updated 2 years ago
- Unofficial Tensorflow-Keras implementation of Fastformer based on paper [Fastformer: Additive Attention Can Be All You Need](https://arxi…☆13Updated 3 years ago
- Benchmarking algorithms for assessing quality of data labeled by multiple annotators☆32Updated last year
- Directed masked autoencoders☆14Updated 2 years ago
- All my experiments with the various transformers and various transformer frameworks available☆14Updated 3 years ago
- Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents☆23Updated 2 years ago
- ☆28Updated last year
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…☆31Updated 8 months ago
- Code and data for Teddy https://arxiv.org/abs/2001.05171.☆15Updated 2 years ago
- ☆14Updated 3 months ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & it's caption derrived f…☆15Updated 3 years ago
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod…☆15Updated last year
- Code for running the experiments in Deep Subjecthood: Higher Order Grammatical Features in Multilingual BERT☆16Updated last year
- Implementation of SoundtStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆12Updated this week
- Multi-Modal Language Modeling with Image, Audio and Text Integration, included multi-images and multi-audio in a single multiturn.☆17Updated 11 months ago