oneapi-src / voice-data-generationLinks

AI Starter Kit for Synthetic Voice and Audio Generation using Intel® Extension for Pytorch

☆2

Alternatives and similar repositories for voice-data-generation

Users that are interested in voice-data-generation are comparing it to the libraries listed below

Sorting:

kyegomez / MM1
PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"
☆24Updated 2 weeks ago
bhimrazy / chat-with-qwen2-vl
Chat with Qwen2-VL. Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
☆10Updated 10 months ago
yhenon / llm-face-vision
Benchmarking vision language vision on face tasks
☆14Updated 3 months ago
roboflow / cvevals
Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model…
☆36Updated last year
mistralai / TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain…
☆12Updated last year
ictorv / Large-Language-Pretraining
Building large language foundational model
☆9Updated 4 months ago
facebookresearch / synlm
Code for paper: "Privately generating tabular data using language models".
☆15Updated 2 years ago
fabridigua / LogicGamesSolver
A Python tool to solve logic games with AI, Deep Learning and Computer Vision
☆17Updated 4 years ago
deep-diver / LoRA-deployment
LoRA fine-tuned Stable Diffusion Deployment
☆31Updated 2 years ago
huggingface / pixparse
Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data
☆21Updated 11 months ago
lucasjinreal / wnnx_models
Various test models in WNNX format. It can view with `pip install wnetron && wnetron`
☆12Updated 3 years ago
Rishit-dagli / Compositional-Attention
An implementation of Compositional Attention: Disentangling Search and Retrieval by MILA
☆14Updated 3 years ago
facebookresearch / MultiModalExplorer
Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll…
☆27Updated last year
Vedant-S / MLOps-Project
Projects completed under LinuxWorld Informatics Ltd. - MLOps Training.
☆12Updated 4 years ago
aniketmaurya / stable_diffusion_inference
Simple and easy stable diffusion inference with LightningModule on GPU, CPU and MPS (Possibly all devices supported by Lightning).
☆17Updated last year
facebookresearch / tce
Library for the Test-based Calibration Error (TCE) metric to quantify the degree to classifier calibration.
☆13Updated last year
facebookresearch / dmae_st
Directed masked autoencoders
☆14Updated 2 years ago
sayakpaul / parameter-ensemble-differential-evolution
Shows how to do parameter ensembling using differential evolution.
☆10Updated 3 years ago
umass-ml4ed / prompt_distractor_generation_NAACL
Official repo for the paper "Exploring Automated Distractor Generation for Math Multiple-choice Questions via Large Language Models" at N…
☆8Updated 5 months ago
Netflix / clove
☆13Updated 10 months ago
jquesnelle / ctranslate2-rs
Rust bindings for CTranslate2
☆14Updated 2 years ago
Zyphra / zcookbook
Training hybrid models for dummies.
☆25Updated 6 months ago
philschmid / huggingface-inferentia2-samples
☆10Updated last year
ternaus / base64ToImageConverters
Library for converting from RGB / GrayScale image to base64 and back.
☆19Updated 2 years ago
crypdick / timm-lr-scheduler-explorer
A dashboard for exploring timm learning rate schedulers
☆19Updated 7 months ago
sail-sg / TEC
☆16Updated 2 years ago
harrytea / TGDoc
"Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023
☆14Updated 7 months ago
hsm207 / rasa_kg
How to integrate a knowledge graph into a chatbot to do entity resolution
☆7Updated 3 years ago
joheras / Lecturas
☆19Updated this week
NVIDIA-Merlin / core
Core Utilities for NVIDIA Merlin
☆19Updated 11 months ago