oneapi-src / voice-data-generation
AI Starter Kit for Synthetic Voice and Audio Generation using Intel® Extension for PyTorch
☆2 Updated last year
Alternatives and similar repositories for voice-data-generation:
Users who are interested in voice-data-generation are comparing it to the libraries listed below
- PyTorch implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training" ☆23 Updated this week
- A framework for evaluating the effectiveness of chain-of-thought reasoning in language models. ☆16 Updated 2 months ago
- TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain… ☆11 Updated last year
- Rust bindings for CTranslate2 ☆14 Updated last year
- ☆13 Updated 7 months ago
- Chat with Qwen2-VL. Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud. ☆10 Updated 7 months ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions. ☆20 Updated 10 months ago
- Building a large language foundation model ☆9 Updated last month
- Training hybrid models for dummies. ☆20 Updated 3 months ago
- ☆11 Updated 4 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆26 Updated 5 months ago
- ☆13 Updated last year
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant … ☆16 Updated last year
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data ☆21 Updated 8 months ago
- The Facial Landmark Preprocessing Toolkit. ☆13 Updated 10 months ago
- Shows how to do parameter ensembling using differential evolution. ☆10 Updated 3 years ago
- DocGenius AI - Generative AI Chatbot for your Documents ☆11 Updated last month
- Smaug-72B topped the Hugging Face LLM leaderboard and it’s the first model with an average score of 80, making it the world’s best open-s… ☆17 Updated last week
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel… ☆13 Updated last year
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre… ☆19 Updated 6 months ago
- ☆9 Updated 2 years ago
- Build relationship graphs using an LLM in a Retrieval-Augmented Generation (RAG) framework with pgvector as a vector database ☆10 Updated last year
- Directed masked autoencoders ☆14 Updated 2 years ago
- ☆19 Updated this week
- ☆10 Updated 10 months ago
- Efficient query encoding for dense retrieval ☆11 Updated 8 months ago
- Benchmarking vision language models on face tasks ☆12 Updated 3 weeks ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite. ☆33 Updated last year
- A library for simplifying fine-tuning with multi-GPU setups in the Hugging Face ecosystem. ☆16 Updated 6 months ago
- ☆25 Updated last year