oneapi-src / voice-data-generation
AI Starter Kit for Synthetic Voice and Audio Generation using Intel® Extension for Pytorch
☆2Updated last year
Alternatives and similar repositories for voice-data-generation:
Users that are interested in voice-data-generation are comparing it to the libraries listed below
- TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain…☆11Updated last year
- A library for simplifying fine tuning with multi gpu setups in the Huggingface ecosystem.☆16Updated 5 months ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated last week
- ☆12Updated 7 months ago
- Automatic Test Generator☆12Updated last week
- ☆14Updated 9 months ago
- Building large language foundational model☆9Updated 3 weeks ago
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel…☆13Updated last year
- Shows how to do parameter ensembling using differential evolution.☆10Updated 3 years ago
- A dashboard for exploring timm learning rate schedulers☆19Updated 4 months ago
- Visionner turn raw image data into numpy array, more suitable for deep learning task☆10Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated 4 months ago
- Chat Complex PDF with Tables Using IBM WatsonX, Langchain and LlamaParser.☆12Updated 11 months ago
- Simplifying parsing of large jsonline files in NLP Workflows☆12Updated 3 years ago
- Simple script to re-rank images using OpenAI's CLIP https://github.com/openai/CLIP.☆15Updated 3 years ago
- Simple and easy stable diffusion inference with LightningModule on GPU, CPU and MPS (Possibly all devices supported by Lightning).☆17Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- ☆13Updated last year
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model…☆35Updated last year
- A small python library to run iterators in a separate process☆10Updated last year
- Multimodal Open Source Framework for Conversational Agent Research and Development.☆18Updated last month
- An implementation of Compositional Attention: Disentangling Search and Retrieval by MILA☆14Updated 2 years ago
- ☆11Updated 2 weeks ago
- Scripts for text classification with llama and bert☆13Updated last month
- Minimal, clean code for video/image "patchnization" - a process commonly used in tokenizing visual data for use in a Transformer encoder.…☆11Updated 10 months ago
- Tools for merging pretrained large language models.☆19Updated 9 months ago
- Submission to the inverse scaling prize☆23Updated last year
- Projects completed under LinuxWorld Informatics Ltd. - MLOps Training.☆12Updated 4 years ago
- efficient query encoding for dense retrieval☆11Updated 7 months ago
- Library for converting from RGB / GrayScale image to base64 and back.☆19Updated 2 years ago