scliubit / PPT2Video

Generate video with voice narration from PPT/PDF slides

☆10

Alternatives and similar repositories for PPT2Video

Users who are interested in PPT2Video are comparing it to the repositories listed below.

jjihwan / Voice-Cloning
Simple, Unified Repository for Retrieval-based Voice Conversion
☆17 Updated last year
facebookresearch / synlm
Code for paper: "Privately generating tabular data using language models".
☆15 Updated 2 years ago
swarupbehera / awesome-audio-visual-question-answering
A curated list of resources in audio visual question answering and related area. :-)
☆10 Updated 3 weeks ago
airfold / airlang
⚡ From Zero to Monitoring LLMs in 5 minutes ⚡
☆6 Updated last year
YuShi1213 / CRfusionGait_pytorch
This is the code for the "Robust Gait Recognition based on Deep CNNs with Camera and Radar Sensor Fusion".
☆13 Updated 2 years ago
utkarshaditya01 / IR---The-Entertainment-Knowledge-Graph
Information Retrieval project.
☆9 Updated 3 years ago
MHassaanButt / Flight-Delays-Prediction
In this project, I used Decision Tree Learning Model as the main algorithm to build the model. Due to the big amount of flight data, we i…
☆12 Updated 3 years ago
bombom713 / Try-On-Diffusion
☆10 Updated last year
SerCom-KC / cartoon-network-videos
We archive data because we are interested in the diffs. All data is from https://video-api.cartoonnetwork.com. We run the check every min…
☆10 Updated this week
RafayKhattak / LlamaDoc
Taking advantage of LlamaIndex's in-context learning paradigm, LlamaDoc empowers users to input PDF documents and pose any questions rela…
☆14 Updated 2 years ago
JiwanSeo / RAQ-VAE
Rate-Adaptive Quantization: A Multi-Rate Codebook Adaptation for Vector Quantization-based Generative Models
☆13 Updated 5 months ago
TengHu / Interactive-RAG
☆16 Updated last year
Mereep / assistant-gpt
Extensible ChatGPT Frontend to search the web, create files and execute arbitrary commands
☆9 Updated 2 years ago
DongKeon / webrtc-whisper-asr
WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.
☆12 Updated 10 months ago
MLArtist / intent-detection-using-XLM-Roberta
This repository is a comprehensive project that leverages the XLM-Roberta model for intent detection. This repository is a valuable resou…
☆14 Updated last year
gaspardpetit / verbatim
A composition of offline tools to achieve high quality multilingual speech to text transcription
☆19 Updated last month
kabbas570 / Dog-Noseprint-Recognition-and-Localization-
☆11 Updated 4 years ago
anas-rz / specmix-pytorch
A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Features
☆11 Updated 2 years ago
sahilichake / Document-Summarization-App-using-LLM
Document Summarization App using large language model (LLM) and Langchain framework. Used a pre-trained T5 model and its tokenizer from H…
☆13 Updated last year
IBM / RADAR
Code for our NeurIPS2023 accepted paper: RADAR: Robust AI-Text Detection via Adversarial Learning. We tested RADAR on 8 LLMs including Vi…
☆61 Updated 2 months ago
Aaron-system / GPT-Pinecone-Embeddings-PDF-with-1000-page-Contract-Law-Textbook
Vector search with Pinecone and OpenAI to search through contract law textbook. If downloaded, remember to install all dependencies. Refer…
☆13 Updated 2 years ago
Guest400123064 / bbm25-haystack
Simple Haystack in-memory document store alternative that performs incremental indexing and supports SentencePiece tokenizer.
☆17 Updated last year
AI4Bharat / indic-asr-api-backend
Indic-Conformer models for ASR
☆17 Updated last year
daveshap / InformationCompanionChatbot
Experiment for creating a safe companion chatbot (according to OpenAI rules)
☆13 Updated 3 years ago
MatheusSchaly / Online-Courses
☆18 Updated 2 years ago
allen4747 / Ferret
This is the official implementation for the paper: Ferret: Federated Full-Parameter Tuning at Scale for Large Language Models
☆14 Updated 10 months ago
KhanhHua2102 / Monetize.ai
Monetize.ai is a web-based chatbot that provides personalized investment advice using GPT-3.5 and Yahoo Finance API. It's built using Fla…
☆15 Updated 2 years ago
freds0 / kabooks
KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a…
☆12 Updated 2 years ago
LorenzoGianassi / Land-Diffuser
The Land-Diffuser is a novel application of the Denoising Diffusion Probabilistic Model (DDPM) in the realm of 3D Talking Head generation…
☆13 Updated last year
Bhashini-IITJ / visualTranslation
Implementation of Baseline for Scene Text-to-Scene Text Translation
☆16 Updated 3 months ago