scliubit / PPT2VideoLinks
generate video with voice narration from ppt/pdf Slides
☆10Updated 2 years ago
Alternatives and similar repositories for PPT2Video
Users that are interested in PPT2Video are comparing it to the libraries listed below
Sorting:
- Simple, Unified Repository for Retrieval-based Voice Conversion☆17Updated last year
- This is the code for the "Robust Gait Recognition based on Deep CNNs with Camera and Radar Sensor Fusion".☆13Updated 2 years ago
- Code for paper: "Privately generating tabular data using language models".☆15Updated 2 years ago
- Code for our NeurIPS2023 accepted paper: RADAR: Robust AI-Text Detection via Adversarial Learning. We tested RADAR on 8 LLMs including Vi…☆62Updated last week
- The implementation of CV-CFUNet using tensorflow. (CV-CFUNet: Complex-Valued Channel Fusion UNet for Refocusing of Ship Targets in SAR Im…☆15Updated 2 years ago
- Indic-Conformer models for ASR☆18Updated last year
- A curated list of resources in audio visual question answering and related area. :-)☆14Updated 2 months ago
- ☆10Updated last year
- A Transformer-based Prediction Method for Depth of Anesthesia During Target-controlled Infusion of Propofol and Remifentanil.☆12Updated 7 months ago
- Apply an end-to-end model structure (ViT + GPT) to describe images in more detail, rather than traditional image captioning that only pro…☆11Updated 8 months ago
- This is a simple torch implementation of the high performance Multi-Query Attention☆16Updated 2 years ago
- Code of Spectral-Temporal Low-Rank Regularization with Deep Prior for Thick Cloud Removal☆19Updated last year
- ☆15Updated 2 years ago
- Rate-Adaptive Quantization: A Multi-Rate Codebook Adaptation for Vector Quantization-based Generative Models☆13Updated 2 weeks ago
- ☆32Updated 10 months ago
- Towards Understanding the Mixture-of-Experts Layer in Deep Learning☆31Updated last year
- Official implementation of Adaptive Feature Transfer (AFT)☆23Updated last year
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech☆22Updated last year
- A composition of offline tools to achieve high quality multilingual speech to text transcription☆19Updated 3 weeks ago
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI.☆11Updated 8 months ago
- Co-VeGAN: Complex-Valued Generative Adversarial Network for Compressive Sensing MR Image Reconstruction☆14Updated last year
- Taking advantage of LlamaIndex's in-context learning paradigm, LlamaDoc empowers users to input PDF documents and pose any questions rela…☆14Updated 2 years ago
- This is the official implementation for the paper: Ferret: Federated Full-Parameter Tuning at Scale for Large Language Models☆17Updated last year
- DoyenTalker uses deep learning techniques to generate personalized avatar videos that speak user-provided text in a specified voice. The …☆13Updated last year
- ☆50Updated last year
- Implementation of O-OFDMNet, a deep learning-based optical OFDM system☆11Updated 3 years ago
- ☆14Updated 2 years ago
- Benchmarks for Business Document Foundation Models☆10Updated last year
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆15Updated last year
- Investigating Cultural Alignment of Large Language Models☆13Updated last year