scliubit / PPT2VideoLinks
generate video with voice narration from ppt/pdf Slides
☆10Updated 2 years ago
Alternatives and similar repositories for PPT2Video
Users that are interested in PPT2Video are comparing it to the libraries listed below
Sorting:
- Code for paper: "Privately generating tabular data using language models".☆15Updated 2 years ago
- This project predicts wind turbine failure using numerous sensor data by applying classification based ML models that improves prediction…☆11Updated 2 years ago
- Simple, Unified Repository for Retrieval-based Voice Conversion☆17Updated last year
- This is the code for the "Robust Gait Recognition based on Deep CNNs with Camera and Radar Sensor Fusion".☆13Updated 2 years ago
- Faysal-MD / Unmasking-Deepfake-Faces-from-Videos-An-Explainable-Cost-Sensitive-Deep-Learning-Approach-IEEE2023Deepfake faces detection from forged videos where used explainable AI for models' robustness as well as cost sensitive methods for mitiga…☆10Updated last year
- Apply an end-to-end model structure (ViT + GPT) to describe images in more detail, rather than traditional image captioning that only pro…☆11Updated last year
- In this project, I used Decision Tree Learning Model as the main algorithm to build the model. Due to the big amount of flight data, we i…☆12Updated 4 years ago
- ☆18Updated last year
- This is the official implementation for the paper: Ferret: Federated Full-Parameter Tuning at Scale for Large Language Models☆19Updated last year
- Tally Prime MCP (Model Context Protocol) Server implementation to feed Tally ERP data to popular LLM like Claude, ChatGPT supporting MCP☆14Updated 2 months ago
- Code for our NeurIPS2023 accepted paper: RADAR: Robust AI-Text Detection via Adversarial Learning. We tested RADAR on 8 LLMs including Vi…☆70Updated 4 months ago
- A curated list of resources in audio visual question answering and related area. :-)☆17Updated 7 months ago
- Towards Understanding the Mixture-of-Experts Layer in Deep Learning☆35Updated 2 years ago
- Instructions and code to create for a table of UMLS, SNOMED or HPO concepts containing Dutch medical names, usable in named entity recogn…☆11Updated 3 years ago
- ☆15Updated 2 years ago
- A desktop compatible version of the Defog app☆14Updated last year
- ☆10Updated 2 years ago
- A open-source framework designed to adapt pre-trained Language Models (LLMs), such as Llama, Mistral, and Mixtral, to a wide array of dom…☆23Updated last year
- Taking advantage of LlamaIndex's in-context learning paradigm, LlamaDoc empowers users to input PDF documents and pose any questions rela…☆14Updated 2 years ago
- Deep metric learning: Triplet, Magnet and VMF loss☆11Updated 3 years ago
- Sample and Computation Redistribution for Efficient Face Detection☆16Updated last year
- Code of Spectral-Temporal Low-Rank Regularization with Deep Prior for Thick Cloud Removal☆19Updated 2 years ago
- Enabling the use of multiple modalities while prompting Stable Diffusion☆15Updated 3 years ago
- This is the code of the paper "SpectrumFM: A Foundation Model for Intelligent Spectrum Management"☆25Updated last month
- survery of small language models☆18Updated last year
- The Land-Diffuser is a novel application of the Denoising Diffusion Probabilistic Model (DDPM) in the realm of 3D Talking Head generation…☆13Updated 2 years ago
- Rate-Adaptive Quantization: A Multi-Rate Codebook Adaptation for Vector Quantization-based Generative Models☆15Updated 4 months ago
- This repository is a comprehensive project that leverages the XLM-Roberta model for intent detection. This repository is a valuable resou…☆16Updated 2 years ago
- Python scripts and assets related to Multimodal-Wireless dataset. The dataset can be found at☆18Updated 2 weeks ago
- Streaming responses with Streamlit, ChatGPT and Langchain.☆11Updated 2 years ago