NVIDIA / synthdaView external linksLinks
SynthDa is a framework designed to make synthetic data generation for human actions more usable and accessible. This is a pose-level augmentation framework that generates synthetic training videos by interpolating real and AI-generated poses. It increases minority-class coverage, helping to mitigate data scarcity for rare actions.
☆41Sep 21, 2025Updated 4 months ago
Alternatives and similar repositories for synthda
Users that are interested in synthda are comparing it to the libraries listed below
Sorting:
- Source Code for Captionomaly: A Deep Learning Toolbox for Anomaly Captioning in Surveillance Videos☆13Jun 26, 2023Updated 2 years ago
- Implementation for "StyleGAN-Canvas: Augmenting StyleGAN3 for Real-Time Human-AI Co-Creation"☆11May 24, 2023Updated 2 years ago
- Virtual character locomotion system. See article“Motion Graphs”, Lucas Kovar, 2002☆12Mar 1, 2012Updated 13 years ago
- Official implementation for “SafeMVDrive: Multi-view Safety-Critical Driving Video Synthesis in the Real World Domain”☆20Dec 11, 2025Updated 2 months ago
- Implementation of SoundtStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆13Jan 27, 2025Updated last year
- ☆12Oct 14, 2024Updated last year
- Project containing examples on how to use AiFi's Public Dataset on People Shopping☆11May 22, 2023Updated 2 years ago
- Official repository of "TDSD: Text-Driven Scene-Decoupled Weakly Supervised Video Anomaly Detection"☆11May 25, 2025Updated 8 months ago
- A cage-based deformation for meshes in 2D.☆14Sep 8, 2018Updated 7 years ago
- This is the official repository of Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across Modalities☆36Jul 4, 2025Updated 7 months ago
- A Retrieval-Augmented Gaussian Mixture Variational Auto-Encoder for Language Modeling☆15Dec 5, 2023Updated 2 years ago
- LMM for VQA, tcsvt version☆11Jul 19, 2024Updated last year
- An open-source deep learning framework for tooth enumeration and segmentation in intraoral photos☆24Apr 27, 2025Updated 9 months ago
- Code to reproduce 'MOCCA: Multi-Layer One-Class Classification for Anomaly Detection'☆10Dec 12, 2021Updated 4 years ago
- [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts☆17Apr 2, 2025Updated 10 months ago
- [WIP] Python port/rewrite of pbrt, the physically based renderer by Matt Pharr and Greg Humphreys☆13May 19, 2013Updated 12 years ago
- [ACL 2023] VSTAR is a multimodal dialogue dataset with scene and topic transition information☆15Oct 27, 2024Updated last year
- scripts to convert model formats with blender☆12Aug 5, 2016Updated 9 years ago
- C++/OpenGL tool for cage-based deformation, including functionality for cage generation. Originally developed as a fall 2019 term project…☆12Sep 10, 2020Updated 5 years ago
- This should provide a minimum example for jetson☆11Jun 19, 2017Updated 8 years ago
- ☆15Apr 9, 2023Updated 2 years ago
- ☆15May 27, 2020Updated 5 years ago
- Master's thesis on volumetric rendering. Contains a path tracing and a progressive volumetric photon mapping implementation.☆14Nov 28, 2025Updated 2 months ago
- ☆13Feb 12, 2024Updated 2 years ago
- Video examples of "Appearance Composing GAN: A General Method for Appearance-Controllable Human Video Motion Transfer"☆15Dec 28, 2020Updated 5 years ago
- Official code for "FedVAD: Enhancing Federated Video Anomaly Detection with GPT-Driven Semantic Distillation"☆15Jul 13, 2024Updated last year
- A utility library to help integrate Python applications with Metropolis Microservices for Jetson☆16Dec 21, 2024Updated last year
- Official implementation for CVPR'2021 paper Neural Deformation Graphs☆13Jul 13, 2021Updated 4 years ago
- Implemented large scale 3d mesh generation in parallel using CUDA.☆13May 6, 2019Updated 6 years ago
- Using oblique decision trees to represent 3D shapes☆18Aug 20, 2023Updated 2 years ago
- [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts☆21Dec 22, 2025Updated last month
- Official Implementation of AQ-GT: a Temporally Aligned and Quantized GRU-Transformer for Co-Speech Gesture Synthesis with the extension (…☆21Apr 19, 2024Updated last year
- a local RAG LLM with persistent database to query your PDFs☆16Feb 8, 2024Updated 2 years ago
- Awesome-Text2Motion-Generation☆18Oct 26, 2023Updated 2 years ago
- Official implementation for GarmageNet: A Multimodal Generative Framework for Sewing Pattern Design and Generic Garment Modeling (SIGGRAP…☆43Jan 16, 2026Updated 3 weeks ago
- ☆19Jun 29, 2025Updated 7 months ago
- Real-time multithreaded webcam capture in modern C++20 & OpenCV; 70 FPS with lock-free ring buffer☆19Jul 21, 2025Updated 6 months ago
- Cuda Implementation of Killing Fusion☆13Dec 6, 2019Updated 6 years ago
- Retarget facial animation created with the LiveLinkFace app and saved as CSV onto a metahuman skeleeton in MotionBuilder.☆19Feb 11, 2022Updated 4 years ago