Starter code for working with the YouTube-8M dataset.
☆16Jun 9, 2017Updated 8 years ago
Alternatives and similar repositories for youtube-8m
Users that are interested in youtube-8m are comparing it to the libraries listed below
Sorting:
- An extension to original vatic tools for human action labeling. vatic is an online, interactive video annotation tool for computer vision…☆28Jun 21, 2014Updated 11 years ago
- Audio-Visual Event Localization in Unconstrained Videos, ECCV 2018☆205Apr 3, 2021Updated 4 years ago
- A hand-gesture recognition system using Doppler effect of ultrasonic.☆11Mar 2, 2019Updated 7 years ago
- real time face swap and one-click video deepfake with only a single image☆12Sep 13, 2024Updated last year
- A tool built on top of OpenFace to detect eye contact with babies.☆13Nov 27, 2018Updated 7 years ago
- [ACM MobiSys 2024 Demo] Image-based Indoor Localization using Object Detection and LSTM☆12Feb 12, 2026Updated 3 weeks ago
- Create short vertical videos for TikTok, YouTube Shorts, and Instagram Reels using AI. Fully automated pipeline with traceability. 🚀🎥☆22Updated this week
- Stable-diffusion-WebUI extensions, which enable tensorrt accelerated Unet for SDXL base model☆12Oct 18, 2023Updated 2 years ago
- ☆17Oct 8, 2023Updated 2 years ago
- 藏语威利转写☆11Jul 19, 2016Updated 9 years ago
- SongDriver2 achieves a balance between real-time emotion fit and soft transitions, enhancing the coherence of the generated music.☆11Nov 15, 2025Updated 3 months ago
- A library to manipulate Inkscape SVG content using Python 3☆10Apr 28, 2021Updated 4 years ago
- Pythonic Nvidia Codec Library☆17Feb 23, 2026Updated 2 weeks ago
- This is the official repository for "Can GPTs Evaluate Graphic Design Based on Design Principles?".☆13Feb 10, 2025Updated last year
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"☆14Feb 24, 2025Updated last year
- An application to solve handwritten mathematical equations using deep learning algorithms.☆13Apr 8, 2018Updated 7 years ago
- An adjustment of the existing Virtual Makeup repository https://github.com/srivatsan-ramesh/Virtual-Makeup and https://github.com/badarsh…☆11Mar 13, 2020Updated 5 years ago
- This library removes the jitter and smooth the landmarks coming from Mediapipe☆13Jan 16, 2023Updated 3 years ago
- jitsi meet video call with gstreamer☆11Nov 25, 2021Updated 4 years ago
- Extended Inductive Reasoning for Personalized Preference Inference from Behavioral Signals☆11Jan 8, 2026Updated 2 months ago
- Showcase of P2P HLS streaming using WebTorrent☆12May 5, 2021Updated 4 years ago
- Python library for automated phone call testing using PJSIP☆10Aug 24, 2017Updated 8 years ago
- Gradio_demo.py with Blinking on Still Mode Video Creation☆12Jun 21, 2023Updated 2 years ago
- Automated Music Therapy Sessions using Iris and Face Detection☆11Sep 24, 2019Updated 6 years ago
- F-16 is a powerful video large language model (LLM) that perceives high-frame-rate videos, which is developed by the Department of Electr…☆34Jul 3, 2025Updated 8 months ago
- A trained model of YOLOv8 which will detect Fight or Violence and NonViolence in videos☆13Sep 20, 2024Updated last year
- NAR-BERT-ASR☆10Sep 27, 2021Updated 4 years ago
- Official code for DAM: Dynamic Adapter Merging for Continual Video QA Learning☆14Apr 25, 2024Updated last year
- Text-to-Speech Synthesis by Generating Spectrograms using Generative Adversarial Network☆10Dec 12, 2018Updated 7 years ago
- Build your own frontend AI agent with Chrome☆13Updated this week
- C++17 URL Parser (RFC 3986 compliant)☆11Jan 21, 2022Updated 4 years ago
- Sync Lip in Unity by Wav2Lip☆11Jan 14, 2021Updated 5 years ago
- ☆11Aug 10, 2021Updated 4 years ago
- Launch your speech synthesis within one minute.☆12May 6, 2024Updated last year
- Luma AI Video + Audio + Image Generation and RunwayML Video Generation from Image and Text☆16Jun 17, 2025Updated 8 months ago
- More reliable Video Understanding Evaluation☆14Sep 23, 2025Updated 5 months ago
- The implementation of "End-to-End Neural Speaker Diarization with an Iterative Adaptive Attractor Estimation", which is accepted by Neura…☆11Aug 27, 2023Updated 2 years ago
- FFmpeg node modules☆13Jan 6, 2016Updated 10 years ago
- Text detection in natural scene images using the Stoke Width Transform.☆21Aug 6, 2012Updated 13 years ago