surisdi/youtube-8m

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/surisdi/youtube-8m)

surisdi / youtube-8m

Starter code for working with the YouTube-8M dataset.

☆16

Alternatives and similar repositories for youtube-8m

Users that are interested in youtube-8m are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Kajiyu / LLLNet
View on GitHub
Keras Implementation of "Look, Listen and Learn" Model
☆21Nov 14, 2017Updated 8 years ago
YapengTian / AVE-ECCV18
View on GitHub
Audio-Visual Event Localization in Unconstrained Videos, ECCV 2018
☆210Apr 3, 2021Updated 5 years ago
CarlWangChina / REMAST-Real-time-Emotion-based-Music-Arrangement-with-Soft-Transition
View on GitHub
SongDriver2 achieves a balance between real-time emotion fit and soft transitions, enhancing the coherence of the generated music.
☆11Nov 15, 2025Updated 8 months ago
HaoFengyuan / EEND-IAAE
View on GitHub
The implementation of "End-to-End Neural Speaker Diarization with an Iterative Adaptive Attractor Estimation", which is accepted by Neura…
☆11Aug 27, 2023Updated 2 years ago
Yip-Jia-Qi / codecformer
View on GitHub
☆21Jul 15, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
TIGER-AI-Lab / VideoEval-Pro
View on GitHub
VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation [TMLR26]
☆15Jun 1, 2026Updated last month
mandrean / postman-collection-rs
View on GitHub
A Postman Collection serialization & deserialization library, written in Rust.
☆12Mar 22, 2026Updated 4 months ago
vra / easybox
View on GitHub
☐ ☐ A simple, out-of-the-box and cross-platform bbox annotation tool by Python. Try it by `pip install easybox`
☆10May 28, 2021Updated 5 years ago
showlab / AVA-AVD
View on GitHub
☆22Nov 24, 2022Updated 3 years ago
zhangyuwangumass / Glyph-based-Chinese-Character-Embedding
View on GitHub
Research project on glyph-based Chinese character embedding. Preparing for EMNLP 2019
☆11Mar 18, 2019Updated 7 years ago
donohoe / simple-gdpr-lockdown
View on GitHub
You don't need to block EU visitors over GDPR. Just lockdown your site.
☆14May 25, 2018Updated 8 years ago
adxcreative / D-M
View on GitHub
The official source code of our AAAI25 paper "D&M: Enriching E-commerce Videos with Sound Effects by Key Moment Detection and SFX Matchin…
☆10Feb 9, 2025Updated last year
NeuroRoboticTech / AnimatLabPublicSource
View on GitHub
Public source code for AnimatLab neuromechanical simulator system
☆15Oct 4, 2016Updated 9 years ago
Archivoice / so-vits-svc
View on GitHub
基于vits与softvc的歌声音色转换模型
☆12Jan 9, 2023Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
SarthakYadav / axlstm-official
View on GitHub
Official repository for the paper "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs"
☆21Sep 7, 2025Updated 10 months ago
LARC-CMU-SMU / ACME
View on GitHub
Learning Cross-Modal Embeddings with Adversarial Networks for Cooking Recipes and Food Images
☆30Jun 14, 2019Updated 7 years ago
wzk1015 / video-bgm-generation
View on GitHub
[ACM MM 2021 Best Paper Award] Video Background Music Generation with Controllable Music Transformer
☆327Jun 8, 2025Updated last year
tencent-wechat / TransferStatement
View on GitHub
phxpaxos/phxrpc/phxsql had been transferred to Tencent
☆10Aug 2, 2017Updated 8 years ago
hangzhaomit / Sound-of-Pixels
View on GitHub
Codebase for ECCV18 "The Sound of Pixels"
☆393Apr 25, 2022Updated 4 years ago
lllllT / AtmosphereLogger
View on GitHub
Logs atmospheric pressure by using Android device's barometer sensor.
☆16Nov 11, 2018Updated 7 years ago
CennyMo / Virtual-Makeup
View on GitHub
An adjustment of the existing Virtual Makeup repository https://github.com/srivatsan-ramesh/Virtual-Makeup and https://github.com/badarsh…
☆11Mar 13, 2020Updated 6 years ago
a43992899 / DeID-VC
View on GitHub
Code for Interspeech2022 paper DeID-VC: Speaker De-identification via Zero-shot Pseudo Voice Conversion
☆13May 6, 2023Updated 3 years ago
Kelvin-Zhong / Click-Through-Rate-Prediction
View on GitHub
This is a Kaggle data mining contest, link: https://www.kaggle.com/c/avazu-ctr-prediction
☆11Mar 12, 2015Updated 11 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Happenmass / stable-diffusion-webui-tensorRT-sdxl
View on GitHub
Stable-diffusion-WebUI extensions, which enable tensorrt accelerated Unet for SDXL base model
☆12Oct 18, 2023Updated 2 years ago
aaronzguan / Autonomous-Bin-Picking
View on GitHub
RLBench simulation project for autonomous bin picking using Pandas robot arm
☆11Mar 1, 2021Updated 5 years ago
w3c / i18n-discuss
View on GitHub
A place to hold discussions on i18n topics, and to put documents that summarise, support or initiate those discussions.
☆21May 2, 2026Updated 2 months ago
dreamdragon / vatic
View on GitHub
An extension to original vatic tools for human action labeling. vatic is an online, interactive video annotation tool for computer vision…
☆28Jun 21, 2014Updated 12 years ago
andrewowens / multisensory
View on GitHub
Code for the paper: Audio-Visual Scene Analysis with Self-Supervised Multisensory Features
☆225Jul 17, 2019Updated 7 years ago
JusperLee / Swift-Net
View on GitHub
Power-Guided Grouped SRU for Real-Time Causal Audio-Visual Speech Separation
☆26Jul 20, 2026Updated last week
pashpashpash / Eye-Contact-Detection-With-OpenFace
View on GitHub
A tool built on top of OpenFace to detect eye contact with babies.
☆13Nov 27, 2018Updated 7 years ago
zhishui3 / Tencent_AILab_ChineseEmbedding
View on GitHub
Tencent_AILab_ChineseEmbedding
☆12Dec 30, 2018Updated 7 years ago
wenqsun / Freeplane
View on GitHub
Code for paper: Freeplane: Unlocking Free Lunch in Triplane-Based Sparse-View Reconstruction Models
☆18Jun 6, 2024Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
idansc / simple-avsd
View on GitHub
Code for ''A Simple Baseline for Audio-Visual Scene-Aware Dialog``
☆27May 26, 2020Updated 6 years ago
ashwatc / Video_Gesture_Overlay
View on GitHub
A machine learning and computer vision based application to recognize hand gestures and facial tracking, and subsequently display corresp…
☆14Dec 28, 2020Updated 5 years ago
vensu-art / AnimatedDrawingsGradio
View on GitHub
Added Gradio UI to accompany "A Method for Animating Children's Drawings of the Human Figure"
☆13May 24, 2023Updated 3 years ago
lifelong-robotic-vision / lifelong-object-recognition-challenge
View on GitHub
This is a PyTorch implementation of baseline model of IROS2019 lifelong object recognition challenge.
☆15Oct 3, 2023Updated 2 years ago
Xin-Ye-1 / HRL-GRG
View on GitHub
☆17Mar 26, 2021Updated 5 years ago
entn-at / DurIAN-1
View on GitHub
Implementation of "DurIAN: Duration Informed Attention Network For Multimodal Synthesis".
☆15Jul 6, 2020Updated 6 years ago
kevinliang888 / IVR-QA-baselines
View on GitHub
[ICCV 2023] Simple Baselines for Interactive Video Retrieval with Questions and Answers
☆20Apr 16, 2024Updated 2 years ago