oncescuandreea/QuerYD_downloader

☆23

Alternatives and similar repositories for QuerYD_downloader

Users that are interested in QuerYD_downloader are comparing it to the libraries listed below.

google-research-datasets / Video-Timeline-Tags-ViTT
A collection of videos annotated with timelines where each video is divided into segments, and each segment is labelled with a short free…
☆30 · Jan 15, 2022 · Updated 4 years ago
JonghwanMun / MarioQA
Repository for MarioQA: Answering Questions by Watching Gameplay Videos in ICCV 2017
☆10 · Oct 28, 2025 · Updated 8 months ago
microsoft / GEM
☆25 · Jun 25, 2021 · Updated 5 years ago
roudimit / c2kd
Code for the C2KD paper (ICASSP 2023)
☆20 · May 15, 2023 · Updated 3 years ago
oncescuandreea / audio-retrieval
Implementation of "Audio Retrieval with Natural Language Queries", INTERSPEECH 2021, PyTorch
☆26 · Aug 18, 2023 · Updated 2 years ago
Annusha / temperature_schedules
Temperature Schedules for self-supervised contrastive methods on long-tail data (ICLR'23)
☆18 · Apr 25, 2023 · Updated 3 years ago
Healbadbad / curveball-pytorch
An Implementation of "Small steps and giant leaps: Minimal Newton solvers for Deep Learning" In pytorch
☆21 · Jul 16, 2018 · Updated 8 years ago
VALUE-Leaderboard / DataRelease
Data Release for VALUE Benchmark
☆30 · Feb 16, 2022 · Updated 4 years ago
roudimit / AVLnet
Code for the AVLnet (Interspeech 2021) and Cascaded Multilingual (Interspeech 2021) papers.
☆54 · Mar 30, 2022 · Updated 4 years ago
kamperh / vqwordseg
Unsupervised phone and word segmentation using dynamic programming on self-supervised VQ features.
☆39 · May 5, 2026 · Updated 2 months ago
jonathan-roberts1 / SciFIBench
NeurIPS 2024: SciFIBench: Benchmarking Large Multimodal Models for Scientific Figure Interpretation
☆13 · May 24, 2025 · Updated last year
BayesWatch / pytorch-blockswap
Code for BlockSwap (ICLR 2020).
☆33 · Mar 25, 2021 · Updated 5 years ago
facebookresearch / AVID-CMA
Audio Visual Instance Discrimination with Cross-Modal Agreement
☆133 · Aug 13, 2021 · Updated 4 years ago
TerryPei / CSP
Cross-Self KV Cache Pruning for Efficient Vision-Language Inference
☆10 · Dec 15, 2024 · Updated last year
gokceuludogan / interactive-music-recommendation
Personalized and Interactive Music Recommendation with Bandit approach
☆11 · Sep 15, 2019 · Updated 6 years ago
lilianemomeni / KWS-Net
Seeing Wake Words: Audio-visual Keyword Spotting
☆67 · Sep 16, 2020 · Updated 5 years ago
afperezm / acoustic-images-distillation
Code for the paper: Audio-Visual Model Distillation Using Acoustic Images
☆21 · Mar 24, 2023 · Updated 3 years ago
edsonroteia / cav-mae-sync
[CVPR25] Official Implementation of CAV-MAE Sync
☆31 · Apr 5, 2026 · Updated 3 months ago
cvlab-columbia / globetrotter
Code for the Globetrotter project
☆23 · Mar 17, 2022 · Updated 4 years ago
antoine77340 / S3D_HowTo100M
S3D Text-Video model trained on HowTo100M using MIL-NCE
☆200 · Jul 3, 2020 · Updated 6 years ago
keithnoguchi / do-in-action
DO with Terraform and Ansible
☆11 · Jun 5, 2018 · Updated 8 years ago
brian7685 / Multimodal-Clustering-Network
ICCV 2021
☆34 · May 11, 2022 · Updated 4 years ago
Cloud-CV / vilbert-multi-task
12-in-1: Multi-Task Vision and Language Representation Learning Web Demo
☆35 · Dec 8, 2022 · Updated 3 years ago
gooofy / zbrain
Infrastructure useful to create natural language processing systems based on transformer networks
☆12 · Sep 26, 2019 · Updated 6 years ago
wnhsu / ResDAVEnet-VQ
Official codes for the paper "Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech"
☆28 · Feb 22, 2022 · Updated 4 years ago
telecombcn-dl / 2018-dlsl
UPC Deep Learning for Speech and Language 2018
☆17 · Feb 26, 2018 · Updated 8 years ago
alinourian / Fine-tuning-Mistral-7b-QA
Fine tuning Mistral-7b with PEFT(Parameter Efficient Fine-Tuning) and LoRA(Low-Rank Adaptation) on Puffin Dataset(multi-turn conversation…
☆12 · Nov 23, 2023 · Updated 2 years ago
Vision-CAIR / Infinibench
Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows
☆20 · Nov 4, 2025 · Updated 8 months ago
nikvaessen / disjoint-mtl
Research code for "Towards multi-task learning of speech and speaker recognition" at https://arxiv.org/pdf/2302.12773.pdf
☆12 · Dec 2, 2024 · Updated last year
uvafan / timelines-takeoff-ai-2027
☆18 · Dec 10, 2025 · Updated 7 months ago
qinenergy / webvision-2020-public
Webvision Challenge 2020 developer kit
☆10 · Dec 8, 2022 · Updated 3 years ago
BoyuanChen / boombox
Code release for paper: The Boombox: Visual Reconstruction from Acoustic Vibrations
☆15 · May 18, 2021 · Updated 5 years ago
Bartelds / ctc-dro
Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.
☆17 · May 16, 2025 · Updated last year
marcoancona / LPDN
Feedforward implementation of Lightweight Probabilistic Deep Networks for Keras and Tensorflow
☆14 · Jul 1, 2019 · Updated 7 years ago
audio-captioning / clotho-dataset
Python code for handling the Clotho dataset.
☆85 · Nov 24, 2020 · Updated 5 years ago
Saurabhbhati / DASS
☆12 · Apr 26, 2025 · Updated last year
kundajelab / keras
Theano-based Deep Learning library (convnets, recurrent neural networks, and more).
☆14 · Aug 2, 2017 · Updated 8 years ago
William-N-Havard / SpeechCoco
☆12 · Nov 23, 2020 · Updated 5 years ago
MahjongKing96 / flipradio.articles
Some articles by flipradio anchor --- Li HouChen
☆17 · Mar 26, 2025 · Updated last year