Dataset for Visually Indicated Sound Generation by Perceptually Optimized Classification
☆22Apr 6, 2020Updated 5 years ago
Alternatives and similar repositories for VIG
Users who are interested in VIG are comparing it to the libraries listed below.
- Website for TextVQA dataset.☆28Apr 30, 2023Updated 2 years ago
- Dilated DenseNets☆15Dec 30, 2017Updated 8 years ago
- ☆17Nov 17, 2020Updated 5 years ago
- Mapping Images to Scene Graphs with Permutation-Invariant Structured Prediction☆12Aug 1, 2018Updated 7 years ago
- Code for the paper "learning to fool the speaker recognition"☆10Jun 12, 2020Updated 5 years ago
- BISON: Binary Image SelectiON☆49Sep 15, 2021Updated 4 years ago
- logboard: Monitor and Compare Logs on Browser/Terminal.☆21Sep 19, 2019Updated 6 years ago
- Coach compensation calculator using Vue and d3☆10Jan 3, 2023Updated 3 years ago
- Flow control nodes for comfyUI, allowing for more diverse workflows☆13Apr 3, 2025Updated 11 months ago
- Latex resume template☆12Mar 29, 2012Updated 13 years ago
- ☆14Apr 6, 2014Updated 11 years ago
- ☆10Dec 21, 2019Updated 6 years ago
- A toy Text-to-Speech system for Chinese/Mandarin synthesis, inspired by Tacotron & FastSpeech2 & RefineGAN.☆15May 25, 2022Updated 3 years ago
- Official implementation of ICCV19 oral paper Zero-Shot grounding of Objects from Natural Language Queries (https://arxiv.org/abs/1908.071…☆72Apr 22, 2020Updated 5 years ago
- SinGlow is a part of my singing voice synthesis system. It can extract features of sound, particularly songs and music. Then we can use …☆11Oct 9, 2021Updated 4 years ago
- Audio-conditioned video texture generation☆24Sep 16, 2022Updated 3 years ago
- Mixtures of von Mises-Fisher Distributions☆12Mar 23, 2015Updated 11 years ago
- ☆19Feb 6, 2019Updated 7 years ago
- A simple extension that uses Bark Text-to-Speech for audio output☆11Nov 20, 2023Updated 2 years ago
- Other than papers from big-name labs and universities, most AI research papers get less than 10 readers, even though there might be gems …☆15Jul 20, 2018Updated 7 years ago
- Contains code for our work on speech to singing conversion (ICASSP 2020)☆50Oct 27, 2020Updated 5 years ago
- assign color hues to a collection of text fragments based on embeddings☆20Jun 15, 2024Updated last year
- Prose for a painting source code☆12Oct 8, 2019Updated 6 years ago
- ☆20Oct 18, 2021Updated 4 years ago
- New tex4ht documentation☆12Nov 20, 2025Updated 4 months ago
- Torch bindings for FFmpeg (reading videos only)☆26Jul 13, 2016Updated 9 years ago
- Experience-embedded Visual Foresight, CoRL 2019☆14Nov 13, 2019Updated 6 years ago
- Copyright-free Artificial Lyrics Dataset (ISMIR 2024 LBD)☆12Sep 1, 2024Updated last year
- Web UI for Bark by Suno.ai built with next.js☆12Jun 15, 2023Updated 2 years ago
- My own take on something like AI Dungeon.☆12Jul 27, 2020Updated 5 years ago
- Resources for recent AI systems (deployment concerns, cost and accessibility). -- closed☆12May 29, 2021Updated 4 years ago
- Learning Dynamic Generator Model by Alternating Back-Propagation Through Time☆11Dec 26, 2018Updated 7 years ago
- This package calculates the position and orientation for a robot where it is most likely to find and recognize an object☆12Jan 13, 2020Updated 6 years ago
- PyTorch implementation of LocoGAN: https://arxiv.org/abs/2002.07897☆11Feb 8, 2021Updated 5 years ago
- PyTorch implementation of Transformer-based Neural Machine Translation☆78Dec 14, 2022Updated 3 years ago
- ☆13Sep 17, 2021Updated 4 years ago
- Language model with phrase induction☆14Jun 13, 2019Updated 6 years ago
- The method of text-to-image☆48Dec 19, 2019Updated 6 years ago
- Oculus-Rift - Gazebo Navigator (PS3 controller & keyboard op)☆11Oct 9, 2014Updated 11 years ago