zhaoyanpeng/vipant

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zhaoyanpeng/vipant)

zhaoyanpeng / vipant

VIsually-Pivoted Audio and(N) Text

☆22

Alternatives and similar repositories for vipant

Users that are interested in vipant are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

qunshansj / Swin-Transformer-Enhanced-YOLO-Power-Tower-Recognition-System
View on GitHub
基于Swin-Transformer改进_YOLOv7电力杆塔识别系统
☆14Nov 27, 2023Updated 2 years ago
AndresPMD / Clip_CMR
View on GitHub
CLIP-based simple image-text matching baseline for COCO and F30K
☆15Sep 16, 2021Updated 4 years ago
Abimbola-ai / Oil-and-gas-pipeline-leakage
View on GitHub
☆19Dec 9, 2020Updated 5 years ago
stoneMo / AVGN
View on GitHub
Official implementation for AVGN
☆42Mar 24, 2023Updated 3 years ago
YChen1993 / ELECRec
View on GitHub
Training Sequential Recommenders as Discriminators (SIGIR'22)
☆15Jul 25, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
lucidrains / CLAP
View on GitHub
Contrastive Language-Audio Pretraining
☆15May 18, 2021Updated 5 years ago
albertoCCz / vmd_cvm_python
View on GitHub
Implementation in Python/Cython of the algorithm VMD_CVM for signal denoising
☆12Jul 29, 2022Updated 4 years ago
NINAnor / rare_species_detections
View on GitHub
Repository for fine-tuning BEATs and using BEATs as feature extractor in a prototypical network. This repository has been used to complet…
☆34Dec 28, 2025Updated 7 months ago
haoheliu / ontology-aware-audio-tagging
View on GitHub
☆14Nov 22, 2022Updated 3 years ago
MGitHubL / TMac
View on GitHub
☆14Feb 26, 2024Updated 2 years ago
nii-yamagishilab / SSL-SAS
View on GitHub
Language independent SSL-based Speaker Anonymization system
☆20May 28, 2024Updated 2 years ago
bkasvenkatesh / Classifying-Environmental-Sounds-with-Image-Networks
View on GitHub
Master Thesis
☆13Feb 20, 2017Updated 9 years ago
c4dm / dcase-few-shot-bioacoustic
View on GitHub
☆61Jul 2, 2024Updated 2 years ago
Kowalski1024 / Mi-Go
View on GitHub
Mi-Go is an open-source test framework designed to evaluate and compare the accuracy of speech-to-text models on YouTube dataset.
☆12Jul 2, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
nttcslab / dcase2023_task2_evaluator
View on GitHub
☆12Aug 10, 2023Updated 2 years ago
fundamentalvision / Siamese-Image-Modeling
View on GitHub
☆16Jul 7, 2023Updated 3 years ago
archival-archetyping / i.frame
View on GitHub
i.frame is an open-source platform for decentralized online events. You can provide cohesion and coherence temporarily to the programs di…
☆12Dec 26, 2021Updated 4 years ago
cfoster0 / CLAP
View on GitHub
Contrastive Language-Audio Pretraining
☆88Mar 6, 2022Updated 4 years ago
guyAmit / GLOD
View on GitHub
Github for the conference paper GLOD-Gaussian Likelihood OOD detector
☆16Apr 18, 2022Updated 4 years ago
srush / mamba-scans
View on GitHub
Blog post
☆17Feb 16, 2024Updated 2 years ago
sato9hara / PertMap
View on GitHub
Python code for perturbation-based saliency map
☆12Jul 16, 2018Updated 8 years ago
tpt-adasp / salt
View on GitHub
SALT: STANDARDIZED AUDIO EVENT LABEL TAXONOMY
☆16Nov 28, 2024Updated last year
fsct135 / DCAP
View on GitHub
DCAP:Integrating multi-omics data with deep learning for predicting cancer prognosis
☆21Feb 5, 2021Updated 5 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
revsic / jax-variational-diffwave
View on GitHub
Jax/Flax implementation of Variational-DiffWave.
☆40Feb 27, 2022Updated 4 years ago
roudimit / c2kd
View on GitHub
Code for the C2KD paper (ICASSP 2023)
☆20May 15, 2023Updated 3 years ago
alexlioralexli / noncontrastive-ssl
View on GitHub
Analyzing partial dimensional collapse in non-contrastive self-supervised learning. "Understanding Collapse in Non-Contrastive Siamese Re…
☆16Nov 12, 2023Updated 2 years ago
microsoft / NTT
View on GitHub
Navigation Turing Test (NTT): Learning to Evaluate Human-Like Navigation [ICML 2021]
☆14Jul 17, 2025Updated last year
RicherMans / CDur
View on GitHub
Repository for the paper "Towards duration robust weakly supervised sound event detection"
☆23Aug 3, 2023Updated 2 years ago
Torabiy / HLS-CMDS
View on GitHub
Heart and Lung Sounds Dataset Recorded from a Clinical Manikin using Digital Stethoscope (HLS-CMDS)
☆19May 13, 2026Updated 2 months ago
anair13 / bullet-manipulation-affordances
View on GitHub
☆13Jun 3, 2022Updated 4 years ago
the-bird-F / GLM-Voice-RAG
View on GitHub
[EMNLP 2025 Findings] A complete cross-modal RAG system for end-to-end speech-to-speech large models, including ASR-based Retrieval and E…
☆31Jul 11, 2025Updated last year
haohao11 / AMENet
View on GitHub
☆11Dec 7, 2020Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
tasyiann / 2Dto3DMotion
View on GitHub
A repo with Unity3D inspector tools, using OpenPose to predict 3D Character animation motion from 2D figures.
☆10Dec 17, 2021Updated 4 years ago
pmarks-net / incognito-proxy
View on GitHub
Incognito Proxy chrome extension
☆10Sep 27, 2023Updated 2 years ago
etzinis / biased_separation
View on GitHub
Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation
☆14Nov 16, 2020Updated 5 years ago
duongviet2904 / unity-dragon-ball
View on GitHub
Combat dragon ball game with Unity 2D
☆12Mar 13, 2023Updated 3 years ago
adobe-research / Cross-lingual-Test-Dataset-XTD10
View on GitHub
☆17Dec 22, 2021Updated 4 years ago
xmichelleshihx / AL-LRTD
View on GitHub
Long-range temporal dependency based active learning for surgical workflow recognition
☆10Apr 23, 2020Updated 6 years ago
valterlej / zsarcap
View on GitHub
Official code for Tell Me What You See: A Zero-Shot Action Recognition Method Based on Natural Language Descriptions (Multimedia Tools an…
☆13Mar 8, 2024Updated 2 years ago