blmoistawinde/fense

Fluency ENhanced Sentence-bert Evaluation (FENSE), a metric for audio caption evaluation, and the benchmark datasets AudioCaps-Eval and Clotho-Eval.

☆21

Alternatives and similar repositories for fense

Users that are interested in fense are comparing it to the libraries listed below.

lukewys / dcase_2020_T6
2nd place solution for 2020 DCASE challenge task 6 audio captioning. http://dcase.community/challenge2020/task-automatic-audio-captioning…
☆24 · Aug 3, 2023 · Updated 2 years ago
wsntxxn / AudioCaption
Audio captioning recipe
☆53 · Oct 23, 2025 · Updated 9 months ago
XinhaoMei / DCASE2021_task6_v2
Code for CVSSP submission to DCASE 2021 Task 6
☆36 · Nov 22, 2022 · Updated 3 years ago
frankenliu / LOAE
☆10 · Sep 25, 2024 · Updated last year
audio-captioning / audio-captioning-resources
A list of resources that can help in research for automated audio captioning
☆34 · Feb 17, 2021 · Updated 5 years ago
tpt-adasp / salt
SALT: STANDARDIZED AUDIO EVENT LABEL TAXONOMY
☆15 · Nov 28, 2024 · Updated last year
audio-captioning / audio-captioning-papers
A list of papers about audio captioning
☆78 · Jul 1, 2022 · Updated 4 years ago
Audio-AGI / dcase2024_task9_baseline
Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"
☆26 · Mar 27, 2024 · Updated 2 years ago
SpeakerRecognizer / VoicePersonification
☆18 · Dec 8, 2025 · Updated 7 months ago
swagshaw / WildDESED
WildDESED: A LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection
☆18 · Nov 19, 2024 · Updated last year
RicherMans / Dasheng
Source for the Interspeech 2024 Paper "Scaling up masked audio encoder learning for general audio classification"
☆86 · Nov 7, 2025 · Updated 8 months ago
zeyuxie29 / PicoAudio
☆45 · Jan 13, 2025 · Updated last year
zeyuxie29 / AudioTime
☆39 · Jul 4, 2024 · Updated 2 years ago
wsntxxn / TextToAudioGrounding
The dataset and baseline code for Text-to-Audio Grounding (TAG)
☆49 · Oct 23, 2025 · Updated 9 months ago
averkij / Word-to-Number-Russian
A project for converting numbers written out as text in Russian.
☆11 · Apr 5, 2022 · Updated 4 years ago
jaeyeonkim99 / EnCLAP
Official Implementation of EnCLAP (ICASSP 2024)
☆96 · Jun 2, 2024 · Updated 2 years ago
v-manhlt3 / m-LTM-Audio-Text-Retrieval
☆13 · Jan 5, 2025 · Updated last year
cdjkim / audiocaps
🔊 Repository for our NAACL-HLT 2019 paper: AudioCaps
☆215 · Oct 6, 2025 · Updated 9 months ago
DCASE-REPO / dcase_datalist
Collection of DCASE related datasets
☆18 · Jun 12, 2025 · Updated last year
soham97 / mellow
small audio language model for reasoning
☆88 · Dec 4, 2025 · Updated 7 months ago
wonjune-kang / expressive-speech-retrieval
Expressive Speech Retrieval using Natural Language Descriptions of Speaking Style
☆15 · Aug 18, 2025 · Updated 11 months ago
microsoft / WavText5K
Web-crawl for "Audio Retrieval with WavText5K and CLAP Training"
☆50 · Nov 10, 2022 · Updated 3 years ago
RicherMans / UIT_Mobile
Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"
☆24 · Mar 6, 2023 · Updated 3 years ago
malihealikhani / Cross-modal_Coherence_Modeling
Cross-modal Coherence Modeling for Caption Generation
☆11 · Jul 24, 2020 · Updated 6 years ago
NieeiM / Dasheng-Audiogen
Generate a complete audio clip with music, intelligible speech, and sound effects from text in one pass.
☆44 · May 27, 2026 · Updated last month
lijuncheng16 / AudioTaggingDoneRight
experiments about AudioSet
☆43 · Jul 22, 2023 · Updated 3 years ago
V-Sense / 360AudioVisual
This repository contains materials for the paper: Towards generating ambisonics using audio-visual cue for virtual reality
☆13 · Jul 2, 2019 · Updated 7 years ago
RicherMans / HEAR2021_EfficientLatent
Submission to the HEAR2021 Challenge
☆17 · Mar 5, 2022 · Updated 4 years ago
aditya-AI / Information-Retrieval-System-using-BERT
☆15 · Feb 5, 2019 · Updated 7 years ago
HS-YN / PanoAVQA
Official repository of PanoAVQA: Grounded Audio-Visual Question Answering in 360° Videos (ICCV 2021)
☆16 · Oct 12, 2021 · Updated 4 years ago
Shahabks / Machine-Learning-Algorithm-for-Voice-Analysis
An algorithm that analyses the acoustic features of a voice and creates an acoustic classifier; useful for an automatic speech rater
☆11 · Mar 8, 2019 · Updated 7 years ago
Anup-Deshmukh / TREC_background_linking
IR-BERT at TREC 2020: Leveraging BERT for Semantic Search in Background Linking
☆15 · Feb 21, 2022 · Updated 4 years ago
OFA-Sys / AIR-Bench
AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension
☆133 · Dec 9, 2024 · Updated last year
jaeyeonkim99 / visage
Official implementation of "ViSAGe: Video-to-Spatial Audio Generation" (ICLR 2025)
☆47 · Sep 10, 2025 · Updated 10 months ago
rulin-jone / Confidential-Laboratory
This project is created for our Confidential Laboratory, which is supported by HEU
☆13 · Jul 2, 2018 · Updated 8 years ago
DexiangHong / MANet
☆10 · Aug 23, 2022 · Updated 3 years ago
liuyoude / TWFR-GMM
Time-weighted Frequency Domain Audio Representation (TWFR) with GMM Estimator for Anomalous Sound Detection
☆28 · May 13, 2023 · Updated 3 years ago
chimechallenge / C8DASR-Baseline-NeMo
NeMo: a toolkit for conversational AI
☆13 · May 4, 2024 · Updated 2 years ago
Sakshi113 / MMAU
☆156 · Feb 9, 2026 · Updated 5 months ago