mahdeslami11/acoustic-model

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mahdeslami11/acoustic-model)

mahdeslami11 / acoustic-model

☆44

Alternatives and similar repositories for acoustic-model

Users that are interested in acoustic-model are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

AndyTang15 / FLAG3D
View on GitHub
☆19Jun 22, 2026Updated last month
AndyTang15 / FLAG3Dv2
View on GitHub
☆25May 9, 2024Updated 2 years ago
RuiChen96 / FingER
View on GitHub
[ACM MM 25] FingER: Content Aware Fine-grained Evaluation with Reasoning for AI-Generated Videos
☆17Jul 17, 2025Updated last year
Yxxxb / LAVT-RS
View on GitHub
[CVPR'2022, TPAMI'2024] LAVT: Language-Aware Vision Transformer for Referring Segmentation
☆26Jan 21, 2025Updated last year
VoyageWang / VG-Refiner
View on GitHub
The repository of VG-Refiner paper
☆20Dec 9, 2025Updated 7 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
GuanxingLu / Subspace-Clustering
View on GitHub
[IEEE TCSVT 2023] The implementation of our paper Semi-Supervised Subspace Clustering via Tensor Low-Rank Representation.
☆26Dec 21, 2023Updated 2 years ago
RobertLuo1 / CoHD
View on GitHub
The official implementation of A Counting-Aware Hierarchical Decoding Framework for Generalized Referring Expression Segmentation
☆27Aug 17, 2025Updated 11 months ago
InvincibleWyq / ChatVID
View on GitHub
Chat about anything on any video!
☆39Sep 5, 2023Updated 2 years ago
AMAP-ML / Peak-End-Net
View on GitHub
[ACM MM 2026] Peak-End-Net: A Peak-End Rule Inspired Framework for Generalizable Video Aesthetic Assessment
☆26Jul 17, 2026Updated 2 weeks ago
moonsliu / Pro-Motion
View on GitHub
Plan, Posture and Go: Towards Open-World Text-to-Motion Generation
☆42Nov 19, 2024Updated last year
wysoczanska / clip-diy
View on GitHub
Official implementation of the WACV 2024 paper CLIP-DIY
☆34Dec 20, 2023Updated 2 years ago
shiyi-zh0408 / NAE_CVPR2024
View on GitHub
[CVPR 2024] Narrative Action Evaluation with Prompt-Guided Multimodal Interaction
☆43May 16, 2024Updated 2 years ago
April-Yz / ManiGaussian_Bimanual
View on GitHub
[IROS 2025] ManiGaussian++: General Robotic Bimanual Manipulation with Hierarchical Gaussian World Model
☆46Jun 26, 2025Updated last year
EternalEvan / DPMesh
View on GitHub
The repository contains the official implementation of "DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery", CVPR 2024
☆45Jun 4, 2024Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
thunlp / KARL
View on GitHub
KARL: Knowledge-Aware Reasoning and Reinforcement Learning for Knowledge-Intensive Visual Grounding
☆68Apr 5, 2026Updated 3 months ago
UCSC-VLAA / m1
View on GitHub
[ML4H'25] m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning in Large Language Models
☆51Dec 21, 2025Updated 7 months ago
aim-uofa / SINE
View on GitHub
[NeurIPS'24] A Simple Image Segmentation Framework via In-Context Examples
☆68Oct 29, 2024Updated last year
HVision-NKU / Cascade-CLIP
View on GitHub
Official implement of ICML2024 Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation
☆58Aug 15, 2024Updated last year
yongliu20 / Awesome-Unified-Understanding-and-Generation
View on GitHub
☆52Aug 22, 2025Updated 11 months ago
jiaosiyu1999 / MAFT
View on GitHub
☆60Aug 12, 2024Updated last year
DongSky / LPT
View on GitHub
☆56Oct 5, 2022Updated 3 years ago
UCSC-VLAA / MedVLThinker
View on GitHub
[ML4H'25] MedVLThinker: Simple Baselines for Multimodal Medical Reasoning
☆60Dec 21, 2025Updated 7 months ago
thunlp / Migician
View on GitHub
[ACL2025 Findings] Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models
☆90May 20, 2025Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
yongliu20 / SCAN
View on GitHub
[CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"
☆77Sep 23, 2024Updated last year
mc-lan / ClearCLIP
View on GitHub
[ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference
☆100Mar 26, 2025Updated last year
shiyi-zh0408 / Meta-CoT
View on GitHub
[CVPR 2026] Official code of the paper "Meta-CoT: Enhancing Granularity and Generalization in Image Editing"
☆79May 6, 2026Updated 2 months ago
zwyang6 / SeCo
View on GitHub
[CVPR2024] Separate and Conquer: Decoupling Co-occurrence via Decomposition and Representation for Weakly Supervised Semantic Segmentatio…
☆81Oct 10, 2024Updated last year
zwyang6 / ExCEL
View on GitHub
[CVPR2025] Exploring CLIP’s Dense Knowledge for Weakly Supervised Semantic Segmentation
☆69Jun 21, 2025Updated last year
Haochen-Wang409 / TreeVGR
View on GitHub
[ICLR'26] Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology
☆91Jan 26, 2026Updated 6 months ago
ChangyuanWang17 / QVLM
View on GitHub
[NeurIPS'24]Efficient and accurate memory saving method towards W4A4 large multi-modal models.
☆102Jan 3, 2025Updated last year
linyq2117 / TagCLIP
View on GitHub
[AAAI 2024] TagCLIP: A Local-to-Global Framework to Enhance Open-Vocabulary Multi-Label Classification of CLIP Without Training
☆116Jan 9, 2024Updated 2 years ago
linyq2117 / SAMRefiner
View on GitHub
[ICLR 2025] SAMRefiner: Taming Segment Anything Model for Universal Mask Refinement
☆100Apr 19, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
shijxcs / LIFT
View on GitHub
Source code for the paper "Long-Tail Learning with Foundation Model: Heavy Fine-Tuning Hurts" (ICML 2024)
☆110Oct 22, 2024Updated last year
showlab / VideoLISA
View on GitHub
[NeurlPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos
☆149Dec 26, 2024Updated last year
WalBouss / GEM
View on GitHub
[CVPR24] Official Implementation of GEM (Grounding Everything Module)
☆139Apr 10, 2025Updated last year
sunanhe / MKT
View on GitHub
Official implementation of "Open-Vocabulary Multi-Label Classification via Multi-Modal Knowledge Transfer".
☆129Nov 7, 2024Updated last year
dvruette / gidd
View on GitHub
Code accompanying the paper "Generalized Interpolating Discrete Diffusion"
☆121Jun 9, 2025Updated last year
lxtGH / DenseWorld-1M
View on GitHub
Code and dataset link for "DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World"
☆129Oct 2, 2025Updated 9 months ago
EternalEvan / FlowIE
View on GitHub
[CVPR 2024 oral]This repository contains the official implementation of "FlowIE: Efficient Image Enhancement via Rectified Flow"
☆153Jan 13, 2025Updated last year