mangoggul/YOLO-MultiModal

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mangoggul/YOLO-MultiModal)

mangoggul / YOLO-MultiModal

☆13

Alternatives and similar repositories for YOLO-MultiModal

Users that are interested in YOLO-MultiModal are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mcw1217 / MultiModal-YOLO_RGB_Depth_Thermal
View on GitHub
This project uses three types of images as inputs RGB, Depth, and thermal images to perform object detection with YOLOv8.
☆31Jul 23, 2024Updated last year
haozhiwen-fighting / Contrast-enhanced-Ultrasound-for-Thyroid-Nodules-Diagnosis
View on GitHub
☆10Jun 6, 2024Updated 2 years ago
DocF / CMAFF
View on GitHub
Cross-Modality Attentive Feature Fusion for Object Detection in Multispectral Remote Sensing Imagery
☆16Oct 7, 2022Updated 3 years ago
HERIUN / vsumm-reinforce_re
View on GitHub
This repo contains the Pytorch implementation of the AAAI'18 paper - Deep Reinforcement Learning for Unsupervised Video Summarization wit…
☆11Jun 5, 2023Updated 3 years ago
ufal / MLASK
View on GitHub
EACL 2023 paper "MLASK: Multimodal Summarization of Video-based News Articles"
☆11Nov 7, 2023Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
zhanghengdev / CFR
View on GitHub
[ICIP 2020]"Multispectral Fusion for Object Detection with Cyclic Fuse-and-Refine Blocks"
☆14Oct 6, 2020Updated 5 years ago
AdityaSinghMandrawal / Object-Detection-System-using-YOLO-V3
View on GitHub
An innovative object detection system for visually impaired individuals. Using YOLO V3 algorithm and the extensive COCO dataset, our syst…
☆12Jul 13, 2024Updated 2 years ago
gbc-iitd / US_UCL
View on GitHub
[MICCAI'22] Unsupervised Contrastive Learning on Gall Bladder Ultrasound Videos
☆11May 28, 2023Updated 3 years ago
leeh43 / Singularity_Deeplesion
View on GitHub
☆11Jun 5, 2021Updated 5 years ago
SiddharthUchil / Vehicle-Counting-YOLOv8-DeepSORT
View on GitHub
Vehicle counting system with YOLOv8 and DeepSORT
☆10Aug 23, 2023Updated 2 years ago
QuincyQAQ / YOLOv8-Multi-Modal-Fusion-Network-RGB-IR
View on GitHub
☆63Nov 26, 2024Updated last year
cl-victor1 / Me
View on GitHub
I fine-tuned (p-tuning) Tsinghua’s open-source large language model, ChatGLM2-6B, using several years of my WeChat chat history. Inspired…
☆12Mar 6, 2024Updated 2 years ago
gamalahmed3265 / Flask-Yolov8
View on GitHub
☆13Jun 17, 2023Updated 3 years ago
longbai1006 / CAT-ViL
View on GitHub
Official implementation of “CAT-ViL: Co-Attention Gated Vision-Language Embedding for Visual Question Localized-Answering in Robotic Surg…
☆18Jul 7, 2024Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
atharvguitarist / SpeedVision
View on GitHub
SpeedVision is an AI-powered tool that detects and calculates vehicle speed from video footage using YOLO-based object detection and fram…
☆14Sep 22, 2024Updated last year
gonghao51 / gonghao51.github.io
View on GitHub
☆10May 1, 2021Updated 5 years ago
jason-li-831202 / Object-Segmentation-Web
View on GitHub
This project used Yolov8/AnimeGAN and Flask to accomplish the task of background segmentation , background remove and background replacem…
☆12Apr 12, 2024Updated 2 years ago
JeunyuLi / MUAF
View on GitHub
☆15Jun 27, 2023Updated 3 years ago
ifzhang / DCNv2
View on GitHub
Deformable Convolutional Networks v2 with Pytorch
☆33Dec 2, 2020Updated 5 years ago
zzj-dyj / CLF-Net
View on GitHub
☆21Sep 9, 2022Updated 3 years ago
iamgmujtaba / LTC-SUM
View on GitHub
Implementation of LTC-SUM: Lightweight Client-driven Personalized Video Summarization Framework Using 2D CNN
☆22Jul 11, 2023Updated 3 years ago
jgfranco17 / depth-mapping
View on GitHub
3D scene mapping system that using PyTorch's MiDaS model to estimate scene point cloud
☆13Jan 10, 2025Updated last year
doggystyle-star / real_PX4_yolov5
View on GitHub
基于optitrack定位的无人机目标跟踪(target tracking)
☆13Oct 16, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
HighnessAtharva / Media-Analysis
View on GitHub
Data Science Exercises based on real-world scenarios with explanatory comments and prettified output.
☆15May 8, 2023Updated 3 years ago
amy-choi / AttackDefenseYOLO
View on GitHub
☆12May 24, 2023Updated 3 years ago
dennisdeneve / AST-GCN
View on GitHub
AST-GCN: Attribute-Augmented Spatiotemporal Graph Convolutional Network for Traffic Forecasting. This is my implementation of this model …
☆11Aug 31, 2023Updated 2 years ago
xbr2017 / DeepLearning-500-questions
View on GitHub
深度学习500问，以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述，以帮助自己及有需要的读者。全书分为17个章节，20多万字。由于水平有限，书中不妥之处恳请广大读者批评指正。未完待续............ 如有意合作，联系sc…
☆19Nov 12, 2018Updated 7 years ago
AlfredQin / STNet
View on GitHub
☆17Jul 18, 2023Updated 3 years ago
luiscarlosgph / videosum
View on GitHub
Simple video summarisation Python package.
☆25Jan 29, 2024Updated 2 years ago
pangzss / pytorch-CTVSUM
View on GitHub
Pytorch code for paper Contrastive Losses Are Natural Criteria for Unsupervised Video Summarization
☆22Jan 7, 2023Updated 3 years ago
nchucvml / STVT
View on GitHub
Video Summarization With Spatiotemporal Vision Transformer
☆23Jul 5, 2023Updated 3 years ago
yyong008 / fastapi-antd-prochat
View on GitHub
A Chat with AI
☆11May 11, 2026Updated 2 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
anbangli / XiaoLoong-DevCpp
View on GitHub
XiaoLoong Dev-C++, improved fork of Orwell Dev-C++
☆21Jan 24, 2026Updated 5 months ago
renqi1 / yolov5_woodscape
View on GitHub
基于yolov5，在woodscape数据集上实现旋转框目标检测+语义分割
☆13Mar 4, 2024Updated 2 years ago
akbartus / Yolov8-Pose-Detection-on-Browser
View on GitHub
Example of YOLOv8 pose detection (estimation) on browser. It shows implementations powered by ONNX and TFJS served through JavaScript wit…
☆15Jun 9, 2024Updated 2 years ago
theopsall / Video-Summarization
View on GitHub
Multimodal summarization of user-generated videos from wearable cameras
☆23Jun 22, 2025Updated last year
FutureTwT / PyTorch-Geometric-Study
View on GitHub
关于Pytorch-Geometric的学习，包括官方文档的基本内容和部分API的使用方式，以及官方源码中的示例代码和Pytorch-Geometric的部分源码实现
☆21Dec 2, 2020Updated 5 years ago
adrian-soch / 3D-LiDAR-YOLOv8-obb
View on GitHub
3D LiDAR Object Detection using YOLOv8-obb (oriented bounding box).
☆16Sep 6, 2024Updated last year
DSL-Lab / echoglad
View on GitHub
EchoGLAD: Hierarchical Graph Neural Networks for Left Ventricle Landmark Detection on Echocardiograms
☆21Apr 17, 2023Updated 3 years ago