人工智能实验五:多模态情感分类
☆16Jul 14, 2022Updated 3 years ago
Alternatives and similar repositories for ai_lab5
Users that are interested in ai_lab5 are comparing it to the libraries listed below
Sorting:
- ☆15Apr 28, 2023Updated 2 years ago
- 使用pytorch完成的一个多模态分类任务,文本和图像部分分别使用了bert和resnet提取特征(在config里可以组合多种模型),在我的小规模数据集上取得了良好的性能(验证集acc96%)☆83Mar 25, 2023Updated 2 years ago
- 一个用YOLO足球视频分析的任务,检测视频中的人与球。 A task of football video analysis to detect people and balls in the video with YOLO☆12Sep 5, 2020Updated 5 years ago
- A django-yolov5 starter webapp. Based on yolov5-flask example.☆11Mar 6, 2022Updated 4 years ago
- 本项目基于RuoYi-Vue框架为xiaozhi-esp32提供Java后端聊天服务器。帮助个人、企业快速部署的xiaozhi-esp32后端服务。☆21Jun 19, 2025Updated 8 months ago
- Mediapipe 0.10.1 with CUDA GPU Support python libs☆10Dec 1, 2023Updated 2 years ago
- 疲劳检测☆11Jan 9, 2022Updated 4 years ago
- ☆14Jan 6, 2025Updated last year
- Code repository for the BMVC 2022 paper: Geometry Driven Progressive Warping for One Shot Face Animation☆12Jan 6, 2023Updated 3 years ago
- machine_learn☆13Aug 20, 2022Updated 3 years ago
- This repo would give multi-task keypoint detect code based yolov8. The landmarks or keypoints with different classes and numbers can be …☆12Feb 28, 2023Updated 3 years ago
- ☆10Feb 13, 2025Updated last year
- 一起来养一只拥有专属记忆的AI猫猫吧!☆10Oct 25, 2024Updated last year
- 基于yolov5和django框架的web端人脸识别并打码系统☆10Aug 11, 2023Updated 2 years ago
- ☆13Mar 9, 2024Updated 2 years ago
- 基于Yolov5-Deepsort-Fastreid源码,重构了视频行人MOT和行人ReID特征提取代码、接口☆14Mar 15, 2023Updated 2 years ago
- 人脸识别、人脸细粒度表情识别、异常行为检测和识别☆12Apr 11, 2022Updated 3 years ago
- Adaptive Global-Local Representation Learning and Selection for Cross-Domain Facial Expression Recognition (TMM 2024)☆16Aug 13, 2024Updated last year
- Submodule for Grounded-SAM☆12Apr 17, 2023Updated 2 years ago
- Violence detection using CNN+LSTM model☆11Oct 11, 2022Updated 3 years ago
- SolidUI documents.☆12Aug 1, 2023Updated 2 years ago
- 使用tensorflow.js训练AI模型,包括线性回归、逻辑回归、图像识别、语音识别、迁移学习、加载已有模型、模型转化等等☆12Jan 26, 2023Updated 3 years ago
- ☆13Sep 6, 2025Updated 6 months ago
- 专注于解决数字图像处理领域的几个核心问题:图像转换,图像相似度,目标定位,目标检测,图像分割,图像聚类和图像分类. 为相关领域的研发人员提供完整的通用设计与参考实现. 涵盖了多种数字图像处理,计算机视觉,计算机图形算法.☆15Jul 31, 2022Updated 3 years ago
- CosyVoice语音合成简易API☆14Nov 1, 2024Updated last year
- Used LSTM Network to classify eeg signals based on stimuli the subject recieved (visual or audio)☆16Dec 12, 2017Updated 8 years ago
- 将零一万物 YI-34B 模型 API 转换为各种使用 OpenAI API 的开源软件支持的格式,无需修改开源软件配置或代码。☆12Jan 13, 2024Updated 2 years ago
- 目标检测算法主要包括:两类two-stage和one-stage 一类是two-stage,two-stage检测算法将检测问题划分为两个阶段,首先产生候选区域(region proposals),然后对候选区域分类(一般还需要对位置精修),这一类的典型代表是R-CNN…☆15Sep 5, 2021Updated 4 years ago
- A very simple single target tracking tool and UI interface(only STARK currently)☆13Jun 9, 2022Updated 3 years ago
- Facebook Hatebook Memes Challenge☆12Jan 28, 2021Updated 5 years ago
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated 11 months ago
- SpeakControl 是基于ssfrpa开发的可自定义任意指令的语音控制,指令可以是简单的运行某程序、也可以是复杂流程逻辑。☆18Jul 13, 2025Updated 7 months ago
- 探索智能零售领域的图像识别方案,从而让机器更精准地识别商品,通过更快捷地购物带来全新的用户体验。☆12Jun 15, 2021Updated 4 years ago
- Run SOTA Vision-Language Model Florence-2 on your data!☆15Mar 27, 2025Updated 11 months ago
- This project provides a production-ready, real-time inference server for LatentSync, enabling high-quality, low-latency 2D digital human …☆21Aug 16, 2025Updated 6 months ago
- 补充了一些Visualglm缺少的文件,可以对Visualglm进行训练,实例中是对人脸做了面相的识别☆13Jun 7, 2023Updated 2 years ago
- 基于Pytorch对运行环境搭建,选用VMRD数据集作为实验数据集,实现视觉操作关系推理,并可以获取场景下的操作关系树,ROI检测提取结构:使用Cascade R-CNN级联网络实现物体的目标检测,特征提取后通过抓取提议网络得到潜在ROIs,再进入级联网络中完成目标检测。抓…☆11Jul 12, 2022Updated 3 years ago
- 后端python flask 框架吐API接口给前端,前端用vue+echarts实现数据可视化☆10Mar 1, 2024Updated 2 years ago
- C#客户端程序,用于启动RealSense摄像头、采集视频图像流、采集人脸特征点信息,并将该客户端发布为Http服务器,提供GET请求接口和Post请求接口,以供JS或者web调用☆13Aug 11, 2018Updated 7 years ago