Real-time video understanding and interaction framework built on a large multimodal model, supporting question answering and communication with the world through text, audio, image, and video.
☆26Jan 26, 2024Updated 2 years ago
Alternatives and similar repositories for Real-Gemini
Users that are interested in Real-Gemini are comparing it to the libraries listed below
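- Crawling techniques using scrapy, pyspider, appium, beautiful soup, selenium, uiautomator2, and others. Crawlers for vulnerability information, threat intelligence, public opinion analysis, self-media platform information, e-commerce product information, and more.☆10Oct 20, 2023Updated 2 years ago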
- Adds some files missing from Visualglm so that Visualglm can be trained; the example performs physiognomy (face reading) recognition on faces☆13Jun 7, 2023Updated 2 years ago
- Deploys open-domain object detection with OpenCV + onnxruntime; includes both C++ and Python versions of the program☆11Jan 4, 2024Updated 2 years ago
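- Sets up the runtime environment on Pytorch and uses the VMRD dataset as the experimental dataset to implement visual manipulation relationship reasoning, producing the manipulation relationship tree for a scene. ROI detection and extraction structure: a Cascade R-CNN cascade network performs object detection; after feature extraction, a grasp proposal network yields potential ROIs, which then enter the cascade network to complete object detection. Grasp…☆11Jul 12, 2022Updated 3 years ago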
- Train a production-grade GPT in less than 400 lines of code. Better than Karpathy's version and GIGAGPT☆16Feb 6, 2026Updated last month
- Chinese localization of the ComfyUI interface (Simplified Chinese version)☆13May 22, 2024Updated last year
- Count pedestrians + draw trajectories + convert to a bird's-eye view☆19Jul 6, 2021Updated 4 years ago
- ☆15Apr 28, 2023Updated 2 years ago
- Streamlit application that helps users analyze RFPs using the latest Gemini 2.0 Flash Experimental LLM.☆19Dec 20, 2024Updated last year
- Fire-Detection-using-YOLOv8☆51Feb 2, 2023Updated 3 years ago
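- Once running, this software floats on top of all other apps and can automatically control WeChat installed on Android: it records every voice message sent by a specified WeChat account and forwards it to a specified WeChat group, fully automated, reducing labor costs. That said, I developed this software myself many years ago (around 2016), and Android and WeChat have since gone through countless updates, so it probably can no longer control…☆20Jul 30, 2022Updated 3 years ago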
- BlockchainGPT: An intuitive, chat-based platform to manage your blockchain environments using natural language processing capabilities.☆11Jul 6, 2023Updated 2 years ago
- A 6 degree of freedom (DOF) robot arm is controlled using the ROS2 robotic manipulation platform, MoveIt 2. The ROS2 Humble version of Mo…☆28Sep 9, 2024Updated last year
- EdgeYOLO + ROS 2 object detection package☆29Mar 28, 2023Updated 2 years ago
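- Crawler for WeChat official account articles: scrapes article information to fetch an account's latest articles (supports crawling multiple accounts). This project does not support fetching article like counts or read counts☆25Apr 11, 2024Updated last year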
- Uses chromedp to fetch trending lists and generate an API☆31Mar 6, 2023Updated 2 years ago
- 3D Gaussian Splatting for underwater scene reconstruction via physical-based appearance-medium decoupling☆23Feb 13, 2026Updated 3 weeks ago
- ☆12Mar 4, 2024Updated 2 years ago
- ☆36Feb 6, 2026Updated 3 weeks ago
- ☆30Dec 16, 2025Updated 2 months ago
- Exploration of the multimodal fuyu-8b model from Adept. 🤓 🔍☆27Nov 7, 2023Updated 2 years ago
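- Given text and images, automatically generates a corresponding short video with subtitles. Fully automated, with batch generation supported☆27Jun 29, 2024Updated last year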
- Open-source multimodal large language model based on baichuan-7b☆72Dec 7, 2023Updated 2 years ago
- Run zero-shot prediction models on your data☆36Dec 19, 2024Updated last year
- Periodically crawls real-time trending topics from the Baidu Search top charts (百度搜索风云榜).☆36Updated this week
- Shift scheduling management system☆11Jul 15, 2015Updated 10 years ago
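- Generates short videos (subtitle translation, AI voice generation, compositing images into animated video, reverse extraction of single images from video)☆33Oct 10, 2023Updated 2 years ago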
- Recognition of prohibited mobile phone use in industrial settings, based on paddlex object detection.☆11Jun 11, 2022Updated 3 years ago
- An Online Tool (using GitHub Actions) to inject Generic System image to the super.img for your Android device if Android Fastbootd fails…☆14Feb 1, 2026Updated last month
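- AI developer platform. The goal is to build a platform that captures video images, calls APIs for intelligent data annotation, and runs automated tests once training is complete.☆34Mar 16, 2018Updated 7 years ago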
- All-terrain adaptive robot track of the China Robot and Artificial Intelligence Competition☆12Apr 26, 2023Updated 2 years ago
- Backend for a Traditional Chinese Medicine smart diagnosis and treatment mini program☆10Aug 28, 2022Updated 3 years ago
- Course paper for "2021 Medical and Health Data Analysis and Mining": a BERT-based news classification experiment on the 20NewsGroups dataset☆10Jun 22, 2021Updated 4 years ago
- Arduino library for Gavesha® Robomatics Gear Motor.☆10Feb 15, 2025Updated last year
- ☆14Aug 10, 2025Updated 6 months ago
- online shopping tool for flowers and gifts☆11Nov 13, 2017Updated 8 years ago
- Clear Face is a Python project with a C++ library for face tracking and multi-model detection on faces☆31Oct 3, 2023Updated 2 years ago
- share data, prompt data, pretraining data☆36Nov 30, 2023Updated 2 years ago
- A vehicle detection project using the yolov5 model and the deepsort algorithm, with a preprocessed dataset included