A framework for real-time video understanding and interaction built on a large multimodal model, supporting question answering and conversation with the world through text, audio, image, and video.
☆27Jan 26, 2024Updated 2 years ago
Alternatives and similar repositories for Real-Gemini
Users who are interested in Real-Gemini are comparing it to the libraries listed below.
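- 🍅 Mobile deployment supporting the YOLOv5s, YOLOv4-tiny, MobileNetV2-YOLOv3-nano, Simple-Pose, and Yolact models on iOS and Android, using the NCNN framework.☆13Aug 20, 2020Updated 5 years ago
- Open-domain object detection deployed with OpenCV + onnxruntime, with both C++ and Python versions of the program.☆11Jan 4, 2024Updated 2 years ago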
- Contains the model patches and the eval logs from the passing swe-bench-lite run.☆10Jun 28, 2024Updated last year
- Implementing a small language model of your own☆11Jun 15, 2024Updated last year
- Unofficial implementation of the paper: "NeRF-In: Free-Form NeRF Inpainting with RGB-D Priors"☆11Apr 30, 2023Updated 2 years ago
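- Visual manipulation relationship reasoning built on a PyTorch environment, using the VMRD dataset for experiments; it can also produce the manipulation relationship tree for a scene. ROI detection and extraction structure: a Cascade R-CNN network performs object detection; after feature extraction, a grasp proposal network yields candidate ROIs, which then pass through the cascade network to complete object detection. …☆11Jul 12, 2022Updated 3 years ago
- A reimplementation of LIIF (Learning Continuous Image Representation with Local Implicit Image Function) using lightning+hydra☆11Mar 26, 2021Updated 5 years ago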
- ☆15Apr 28, 2023Updated 2 years ago
- 🔥🔥🔥Python+Yolov5 fire/flame and smoke recognition and detection☆31Dec 23, 2025Updated 3 months ago
- demo☆10Mar 23, 2018Updated 8 years ago
- Implementation of the GLOM model for text☆11Mar 4, 2021Updated 5 years ago
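- Web-crawling techniques including scrapy, pyspider, appium, beautiful soup, selenium, and uiautomator2. Crawlers for vulnerability information, threat intelligence, public-opinion analysis, self-media platform information, e-commerce product information, and more.☆10Oct 20, 2023Updated 2 years ago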
- EdgeYOLO + ROS 2 object detection package☆29Mar 28, 2023Updated 3 years ago
- Just messing around with PyTorch 1.0's JIT compiler and their new C++ API Libtorch.☆19Dec 19, 2019Updated 6 years ago
- A very simple tool that compresses the overall size of the ONNX model by aggregating duplicate constant values as much as possible.☆53Sep 8, 2022Updated 3 years ago
- [ICCV 2023] Learning Fine-Grained Features for Pixel-wise Video Correspondences☆18Mar 3, 2024Updated 2 years ago
- ☆10Aug 24, 2023Updated 2 years ago
- ALTER: Auxiliary Text Rewriting Tool for Natural Language Generation☆16Dec 10, 2022Updated 3 years ago
- [MICCAI 2025] FEAT:Full-Dimensional Efficient Attention Transformer for Medical Video Generation.☆23Sep 24, 2025Updated 6 months ago
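- An open-source implementation for fine-tuning SmolVLM.☆65Sep 12, 2025Updated 7 months ago
- Companion source code for the book 《Tensorflow+Keras深度学习人工智能实践应用》: the code I typed out for each chapter, plus the required data files☆14Oct 28, 2019Updated 6 years ago
- Counts pedestrians, draws their trajectories, and converts the scene to a bird's-eye view☆19Jul 6, 2021Updated 4 years ago
- Once running, this software floats above all other apps and automatically controls WeChat installed on Android: it records each voice message sent by a specified WeChat account and forwards it to a specified WeChat group, fully automated to reduce manual effort. I developed it myself many years ago (around 2016), and Android and WeChat have since gone through countless updates, so it probably can no longer control…☆20Jul 30, 2022Updated 3 years ago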
- Long text summarization using pointer generator networks☆12Aug 11, 2018Updated 7 years ago
- train gpt-2 in colab☆13Apr 6, 2019Updated 7 years ago
- ☆22Oct 10, 2020Updated 5 years ago
- PreciseCam: Precise Camera Control for Text-to-Image Generation☆25May 7, 2025Updated 11 months ago
- autologic is a Python package that implements the SELF-DISCOVER framework proposed in the paper SELF-DISCOVER: Large Language Models Self…☆60Feb 20, 2024Updated 2 years ago
- A DCGAN implementing all the tricks from recent papers up to 2020 and from all over the internet. Trained on CelebA at 157x128. "GAN Hack…☆14Nov 6, 2020Updated 5 years ago
- A 6 degree of freedom (DOF) robot arm is controlled using the ROS2 robotic manipulation platform, MoveIt 2. The ROS2 Humble version of Mo…☆29Sep 9, 2024Updated last year
- Fire-Detection-using-YOLOv8☆52Feb 2, 2023Updated 3 years ago
- A WeChat-style bottom tab switching bar☆10Mar 31, 2016Updated 10 years ago
- PyTorch implementation of ECCV 2024 paper "Confidence-Based Iterative Generation for Real-World Image Super-Resolution"☆16Nov 17, 2024Updated last year
- 2019 Daguan Cup (达观杯) entity recognition☆19Sep 12, 2019Updated 6 years ago
- Exploration of the multi modal fuyu-8b model of Adept. 🤓 🔍☆27Nov 7, 2023Updated 2 years ago
- Tracking the latest and greatest research papers on diffusion large language models.☆33Mar 13, 2026Updated last month
- A multimodal large-scale model, which performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly surpasses the p…☆14Feb 5, 2024Updated 2 years ago
- WordPress plugin to add useful decoration features to the Gutenberg RichText editor toolbar.☆10Updated this week
- Utilities and configuration for running puppeteer against WordPress☆10Dec 6, 2022Updated 3 years ago