wanghao-cst / Omni-VideoAssistant
Video QA Assistant based on LLMs with frame convolution
☆210Updated last year
Alternatives and similar repositories for Omni-VideoAssistant:
Users that are interested in Omni-VideoAssistant are comparing it to the libraries listed below
- Tools, Templates, and Experiments of LangGPT☆229Updated 8 months ago
- The script for selenium in python. Make automated testing easier! 使用json脚本驱动selenium☆164Updated 9 months ago
- A Template Based Report Rendering Platform.☆330Updated 9 months ago
- ☆117Updated 9 months ago
- 🤖Linux command line query scaffolding☆67Updated 8 months ago
- ☆314Updated last year
- metasequoia-sql 是一款注重性能的 SQL 语法的解析和分析器,适用于 SQL 的格式化、执行和分析场景,致力于打造性能最高的 Python 版 SQL 解析器。☆227Updated 7 months ago
- EasyDeploy is engineered to provide users with end-to-end deployment capabilities for large-scale models.☆128Updated 2 months ago
- EAGLEEYE图像应用开发框架☆158Updated 4 months ago
- OCR(光学字符识别)训练样本生成器,可自动生成用于训练OCR检测和识别模型的图片样本和标注☆132Updated 7 months ago
- ☆204Updated 4 months ago
- 本框架是一种针对数学公式解析的有效工具 支持 Java python C++ API ,能够解析包含嵌套函数,包含函数,数列步长累加等数学公式,返回值是一个数值的结果对象,同时也可以进行比较运算的操作,再进行比较的时候,返回值是一个布尔值结果对象。PS 请尽量使用 1.3…☆186Updated last month
- Machine Learning Experiment Manage Platform☆318Updated last week
- 这是一个使用纯Rust编写的读屏(Screen Reader)项目,用于视力有障碍的人群操作电脑,软件会将屏幕上的各种信息转换成语音输出。☆166Updated 2 months ago
- 相比于SpringCloud Gateway更加轻量级、性能更强的API网关☆178Updated 8 months ago
- 基于imgur的web图片托管页面,即开即用 https://wishmelz.github.io/imgur☆205Updated last year
- A simple and user-friendly underscore variable naming tool.☆182Updated 9 months ago
- 本项目开源基于NextJS的前端, 希望能够提供一个用于生成式AI的文字转视频, 尤其是电影从编剧到视频生成的Web前端平台参考。Everyone can become a director. The Nextjs front-end of an AI driven pla…☆192Updated last year
- ☆348Updated 7 months ago
- 智音语音助手(Zhiyin_Butler)旨在开发一款通用型智能电脑管家,支持在桌面电脑Windows 10/11系统上安装和部署。项目的所有内容遵循Apache License 2.0开源协议,作为通用型电脑管家系统示例供开发者参考学习。☆151Updated 4 months ago
- Pytorch Implementation of "Sinkhorn Distance Minimization for Knowledge Distillation", COLING 2024 and TNNLS 2024☆120Updated 3 weeks ago
- Official code for TPAMI2024 paper: Pixel Distillation: Cost-flexible Distillation across Image Sizes and Heterogeneous Networks☆56Updated 4 months ago
- [ICLR 2025] What do we expect from LMMs as AIGI evaluators and how do they perform?☆143Updated last month
- 简单的静态博客生成器☆171Updated last year
- 身份证OCR智能识别、证件提取以及验证码自动化解析功能,项目核心基于深度学习技术。从数据采集、数据标注 、模型训练、模型度量 、模型服务部署 全流程欢迎讨论。所有自训练模型、finetune欢迎自取使用,并持续关注我输出的更多模型。V:chenganp☆157Updated 3 months ago
- Biomedical Generalist Video Generation Model☆179Updated 5 months ago
- ☆199Updated 9 months ago
- 我的Github个人主页☆191Updated this week
- [AAAI'25 Oral] "MIA-Tuner: Adapting Large Language Models as Pre-training Text Detector".☆142Updated last week
- [ICLR 2025] Tool-Planner: Task Planning with Clusters across Multiple Tools☆104Updated last month