forXuyx / Cinego
🚀 Lightweight video 🎥 large model 🤖
☆10 · Updated this week
Alternatives and similar repositories for Cinego:
Users who are interested in Cinego are comparing it to the libraries listed below:
- Train a LLaVA model with better Chinese support, and open-source the training code and data. ☆54 · Updated 7 months ago
- Advanced implementation of DeepSeek-R1 featuring Group Relative Policy Optimization (GRPO) for mathematical reasoning AI. Integrates safe… ☆10 · Updated 2 months ago
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization" ☆62 · Updated last month
- CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models ☆110 · Updated last week
- Parameter-Efficient Fine-Tuning for Foundation Models ☆53 · Updated 2 weeks ago
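- The simplest reproduction of R1 results on a small model, explaining the most important essence of O1-like models and DeepSeek R1. Think is all you need. Experiments support the claim that, for strong reasoning ability, the content of the thinking process is the core of AGI/ASI. ☆42 · Updated 2 months ago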
- Repository for the NeurIPS 2024 paper "SearchLVLMs: A Plug-and-Play Framework for Augmenting Large Vision-Language Models by Searching Up… ☆22 · Updated 4 months ago
- TinyLLaVA-Video-R1: Towards Smaller LMMs for Video Reasoning ☆34 · Updated this week
- Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory ☆28 · Updated 11 months ago
- ☆28 · Updated 11 months ago
- Awesome-RAG-VIsion: a curated list of advanced retrieval augmented generation (RAG) for Computer Vision ☆123 · Updated last week
- Deploy Qwen2.5 with FastAPI + vLLM ☆12 · Updated 6 months ago
- In this fast-paced world, we all need a little something to spice up life. Whether you need a glass of sweet talk to lift your spirits or… ☆52 · Updated 2 months ago
- Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme ☆102 · Updated last week
- 🔥 Your Daily Dose of AI Research from Hugging Face 🔥 Stay updated with the latest AI breakthroughs! This bot automatically collects and… ☆50 · Updated this week
- Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines ☆120 · Updated 5 months ago
- An open source implementation of R1 ☆19 · Updated last week
- Awesome Reasoning in MLLMs: Papers and Projects about learning to reason with MLLMs, including Chain-of-Thought (CoT), OpenAI o1, and Dee… ☆48 · Updated last month
- This project aims to collect and collate various datasets for multimodal large model training, including but not limited to pre-training … ☆38 · Updated 6 months ago
- ☆40 · Updated 2 months ago
- Multimodal Open-O1 (MO1) is designed to enhance the accuracy of inference models by utilizing a novel prompt-based approach. This tool wo… ☆29 · Updated 6 months ago
- ☆47 · Updated this week
- PaperHelper: Knowledge-Based LLM QA Paper Reading Assistant with Reliable References ☆14 · Updated 10 months ago
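- Book: "Multimodal Large Models: The New-Generation Technical Paradigm of Artificial Intelligence" (《多模态大模型:新一代人工智能技术范式》), by Liu Yang (刘阳) and Lin Liang (林倞) ☆199 · Updated 4 months ago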
- ☆34 · Updated last month
- Centrally manage all prompts. ☆13 · Updated 4 months ago
- ☆22 · Updated 8 months ago
- Real-time video understanding and interaction through text, audio, image, and video with a large multimodal model. A real-time video understanding and interaction framework built on large multimodal models, via text… ☆23 · Updated last year
- AgileGen: Empowering Agile-Based Generative Software Development through Human-AI Teamwork (accepted by ACM TOSEM) ☆22 · Updated 5 months ago
- Official PyTorch Implementation of ParGo: Bridging Vision-Language with Partial and Global Views. (AAAI 2025) ☆11 · Updated 3 months ago