infinigence / Infini-Megrez
☆310Updated 5 months ago
Alternatives and similar repositories for Infini-Megrez
Users that are interested in Infini-Megrez are comparing it to the libraries listed below
- ☆230Updated 3 months ago
- This is a user guide for the MiniCPM and MiniCPM-V series of small language models (SLMs) developed by ModelBest. “面壁小钢炮” focuses on achi…☆243Updated 7 months ago
- A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities.☆38Updated 5 months ago
- GLM Series Edge Models☆141Updated 3 months ago
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent☆329Updated last month
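- Use free LLM APIs together with your private-domain data to generate SFT training data (entirely free of charge); supports the training data formats of tools such as llamafactory. Synthetic data☆161Updated 6 months ago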
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.☆234Updated 3 months ago
- GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation☆178Updated this week
- Chinese Mixtral-8x7B (Chinese-Mixtral-8x7B)☆651Updated 9 months ago
- FlexRAG: A RAG Framework for Information Retrieval and Generation.☆169Updated last month
- Alpaca Chinese Dataset -- Chinese instruction fine-tuning dataset☆205Updated 7 months ago
- GPT-4o-level, real-time spoken dialogue system.☆327Updated 4 months ago
- Mixture-of-Experts (MoE) Language Model☆188Updated 8 months ago
- Repo for NAACL 2025 Paper "Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization"☆274Updated 4 months ago
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆253Updated this week
- ☆247Updated 9 months ago
- A Toolkit for Running On-device Large Language Models (LLMs) in APP☆72Updated 10 months ago
- ☆132Updated 3 months ago
- Repository for Phi3 Chinese post-trained models☆321Updated 6 months ago
- ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents☆479Updated 2 months ago
- [ACL2025 demo track] ROGRAG: A Robustly Optimized GraphRAG Framework☆134Updated 3 weeks ago
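- Uses langchain for task planning and builds conversational scenario resources for each subtask. An MCTS task executor lets every subtask draw on the resources in its context and explore through self-reflection to reach its own best answer. The approach depends on the model's alignment preferences; for each preference we designed an engineering framework that applies a sampling strategy over self-assigned rewards for different answers☆29Updated 3 weeks ago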
- Deep Reasoning Translation via Reinforcement Learning (arXiv preprint 2025); DRT: Deep Reasoning Translation via Long Chain-of-Thought (a…☆222Updated this week
- Collect every awesome work about r1!☆372Updated last month
- vLLM Documentation in Simplified Chinese☆75Updated 2 weeks ago
- The official code for "Aurora: Activating Chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning"☆262Updated last year
- ☆173Updated 3 months ago
- 🌐 WebWalker [ACL2025] & WebDancer [Preprint]☆421Updated this week
- A visualization tool for deeper understanding and easier debugging of RLHF training.☆203Updated 3 months ago
- ☆143Updated last year