一些大语言模型和多模态模型的生态,主要包括跨模态搜索、投机解码、QAT量化、多模态量化、ChatBot、OCR
☆198Jan 30, 2026Updated 3 months ago
Alternatives and similar repositories for adan_application
Users that are interested in adan_application are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 本项目使用大语言模型完成了复杂任务的长程工具调用☆117Oct 16, 2025Updated 7 months ago
- Zero-human, cold-start construction of long-chain agents in professional domains☆48Nov 10, 2025Updated 6 months ago
- This is a user guide for the MiniCPM and MiniCPM-V series of small language models (SLMs) developed by ModelBest. “面壁小钢炮” focuses on achi…☆308Jul 1, 2025Updated 10 months ago
- A lightweight, production-ready C++ library for LLM tokenization, fully compatible with HuggingFace tokenizer.json.☆28Jan 4, 2026Updated 4 months ago
- ☆21Jan 25, 2026Updated 3 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 使用langraph构建Agentic-RAG☆23Jul 30, 2025Updated 9 months ago
- Accelerating GOT-OCRv2 with VLLM☆10Nov 15, 2024Updated last year
- A Pocket-Sized MLLM for Ultra-Efficient Image and Video Understanding on Your Phone☆24,700May 12, 2026Updated last week
- MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks☆8,871Feb 11, 2026Updated 3 months ago
- ☆221Nov 25, 2025Updated 5 months ago
- 集成Qwen与DeepSeek等先进大语言模型,支持纯LLM+分类层模式及LLM+LoRA+分类层模式,使用transformers模块化设计和训练便于根据需要调整或替换组件。☆21Sep 1, 2025Updated 8 months ago
- Implementation of the paper : Not all attention is needed - Gated Attention Network for Sequence Data (GA-Net) [https://arxiv.org/abs/191…☆13Aug 20, 2020Updated 5 years ago
- Style-Text data synthesis tool☆78Dec 9, 2024Updated last year
- ☆29Aug 19, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- WeGeFT: Weight‑Generative Fine‑Tuning for Multi‑Faceted Efficient Adaptation of Large Models☆23Jul 10, 2025Updated 10 months ago
- Official pytorch implementation of the ICML2024 main conference paper: Pedestrian Attribute Recognition as Label-balanced Multi-label Lea…☆13Jul 22, 2024Updated last year
- (1)弹性区间标准化的旋转位置词嵌入编码器+peft LORA量化训练,提高万级tokens性能支持。(2)证据理论解释学习,提升模型的复杂逻辑推理能力(3)兼容alpaca数据格式。☆44Jul 19, 2023Updated 2 years ago
- ☆11Oct 13, 2023Updated 2 years ago
- [CVPR 2026] OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe☆161Mar 30, 2026Updated last month
- 整理目前开源的最优表格识别模型,完善前后处理,模型转换为ONNX | Organize the currently open-source optimal table recognition models, improve pre-processing and post-…☆950Aug 3, 2025Updated 9 months ago
- 【ArXiv】PDF-Wukong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling☆129Jun 4, 2025Updated 11 months ago
- [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型☆10,024Sep 22, 2025Updated 7 months ago
- Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models (CVPR 2024 Highlight)☆1,951Jan 24, 2026Updated 3 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- KDD 2024 AQA competition 2nd place solution☆12Jul 21, 2024Updated last year
- 欢迎来到 RAG 检索增强生成!这是一个使用 OpenAI API 和 Milvus 向量数据库的问答系统,结合了检索增强生成(RAG)技术。☆10Nov 4, 2024Updated last year
- 模型 llava-Qwen2-7B-Instruct-Chinese-CLIP 增强中文文字识别能力和表情包内涵识别能力,接近gpt4o、claude-3.5-sonnet的识别水平!☆27Jul 23, 2024Updated last year
- ☆37Nov 25, 2025Updated 5 months ago
- 实现Blip2RWKV+QFormer的多模态图文对话大模型,使用Two-Step Cognitive Psychology Prompt方法,仅3B参数的模型便能够出现类人因果思维链。对标MiniGPT-4,ImageBind等图文对话大语言模型,力求以更小的算力和资源实…☆42Jul 17, 2023Updated 2 years ago
- MetaSearch:llm深度研究(deepsearch)功能方案实现☆33Aug 21, 2025Updated 8 months ago
- Finetune and Inference Qwen3-0.6B.☆28May 5, 2025Updated last year
- Re-implement DAGAN in the PyTorch☆13Jan 29, 2022Updated 4 years ago
- Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-R1, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL…☆14,122Updated this week
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- This is the code repo for our paper "Say More with Less: Understanding Prompt Learning Behaviors through Gist Compression".☆13Feb 27, 2024Updated 2 years ago
- 基于LLM的多轮问答系统。结合了意图识别和词槽填充技术☆22Jul 30, 2025Updated 9 months ago
- ☆42Jun 15, 2024Updated last year
- Official code for paper "GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable R…☆61Mar 29, 2026Updated last month
- Vision-Language-Action Optimization with Trajectory Ensemble Voting☆27Feb 18, 2026Updated 3 months ago
- [ICLR 2026] Official repo for "Spotlight on Token Perception for Multimodal Reinforcement Learning"☆62Apr 3, 2026Updated last month
- A family of lightweight multimodal models.☆1,053Nov 18, 2024Updated last year