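An ecosystem of large language models and multimodal models, mainly covering cross-modal search, speculative decoding, QAT quantization, multimodal quantization, chatbots, and OCR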
☆197Jan 30, 2026Updated 4 months ago
Alternatives and similar repositories for adan_application
Users that are interested in adan_application are comparing it to the libraries listed below.
- This project uses large language models to perform long-horizon tool calling for complex tasks☆118Oct 16, 2025Updated 8 months ago
- Zero-human, cold-start construction of long-chain agents in professional domains☆48Nov 10, 2025Updated 7 months ago
- A lightweight, production-ready C++ library for LLM tokenization, fully compatible with HuggingFace tokenizer.json.☆31Jan 4, 2026Updated 5 months ago
- A Toolkit for Running On-device Large Language Models (LLMs) in APP☆83Jul 4, 2024Updated last year
- A Pocket-Sized MLLM for Ultra-Efficient Image and Video Understanding on Your Phone☆25,717Updated this week
- MiniCPM5-1B: A SOTA 1B on-device LLM, small yet powerful.☆9,501Jun 20, 2026Updated last week
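- Wraps sentiment analysis functionality with Django, including sentiment classification with a sentiment lexicon and a BERT model; the model can then be deployed in Docker using TensorFlow Serving.☆12Sep 23, 2019Updated 6 years ago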
- ☆123May 29, 2025Updated last year
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆22May 9, 2025Updated last year
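- Integrates advanced large language models such as Qwen and DeepSeek, supporting a pure LLM + classification layer mode and an LLM + LoRA + classification layer mode; designed and trained modularly with transformers so that components can be adjusted or replaced as needed.☆21Sep 1, 2025Updated 9 months ago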
- Implementation of the paper: Not all attention is needed - Gated Attention Network for Sequence Data (GA-Net) [https://arxiv.org/abs/191…☆13Aug 20, 2020Updated 5 years ago
- Official repository for ACM Multimedia'24 paper "MultiHateClip: A Multilingual Benchmark Dataset for Hateful Video Detection on YouTube a…☆22Aug 11, 2024Updated last year
- ☆29Aug 19, 2024Updated last year
- WeGeFT: Weight‑Generative Fine‑Tuning for Multi‑Faceted Efficient Adaptation of Large Models☆23Jul 10, 2025Updated 11 months ago
- An automatic GradCAM script for ViT and its variants that processes images in batches and visualizes the model results☆18Dec 16, 2024Updated last year
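- (1) A rotary position embedding encoder with elastic interval normalization plus PEFT LoRA quantized training, improving performance support for contexts of tens of thousands of tokens. (2) Evidence-theory-based explanation learning to improve the model's complex logical reasoning. (3) Compatible with the Alpaca data format.☆44Jul 19, 2023Updated 2 years ago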
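- Large language model applications: RAG, NL2SQL, chatbots, pre-training, MoE (mixture-of-experts) models, fine-tuning, reinforcement learning, and Tianchi data competitions☆76Feb 10, 2025Updated last year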
- Collects the best currently open-source table recognition models, improves pre-processing and post-processing, and converts the models to ONNX☆955Aug 3, 2025Updated 10 months ago
- [ArXiv] PDF-Wukong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling☆129Jun 4, 2025Updated last year
- [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model with performance approaching GPT-4o☆10,074Sep 22, 2025Updated 9 months ago
- Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models (CVPR 2024 Highlight)☆1,948Jun 2, 2026Updated 3 weeks ago
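- The llava-Qwen2-7B-Instruct-Chinese-CLIP model improves Chinese text recognition and the ability to understand the implied meaning of memes, approaching the recognition level of gpt4o and claude-3.5-sonnet!☆28Jul 23, 2024Updated last year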
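- Implements a Blip2RWKV+QFormer multimodal image-text dialogue model. Using the Two-Step Cognitive Psychology Prompt method, a model with only 3B parameters can exhibit a human-like causal chain of thought. Benchmarked against image-text dialogue large language models such as MiniGPT-4 and ImageBind, aiming to use less compute and fewer resources to…☆42Jul 17, 2023Updated 2 years ago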
- qwen-nsa☆87Oct 14, 2025Updated 8 months ago
- MetaSearch: an implementation of LLM deep research (deepsearch) functionality☆33Aug 21, 2025Updated 10 months ago
- TinyML and Efficient Deep Learning Computing | MIT 6.S965/6.5940☆45Jun 12, 2026Updated 2 weeks ago
- Finetune and Inference Qwen3-0.6B.☆29May 5, 2025Updated last year
- Re-implementation of DAGAN in PyTorch☆13Jan 29, 2022Updated 4 years ago
- Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-V4, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL…☆14,633Updated this week
- UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience☆75Apr 3, 2026Updated 2 months ago
- A collection of multimodal (MM) + Chat projects☆283Updated this week
- ☆45Jul 28, 2025Updated 11 months ago
- An LLM-based multi-turn question answering system that combines intent recognition and slot filling☆24Jul 30, 2025Updated 10 months ago
- ☆43Jun 15, 2024Updated 2 years ago
- Vision-Language-Action Optimization with Trajectory Ensemble Voting☆26Feb 18, 2026Updated 4 months ago
- A family of lightweight multimodal models.☆1,053Nov 18, 2024Updated last year
- The official codes and datasets for Artistic Text Segmentation (ECCV 2024).☆30Sep 24, 2025Updated 9 months ago
- Train a 1B LLM on 1T tokens from scratch as an individual☆807Apr 27, 2025Updated last year
- [ICLR 2026] Official repo for "Spotlight on Token Perception for Multimodal Reinforcement Learning"☆68Apr 3, 2026Updated 2 months ago