Kaikaikaifang / divine-agentLinks
Creating Your Divine Agent 😇
☆10Updated last month
Alternatives and similar repositories for divine-agent
Users that are interested in divine-agent are comparing it to the libraries listed below
Sorting:
- SwanLab Local Visualization Python Package Plugin|SwanLab本地可视化python包插件☆13Updated 3 months ago
- Our 2nd-gen LMM☆33Updated last year
- Built on the robust XTuner backend framework, XTuner Chat GUI offers a user-friendly platform for quick and efficient local model inferen…☆13Updated last year
- ☆29Updated 10 months ago
- LLaVA combines with Magvit Image tokenizer, training MLLM without an Vision Encoder. Unifying image understanding and generation.☆37Updated last year
- ☆35Updated 9 months ago
- ☆13Updated 10 months ago
- ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory☆128Updated last week
- Implementation of SmoothCache, a project aimed at speeding-up Diffusion Transformer (DiT) based GenAI models with error-guided caching.☆44Updated 3 months ago
- Implementation of the proposed MaskBit from Bytedance AI☆82Updated 7 months ago
- A Framework for Decoupling and Assessing the Capabilities of VLMs☆44Updated last year
- Daily tracking of awesome aigc papers, including video generation, video editing, animation.☆22Updated last month
- Music large model based on InternLM2-chat.☆22Updated 6 months ago
- ☆15Updated 3 months ago
- MLLM @ Game☆14Updated last month
- Official PyTorch implementation of EMOVA in CVPR 2025 (https://arxiv.org/abs/2409.18042)☆54Updated 3 months ago
- Towards Fine-grained Audio Captioning with Multimodal Contextual Cues☆68Updated 2 weeks ago
- ☆102Updated this week
- A light-weight and high-efficient training framework for accelerating diffusion tasks.☆48Updated 9 months ago
- Train deepseek r1-like reasoning LLM with ease | 轻松训练1个deepseek r1类的推理LLM☆16Updated 4 months ago
- Distill thinking dataset more compactly and accurately!☆31Updated 3 weeks ago
- imagetokenizer is a python package, helps you encoder visuals and generate visuals token ids from codebook, supports both image and video…☆34Updated last year
- mllm-npu: training multimodal large language models on Ascend NPUs☆90Updated 10 months ago
- ☆72Updated last month
- ☆22Updated 4 months ago
- RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best…☆47Updated 3 months ago
- [ICCV 2025] Official Repository of VideoLLaMB: Long Video Understanding with Recurrent Memory Bridges☆70Updated 4 months ago
- 🔥Your Daily Dose of AI Research from Hugging Face 🔥 Stay updated with the latest AI breakthroughs! This bot automatically collects and…☆52Updated this week
- ☆17Updated 2 months ago
- LMM solved catastrophic forgetting, AAAI2025☆43Updated 2 months ago