π LLM-I: Transform LLMs into natural interleaved multimodal creators! β¨ Tool-use framework supporting image search, generation, code execution & editing
β41Oct 20, 2025Updated 7 months ago
Alternatives and similar repositories for LLM-I
Users that are interested in LLM-I are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PyTorch Implementation for the paper "Let Me Help You! Neuro-Symbolic Short-Context Action Anticipation" accepted to RA-L'24.β12Nov 27, 2024Updated last year
- [EMNLP 2024 Findings] Wrong-of-Thought: An Integrated Reasoning Framework with Multi-Perspective Verification and Wrong Informationβ13Oct 1, 2024Updated last year
- A comprehensive benchmark for evaluating deep research agents on academic survey tasksβ52Sep 4, 2025Updated 9 months ago
- The official repository of "R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Integration"β140Sep 4, 2025Updated 9 months ago
- A demonstration of the paper NER Retriever: Zero-Shot Named Entity Retrieval with Type-Aware Embeddingsβ39Sep 13, 2025Updated 8 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Poet: Product-oriented Video Captioner for E-commerceβ12Sep 21, 2020Updated 5 years ago
- Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflectionβ56Aug 16, 2025Updated 9 months ago
- Official repository for βReasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Spaceββ18Jan 27, 2026Updated 4 months ago
- ICLR 2021 (spotlight): Graph Convolution with Low-rank Learnable Local Filtersβ16Jan 14, 2021Updated 5 years ago
- Official implementation for RoMaP :Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Samplβ¦β22Aug 5, 2025Updated 10 months ago
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search [SIGIR 2026]β63Jul 4, 2025Updated 11 months ago
- code for "CoMT: A Novel Benchmark for Chain of Multi-modal Thought on Large Vision-Language Models"β19Mar 10, 2025Updated last year
- MUA-RL: MULTI-TURN USER-INTERACTING AGENT REINFORCEMENT LEARNING FOR AGENTIC TOOL USEβ62Nov 5, 2025Updated 7 months ago
- Enemies for your LLMβ37Jan 20, 2026Updated 4 months ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- RAG methods, benchmarks, and toolkitsβ19Nov 28, 2024Updated last year
- β13Jun 15, 2021Updated 4 years ago
- β11Mar 26, 2020Updated 6 years ago
- A yolov5 based application, it uses the prediction results by yolov5 to activate the selected opencv built-in tracking algorithm.β10Jul 24, 2020Updated 5 years ago
- Unsupervised specificity-guided optimization of Image Captioning models to encourage meaningful diversity in the generated captions. Codeβ¦β13May 25, 2025Updated last year
- β12Oct 17, 2024Updated last year
- The dataset CoLan-150K and the concept decomposition in the paper Concept Lancet (CVPR 2025)β20Jan 18, 2026Updated 4 months ago
- Lowering PyTorch's Memory Consumption for Selective Differentiationβ12Aug 29, 2024Updated last year
- Instant Visualization of Point Clouds [Eurographics 2022]β15Apr 8, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official implementation of paper "ACON: Optimizing Context Compression for Long-horizon LLM Agents"β85Oct 14, 2025Updated 7 months ago
- A simple, generic, and flexible keyframe animation library for Rust.β30Jun 1, 2026Updated last week
- β25Sep 5, 2025Updated 9 months ago
- ASL Fingerspelling recognition in the wildβ13Nov 21, 2019Updated 6 years ago
- Sparse Transformer with limited attention span in PyTorchβ15Apr 4, 2021Updated 5 years ago
- [Arxiv2022] Interpreting Class Conditional GANs with Channel Awarenessβ17Apr 4, 2022Updated 4 years ago
- β17Nov 10, 2021Updated 4 years ago
- (ICLR 2025 Spotlight) Official code repository for Interleaved Scene Graph.β31Aug 7, 2025Updated 10 months ago
- β15Nov 26, 2019Updated 6 years ago
- Open source password manager - Proton Pass β’ AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Official Implementation for the paper "Integrative Decoding: Improving Factuality via Implicit Self-consistency"β33Apr 12, 2025Updated last year
- This repo contains the official implementation of paper "Layered Controllabel Video Generation".β13Oct 31, 2022Updated 3 years ago
- β19Jun 20, 2025Updated 11 months ago
- πThe official repository of "Rethinking Cross-Generator Image Forgery Detection through DINOv3"β25Dec 2, 2025Updated 6 months ago
- Official implementation of Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And Moreβ25Feb 25, 2025Updated last year
- We deal with the problem of zero-shot cross-modal image retrieval involving color and sketch images through a novel deep representation lβ¦β14Sep 13, 2021Updated 4 years ago
- β56Nov 6, 2024Updated last year