π LLM-I: Transform LLMs into natural interleaved multimodal creators! β¨ Tool-use framework supporting image search, generation, code execution & editing
β41Oct 20, 2025Updated 5 months ago
Alternatives and similar repositories for LLM-I
Users that are interested in LLM-I are comparing it to the libraries listed below
Sorting:
- Official Implementation of OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generationβ40Jul 5, 2025Updated 8 months ago
- PyTorch Implementation for the paper "Let Me Help You! Neuro-Symbolic Short-Context Action Anticipation" accepted to RA-L'24.β12Nov 27, 2024Updated last year
- The official repository of "R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Integration"β139Sep 4, 2025Updated 6 months ago
- [EMNLP 2024 Findings] Wrong-of-Thought: An Integrated Reasoning Framework with Multi-Perspective Verification and Wrong Informationβ13Oct 1, 2024Updated last year
- A comprehensive benchmark for evaluating deep research agents on academic survey tasksβ51Sep 4, 2025Updated 6 months ago
- β12Nov 5, 2024Updated last year
- β19Feb 25, 2024Updated 2 years ago
- Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflectionβ56Aug 16, 2025Updated 7 months ago
- Official repository for βReasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Spaceββ18Jan 27, 2026Updated last month
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Searchβ62Jul 4, 2025Updated 8 months ago
- Official repository for ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Useβ29Nov 4, 2025Updated 4 months ago
- MUA-RL: MULTI-TURN USER-INTERACTING AGENT REINFORCEMENT LEARNING FOR AGENTIC TOOL USEβ58Nov 5, 2025Updated 4 months ago
- Official implementation of paper "ACON: Optimizing Context Compression for Long-horizon LLM Agents"β57Oct 14, 2025Updated 5 months ago
- Enemies for your LLMβ35Jan 20, 2026Updated 2 months ago
- RAG methods, benchmarks, and toolkitsβ19Nov 28, 2024Updated last year
- β10Oct 20, 2020Updated 5 years ago
- A yolov5 based application, it uses the prediction results by yolov5 to activate the selected opencv built-in tracking algorithm.β10Jul 24, 2020Updated 5 years ago
- β10Jun 3, 2019Updated 6 years ago
- β17May 17, 2024Updated last year
- β27Jan 5, 2026Updated 2 months ago
- Lowering PyTorch's Memory Consumption for Selective Differentiationβ12Aug 29, 2024Updated last year
- Instant Visualization of Point Clouds [Eurographics 2022]β15Apr 8, 2024Updated last year
- [SIGGRAPH 2025] MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generationβ25Aug 5, 2025Updated 7 months ago
- A simple, generic, and flexible keyframe animation library for Rust.β30Dec 30, 2025Updated 2 months ago
- β25Sep 5, 2025Updated 6 months ago
- ASL Fingerspelling recognition in the wildβ13Nov 21, 2019Updated 6 years ago
- A Fine-grained Benchmark for Video Captioning and Retrievalβ27Jul 16, 2025Updated 8 months ago
- [Arxiv2022] Interpreting Class Conditional GANs with Channel Awarenessβ17Apr 4, 2022Updated 3 years ago
- Sotopia-RL: Reward Design for Social Intelligenceβ47Jan 29, 2026Updated last month
- β15Nov 26, 2019Updated 6 years ago
- (ICLR 2025 Spotlight) Official code repository for Interleaved Scene Graph.β31Aug 7, 2025Updated 7 months ago
- Official implementation of Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And Moreβ25Feb 25, 2025Updated last year
- FakePartsBench: 25K+ AI-generated videos with pixel- and frame-level annotations of full and partial deepfakes.β24Aug 31, 2025Updated 6 months ago
- β56Nov 6, 2024Updated last year
- Implementation of TransAE model described in Multimodal Data Enhanced Representation Learning for Knowledge Graphsβ17Oct 31, 2020Updated 5 years ago
- Translate Markdown files from one language to another using OpenAI's API while retaining original formatting. This Jupyter notebook tokenβ¦β23Oct 15, 2023Updated 2 years ago
- Masked Vision Transformer for Text Recognitionβ11Nov 13, 2024Updated last year
- β18Oct 3, 2023Updated 2 years ago
- [ICLR 2025] Large (Vision) Language Models are Unsupervised In-Context Learnersβ22Jun 6, 2025Updated 9 months ago