🚀 LLM-I: Transform LLMs into natural interleaved multimodal creators! ✨ Tool-use framework supporting image search, generation, code execution & editing
☆41Oct 20, 2025Updated 5 months ago
Alternatives and similar repositories for LLM-I
Users that are interested in LLM-I are comparing it to the libraries listed below
Sorting:
- Official Implementation of OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation☆40Jul 5, 2025Updated 8 months ago
- PyTorch Implementation for the paper "Let Me Help You! Neuro-Symbolic Short-Context Action Anticipation" accepted to RA-L'24.☆12Nov 27, 2024Updated last year
- The official repository of "R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Integration"☆139Sep 4, 2025Updated 6 months ago
- A comprehensive benchmark for evaluating deep research agents on academic survey tasks☆51Sep 4, 2025Updated 6 months ago
- A demonstration of the paper NER Retriever: Zero-Shot Named Entity Retrieval with Type-Aware Embeddings☆39Sep 13, 2025Updated 6 months ago
- Official repository for “Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space”☆18Jan 27, 2026Updated last month
- ICLR 2021 (spotlight): Graph Convolution with Low-rank Learnable Local Filters☆16Jan 14, 2021Updated 5 years ago
- Official implementation for RoMaP :Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Sampl…☆21Aug 5, 2025Updated 7 months ago
- 深度学习与围棋学习☆16Oct 27, 2021Updated 4 years ago
- Official repository for ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use☆29Nov 4, 2025Updated 4 months ago
- MUA-RL: MULTI-TURN USER-INTERACTING AGENT REINFORCEMENT LEARNING FOR AGENTIC TOOL USE☆58Nov 5, 2025Updated 4 months ago
- code for "CoMT: A Novel Benchmark for Chain of Multi-modal Thought on Large Vision-Language Models"☆19Mar 10, 2025Updated last year
- Semi-supervised Domain Adaptation of Machine Translation☆12Dec 8, 2022Updated 3 years ago
- ☆12Dec 8, 2022Updated 3 years ago
- Official implementation of paper "ACON: Optimizing Context Compression for Long-horizon LLM Agents"☆57Oct 14, 2025Updated 5 months ago
- Enemies for your LLM☆35Jan 20, 2026Updated 2 months ago
- ☆11Mar 26, 2020Updated 5 years ago
- ☆10Jun 3, 2019Updated 6 years ago
- [ICCV2025] The official code of "DreamRelation: Relation-Centric Video Customization"☆28Feb 4, 2026Updated last month
- Unsupervised specificity-guided optimization of Image Captioning models to encourage meaningful diversity in the generated captions. Code…☆13May 25, 2025Updated 9 months ago
- Official repo of Knowledge or Reasoning? A Close Look at How LLMs Think Across Domains.☆44Jun 6, 2025Updated 9 months ago
- ☆27Jan 5, 2026Updated 2 months ago
- ☆12Oct 17, 2024Updated last year
- The dataset CoLan-150K and the concept decomposition in the paper Concept Lancet (CVPR 2025)☆20Jan 18, 2026Updated 2 months ago
- Instant Visualization of Point Clouds [Eurographics 2022]☆15Apr 8, 2024Updated last year
- decontamination☆26Mar 4, 2026Updated 2 weeks ago
- ASL Fingerspelling recognition in the wild☆13Nov 21, 2019Updated 6 years ago
- A Fine-grained Benchmark for Video Captioning and Retrieval☆27Jul 16, 2025Updated 8 months ago
- Sparse Transformer with limited attention span in PyTorch☆15Apr 4, 2021Updated 4 years ago
- [Arxiv2022] Interpreting Class Conditional GANs with Channel Awareness☆17Apr 4, 2022Updated 3 years ago
- Sotopia-RL: Reward Design for Social Intelligence☆47Jan 29, 2026Updated last month
- 📝The official repository of "Rethinking Cross-Generator Image Forgery Detection through DINOv3"☆21Dec 2, 2025Updated 3 months ago
- Official implementation of Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More☆25Feb 25, 2025Updated last year
- ☆10Oct 11, 2022Updated 3 years ago
- FakePartsBench: 25K+ AI-generated videos with pixel- and frame-level annotations of full and partial deepfakes.☆24Aug 31, 2025Updated 6 months ago
- (WIP) Parallel inference for black-forest-labs' FLUX model.☆19Nov 18, 2024Updated last year
- Implementation of TransAE model described in Multimodal Data Enhanced Representation Learning for Knowledge Graphs☆17Oct 31, 2020Updated 5 years ago
- ☆46Feb 18, 2026Updated last month
- A ratatui based vertical and horizontal slider.☆39Mar 11, 2026Updated last week