🚀 LLM-I: Transform LLMs into natural interleaved multimodal creators! ✨ Tool-use framework supporting image search, generation, code execution & editing
☆41Oct 20, 2025Updated 8 months ago
Alternatives and similar repositories for LLM-I
Users that are interested in LLM-I are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Implementation of OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation☆64Jul 5, 2025Updated 11 months ago
- [EMNLP 2024 Findings] Wrong-of-Thought: An Integrated Reasoning Framework with Multi-Perspective Verification and Wrong Information☆13Oct 1, 2024Updated last year
- A comprehensive benchmark for evaluating deep research agents on academic survey tasks☆54Sep 4, 2025Updated 9 months ago
- ☆12Nov 5, 2024Updated last year
- ☆19Feb 25, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection☆56Aug 16, 2025Updated 10 months ago
- Official repository for “Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space”☆18Jan 27, 2026Updated 5 months ago
- Official implementation for RoMaP :Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Sampl…☆22Aug 5, 2025Updated 10 months ago
- 深度学习与围棋学习☆15Oct 27, 2021Updated 4 years ago
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search [SIGIR 2026]☆64Jul 4, 2025Updated 11 months ago
- MUA-RL: MULTI-TURN USER-INTERACTING AGENT REINFORCEMENT LEARNING FOR AGENTIC TOOL USE☆62Nov 5, 2025Updated 7 months ago
- A yolov5 based application, it uses the prediction results by yolov5 to activate the selected opencv built-in tracking algorithm.☆10Jul 24, 2020Updated 5 years ago
- ☆29Mar 30, 2026Updated 2 months ago
- ☆12Oct 17, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆19May 17, 2024Updated 2 years ago
- A simple, generic, and flexible keyframe animation library for Rust.☆30Jun 1, 2026Updated 3 weeks ago
- Official implementation of paper "ACON: Optimizing Context Compression for Long-horizon LLM Agents"☆89Oct 14, 2025Updated 8 months ago
- Based on assafelovic/gpt-researcher - Modified to support local Ollama models☆16May 15, 2024Updated 2 years ago
- A Fine-grained Benchmark for Video Captioning and Retrieval☆30Jul 16, 2025Updated 11 months ago
- (ICLR 2025 Spotlight) Official code repository for Interleaved Scene Graph.☆32Aug 7, 2025Updated 10 months ago
- This repo contains the official implementation of paper "Layered Controllabel Video Generation".☆13Oct 31, 2022Updated 3 years ago
- ☆19Jun 20, 2025Updated last year
- 📝The official repository of "Rethinking Cross-Generator Image Forgery Detection through DINOv3"☆25Dec 2, 2025Updated 6 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Official implementation of Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More☆25Feb 25, 2025Updated last year
- [SIGGRAPH 2025] MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation☆36Aug 5, 2025Updated 10 months ago
- ☆10Oct 11, 2022Updated 3 years ago
- ☆48Feb 18, 2026Updated 4 months ago
- Sotopia-RL: Reward Design for Social Intelligence☆52Apr 1, 2026Updated 2 months ago
- (WIP) Parallel inference for black-forest-labs' FLUX model.☆19Nov 18, 2024Updated last year
- Translate Markdown files from one language to another using OpenAI's API while retaining original formatting. This Jupyter notebook token…☆23Oct 15, 2023Updated 2 years ago
- Custom LORA training on DynamiCrafter☆18Jul 26, 2024Updated last year
- Masked Vision Transformer for Text Recognition☆11Nov 13, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- [ICLR 2026] An official implementation of "SIM-CoT: Supervised Implicit Chain-of-Thought"☆207Apr 13, 2026Updated 2 months ago
- Official Code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search"☆423Jan 29, 2026Updated 5 months ago
- OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents☆26May 17, 2026Updated last month
- Prebuilt WASM binaries for tree-sitter's language parsers.☆16Oct 7, 2025Updated 8 months ago
- decontamination☆36Mar 4, 2026Updated 3 months ago
- Kaggle AIMO2 solution with token-efficient reasoning LLM recipes☆50Aug 7, 2025Updated 10 months ago
- TRCaptionNet official repository☆13Jul 25, 2024Updated last year