🚀 LLM-I: Transform LLMs into natural interleaved multimodal creators! ✨ Tool-use framework supporting image search, generation, code execution & editing
☆41Oct 20, 2025Updated 7 months ago
Alternatives and similar repositories for LLM-I
Users that are interested in LLM-I are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PyTorch Implementation for the paper "Let Me Help You! Neuro-Symbolic Short-Context Action Anticipation" accepted to RA-L'24.☆12Nov 27, 2024Updated last year
- A comprehensive benchmark for evaluating deep research agents on academic survey tasks☆51Sep 4, 2025Updated 8 months ago
- ☆19Feb 25, 2024Updated 2 years ago
- Poet: Product-oriented Video Captioner for E-commerce☆12Sep 21, 2020Updated 5 years ago
- Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection☆56Aug 16, 2025Updated 9 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Official repository for “Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space”☆18Jan 27, 2026Updated 3 months ago
- Official implementation for RoMaP :Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Sampl…☆21Aug 5, 2025Updated 9 months ago
- 深度学习与围棋学习☆16Oct 27, 2021Updated 4 years ago
- Official repository for ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use☆30Nov 4, 2025Updated 6 months ago
- code for "CoMT: A Novel Benchmark for Chain of Multi-modal Thought on Large Vision-Language Models"☆19Mar 10, 2025Updated last year
- MUA-RL: MULTI-TURN USER-INTERACTING AGENT REINFORCEMENT LEARNING FOR AGENTIC TOOL USE☆59Nov 5, 2025Updated 6 months ago
- Semi-supervised Domain Adaptation of Machine Translation☆12Dec 8, 2022Updated 3 years ago
- ☆12Dec 8, 2022Updated 3 years ago
- Enemies for your LLM☆36Jan 20, 2026Updated 4 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- RAG methods, benchmarks, and toolkits☆19Nov 28, 2024Updated last year
- A yolov5 based application, it uses the prediction results by yolov5 to activate the selected opencv built-in tracking algorithm.☆10Jul 24, 2020Updated 5 years ago
- ☆10Jun 3, 2019Updated 6 years ago
- [ICCV2025] The official code of "DreamRelation: Relation-Centric Video Customization"☆26Feb 4, 2026Updated 3 months ago
- Official repo of Knowledge or Reasoning? A Close Look at How LLMs Think Across Domains.☆43Jun 6, 2025Updated 11 months ago
- ☆29Mar 30, 2026Updated last month
- Efficient non-uniform quantization with GPTQ for GGUF☆63Sep 17, 2025Updated 8 months ago
- ☆18May 17, 2024Updated 2 years ago
- The dataset CoLan-150K and the concept decomposition in the paper Concept Lancet (CVPR 2025)☆20Jan 18, 2026Updated 4 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆16Dec 22, 2017Updated 8 years ago
- Official implementation of paper "ACON: Optimizing Context Compression for Long-horizon LLM Agents"☆75Oct 14, 2025Updated 7 months ago
- ASL Fingerspelling recognition in the wild☆13Nov 21, 2019Updated 6 years ago
- [Arxiv2022] Interpreting Class Conditional GANs with Channel Awareness☆17Apr 4, 2022Updated 4 years ago
- Sparse Transformer with limited attention span in PyTorch☆15Apr 4, 2021Updated 5 years ago
- ☆17Nov 10, 2021Updated 4 years ago
- A Fine-grained Benchmark for Video Captioning and Retrieval☆28Jul 16, 2025Updated 10 months ago
- (ICLR 2025 Spotlight) Official code repository for Interleaved Scene Graph.☆31Aug 7, 2025Updated 9 months ago
- under review☆14Mar 1, 2021Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Official Implementation for the paper "Integrative Decoding: Improving Factuality via Implicit Self-consistency"☆33Apr 12, 2025Updated last year
- ☆19Jun 20, 2025Updated 10 months ago
- 📝The official repository of "Rethinking Cross-Generator Image Forgery Detection through DINOv3"☆23Dec 2, 2025Updated 5 months ago
- We deal with the problem of zero-shot cross-modal image retrieval involving color and sketch images through a novel deep representation l…☆14Sep 13, 2021Updated 4 years ago
- ☆56Nov 6, 2024Updated last year
- Feature Re-Learning with Data Augmentation for Video Relevance Prediction☆20Jan 10, 2023Updated 3 years ago
- ☆48Feb 18, 2026Updated 3 months ago