starriver030515 / FUSIONLinks
☆173Updated 2 months ago
Alternatives and similar repositories for FUSION
Users that are interested in FUSION are comparing it to the libraries listed below
Sorting:
- 🎉 [ACL 2025] The code repository for "Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning…☆110Updated last month
- 🎉 The code repository for "Parrot: Multilingual Visual Instruction Tuning" in PyTorch.☆87Updated 2 weeks ago
- Official code for "Direct Retrieval-augmented Optimization: Synergizing Knowledge Selection and Language Models"☆204Updated last month
- Official code for "Iterative Self-Incentivization Empowers Large Language Models as Agentic Searchers"☆139Updated 3 weeks ago
- ☆102Updated 3 weeks ago
- Official Repo for WWW 2025 paper "Tool Learning in the Wild: Empowering Language Models as Automatic Tool Agents"☆199Updated 2 months ago
- Official code for ACL2025 "🔍 Retrieval Models Aren’t Tool-Savvy: Benchmarking Tool Retrieval for Large Language Models"☆176Updated 3 weeks ago
- [CVPR'25] Antidote: A Unified Framework for Mitigating LVLM Hallucinations in Counterfactual Presupposition and Object Perception☆16Updated last month
- CHATS: Combining Human-Aligned Optimization and Test-Time Sampling for Text-to-Image Generation (ICML2025)☆100Updated 3 weeks ago
- Official Repository of paper OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference☆145Updated 3 months ago
- MM-IFEngine: Towards Multimodal Instruction Following☆92Updated 2 months ago
- Official repository of the paper "Generalist Virtual Agents: A Survey on Autonomous Agents Across Digital Platforms"☆46Updated this week
- Teaching LMMs for Image Quality Scoring and Interpreting☆91Updated 3 months ago
- Official Pytorch Code of the Paper "LightMotion: A Light and Tuning-free Method for Simulating Camera Motion in Video Generation"☆39Updated 2 weeks ago
- StreamingBench: Assessing the Gap for MLLMs to Achieve Streaming Video Understanding☆126Updated last month
- 🤖 Discord AI assistant with OpenAI, Gemini, Claude & DeepSeek integration, multilingual support, multimodal chat, image generation, web …☆245Updated 3 months ago
- ☆35Updated 2 months ago
- Reasoning-Table: Exploring Reinforcement Learning for Table Reasoning☆73Updated 3 weeks ago
- This is our official implementation for the paper, accepted by IEEE TKDE 2025.☆109Updated 2 weeks ago
- [ICLR 2025] "Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances" (Official Implementation)☆343Updated 2 months ago
- A Python library for unified access to multi-source spatiotemporal Earth observation data, supporting major meteorological and oceanograp…☆146Updated last month
- [DASFAA'25] LLM4GraphTopology: Large Language Models as Topological Structure Enhancers for Text-Attributed Graphs☆29Updated 2 weeks ago
- ☆98Updated 2 months ago
- How2Sign, Youtube-ASL preprocess includes download, and Mediapipe process.☆92Updated last month
- 梯子推荐🪜☆263Updated 3 weeks ago
- 🎉TypeScript Execute (tsx): Dynamically compile TSX/TS file and execute it. The easiest way to run .tsx in Nodejs.☆309Updated 3 months ago
- [AAAI 2025] The code repository for "MOS: Model Surgery for Pre-Trained Model-Based Class-Incremental Learning" in PyTorch.☆62Updated 2 months ago
- The official implementation for START (NeurIPS 2024).☆36Updated 4 months ago
- 基于深度学习的低代码计算机视觉系统,包含图像采集、智能检测、数据标注、模型训练四大模块。☆223Updated last month
- Official code for paper "Learning to Use Tools via Cooperative and Interactive Agents"☆137Updated last year