aassxun / Understanding-Vision-Tasks
☆207 Updated last month
Alternatives and similar repositories for Understanding-Vision-Tasks:
Users who are interested in Understanding-Vision-Tasks are comparing it to the repositories listed below
- Intervening Anchor Token: Decoding Strategy in Alleviating Hallucinations for MLLMs ☆152 Updated 2 weeks ago
- [MM 2024] Official code for VeCAF: Vision-language Collaborative Active Finetuning with Training Objective Awareness ☆46 Updated 8 months ago
- Efficient controlnet for DiTs ☆65 Updated last week
- A from-scratch, hand-written reproduction of a diffusion model based on the MNIST handwritten digit dataset ☆36 Updated 2 weeks ago
- A curated list of papers, code and resources pertaining to image composition/compositing or object insertion/addition/compositing, which … ☆494 Updated last month
- [Arxiv 2024] Official code for Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptation ☆63 Updated 8 months ago
- Run JavaScript code from Python. ☆101 Updated 3 weeks ago
- This repo collects research papers that use AI tools and are in the field of scientific research (including computer science, agronomy, c… ☆67 Updated last week
- Official repository of MMGenBench ☆119 Updated 2 weeks ago
- We introduce temporal working memory (TWM), which aims to enhance the temporal modeling capabilities of Multimodal foundation models (MFM… ☆308 Updated 2 months ago
- ☆160 Updated 5 months ago
- Workflow runner engine for argo framework ☆100 Updated last month
- ☆67 Updated last week
- ☆420 Updated 7 months ago
- [NeurIPS'24] Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy ☆64 Updated 2 months ago
- RAS: Retrieval-And-Structuring for Knowledge-Intensive LLM Generation ☆43 Updated last month
- ANIMAT is the first AI platform to integrate MMD and facial tracking for dynamic 3D Model, enabling realistic customization and upgrade o… ☆82 Updated last month
- The 1st dynamic phishing kit dataset ☆100 Updated last month
- Main Project of AIDE ☆91 Updated last month
- ☆100 Updated 2 months ago
- A dataset for fall detection using photorealistic virtual environments. ☆47 Updated last month
- [ACL 2024] Knowledge Fusion by Evolving Weights of Language Models ☆37 Updated 6 months ago
- ☆226 Updated last month
- Launching the "Agent Creation Toolkit", providing developers with an intuitive and efficient Development Environment, supporting the rapi… ☆43 Updated this week
- ☆124 Updated last month
- Virtual to Real, Synthetic Data, Vehicle Re-identification ☆104 Updated 3 months ago
- Official implementation of paper "Multi-Level Collaboration in Model Merging" ☆40 Updated 2 weeks ago
- Code implementation of the paper accepted by IEEE TKDE2024: "Make Heterophilic Graphs Better Fit GNN: A Graph Rewiring Approach" ☆104 Updated 3 months ago
- MAX31855 full function driver library for general MCU and Linux. ☆70 Updated 3 weeks ago
- ☆23 Updated 5 months ago