☆44Jan 4, 2026Updated 4 months ago
Alternatives and similar repositories for AIA
Users that are interested in AIA are comparing it to the libraries listed below.
- Code of StyleCrafter on SDXL☆20Jun 25, 2024Updated last year
- This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?"☆31Dec 23, 2024Updated last year
- 150 books on information security (continuously updated)☆15Feb 16, 2023Updated 3 years ago
- [ICML 2026] a unified reinforcement learning toolbox for joint RL on language models and diffusion models☆80Mar 31, 2026Updated last month
- Co-Reinforcement Learning for Unified Multimodal Understanding and Generation☆47Jul 22, 2025Updated 9 months ago
- The code repository of UniRL☆52May 30, 2025Updated 11 months ago
- ☆24May 23, 2025Updated 11 months ago
- This is the official implementation of paper "Multi-Prior Learning via Neural Architecture Search for Blind Face Restoration".☆13Jun 20, 2022Updated 3 years ago
- One-click setup of a nextcloud personal cloud drive with docker-compose☆12Nov 26, 2021Updated 4 years ago
- SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward☆94Aug 8, 2025Updated 9 months ago
- [ICLR2026] The official code of "Weak-to-Strong Diffusion with Reflection".☆58Jan 28, 2026Updated 3 months ago
- Paper: “MEMRL: SELF-EVOLVING AGENTS VIA RUNTIME REINFORCEMENT LEARNING ON EPISODIC MEMORY” Open-Source Code☆101May 2, 2026Updated last week
- Code for the paper "Overconfidence is a Dangerous Thing: Mitigating Membership Inference Attacks by Enforcing Less Confident Prediction" …☆13Sep 6, 2023Updated 2 years ago
- [ICCV 2025] VLM4D: Towards Spatiotemporal Awareness in Vision Language Models☆46Nov 20, 2025Updated 5 months ago
- Database practice course project: a simple course selection system implemented with C# and SQL-Server☆10Oct 11, 2020Updated 5 years ago
- ☆54Dec 10, 2025Updated 4 months ago
- [NeurIPS2025] The official implementation of MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO☆140Oct 15, 2025Updated 6 months ago
- (IJCV2025) The official implementation of "DiffuVolume: Diffusion Model for Volume based Stereo Matching"☆30Jan 15, 2025Updated last year
- [CVPR2026] Skyra: AI-Generated Video Detection via Grounded Artifact Reasoning☆45Mar 27, 2026Updated last month
- ☆54Feb 9, 2026Updated 2 months ago
- Hydro - modified by xiabee☆14May 7, 2022Updated 4 years ago
- [ICLR2025] A versatile image-to-image visual assistant, designed for image generation, manipulation, and translation based on free-form u…☆210May 5, 2025Updated last year
- ☆147Feb 28, 2026Updated 2 months ago
- ☆147Mar 30, 2026Updated last month
- This is the official repository for the paper "FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehe…☆128Jan 29, 2026Updated 3 months ago
- [ICML 2025] Official code of "AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization"☆31Jan 10, 2026Updated 3 months ago
- Official Implementation of "LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis"☆83Aug 25, 2025Updated 8 months ago
- NJU CS core course, Introduction to Algorithms, related labs☆17Aug 5, 2019Updated 6 years ago
- ☆38Dec 16, 2025Updated 4 months ago
- A collection of vision foundation models unifying understanding and generation.☆60Jan 2, 2025Updated last year
- Official implementation of "Opt-In Art: Learning Art Styles Only from Few Examples" (Accepted by NeurIPS 2025)☆33Nov 30, 2025Updated 5 months ago
- SkillX: Automatically Constructing Skill Knowledge Bases for Agents☆107Apr 30, 2026Updated last week
- Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards"☆63Jan 5, 2026Updated 4 months ago
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection☆32Feb 10, 2026Updated 2 months ago
- https://huggingface.co/datasets/multimodal-reasoning-lab/Zebra-CoT☆134Jan 30, 2026Updated 3 months ago
- This is the official implementation of the paper titled "Comprehensive Comparison of Vision Transformers and Traditional Convolutional Ne…☆14Mar 4, 2025Updated last year
- Auto1111 port of NVlab's adversarial purification method that uses the forward and reverse processes of diffusion models to remove advers…☆13Aug 8, 2023Updated 2 years ago
- Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation, ICML 2024☆23Jun 26, 2024Updated last year
- ☆43Apr 8, 2026Updated last month