onion-liu / arxiv_daily_aigcLinks
An AI-driven daily arXiv paper crawler, analyzer, and organizer tool, focusing on AIGC
☆73Updated this week
Alternatives and similar repositories for arxiv_daily_aigc
Users that are interested in arxiv_daily_aigc are comparing it to the libraries listed below
Sorting:
- [AAAI 2025] StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization☆226Updated last week
- Customize your arXiv recommendation every day.☆138Updated 3 months ago
- Official GPU implementation of the paper "PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance"☆131Updated last year
- Chrome / Edge extension to turn arXiv papers into Markdown codes in one click.☆88Updated 10 months ago
- [EMNLP 2025 Demo] PresentAgent: Multimodal Agent for Presentation Video Generation☆123Updated 2 months ago
- Implementation for the paper "ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems".☆198Updated 3 weeks ago
- 🔥ICLR 2025 (Spotlight) One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt☆310Updated 3 months ago
- 收集整理一些在Seedream 4.0 下生成的令人惊艳的图片和提示词☆112Updated 4 months ago
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.☆268Updated last month
- Cookbook for Crafting Good Code☆57Updated last year
- 如何得到最好的结果,Improve-Your-Prompt是一个用于优化prompt的prompt☆37Updated last year
- MovieAgent: Automated Movie Generation via Multi-Agent CoT Planning☆282Updated 9 months ago
- [ICLR 2025] The First Multimodal Seach Engine Pipeline and Benchmark for LMMs☆482Updated 11 months ago
- ☆170Updated last year
- Learning records for building a large language model from scratch☆58Updated last year
- ☆298Updated last year
- Fetch arxiv data to LLM-friendly text☆128Updated last month
- Awesome Instruction Editing. Image and Media Editing with Human Instructions. Instruction-Guided Image and Media Editing.☆103Updated 2 months ago
- A reproduction of the Deepseek-OCR model including training☆202Updated 2 months ago
- 💡 VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning☆298Updated 3 months ago
- 论文阅读工具,一键截图+AI翻译,支持数学公式,贴片多窗口管理☆131Updated 4 months ago
- Open Image Curation Tools☆47Updated 9 months ago
- Open-source alternative for crowdtest.ai. Simulate how users might react to different versions of your content☆163Updated 10 months ago
- ☆80Updated 9 months ago
- [ACL2025 Findings] Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models☆87Updated 8 months ago
- [ECCV 2024] Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models☆81Updated last year
- The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"☆71Updated 5 months ago
- Train a Language Model with GRPO to create a schedule from a list of events and priorities☆259Updated 8 months ago
- Youtu-Tip: Tap for Intelligence, Keep on Device.☆508Updated this week
- The Level-Navi Agent, a framework that requires no training and utilizes large language models for deep query understanding and precise s…☆82Updated last year