onion-liu / arxiv_daily_aigc
An AI-driven daily arXiv paper crawler, analyzer, and organizer tool, focusing on AIGC
☆71Updated this week
Alternatives and similar repositories for arxiv_daily_aigc
Users who are interested in arxiv_daily_aigc are comparing it to the libraries listed below
- [EMNLP 2025 Demo] PresentAgent: Multimodal Agent for Presentation Video Generation☆120Updated last month
- Customize your arXiv recommendation every day.☆137Updated 3 months ago
- [AAAI 2025] StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization☆225Updated 8 months ago
- Official GPU implementation of the paper "PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance"☆130Updated last year
- A curated collection of stunning images generated with Seedream 4.0, along with their prompts☆112Updated 3 months ago
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.☆267Updated 3 weeks ago
- Chrome / Edge extension to turn arXiv papers into Markdown codes in one click.☆88Updated 9 months ago
- Implementation for the paper "ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems".☆196Updated last week
- 🔥ICLR 2025 (Spotlight) One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt☆308Updated 2 months ago
- ☆170Updated last year
- Multi-profile Claude Code launcher with secure credential management☆162Updated this week
- How to get the best results: Improve-Your-Prompt is a prompt for optimizing prompts☆37Updated last year
- Cookbook for Crafting Good Code☆57Updated last year
- ☆293Updated last year
- ☆80Updated 8 months ago
- A reproduction of the Deepseek-OCR model including training☆200Updated last month
- Paper reading tool: one-click screenshot + AI translation, with math formula support and pinned multi-window management☆131Updated 4 months ago
- [ICLR 2025] The First Multimodal Search Engine Pipeline and Benchmark for LMMs☆482Updated 11 months ago
- MovieAgent: Automated Movie Generation via Multi-Agent CoT Planning☆275Updated 9 months ago
- AI-Powered Video Retrieval & Clipping Tool☆376Updated 4 months ago
- The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"☆71Updated 4 months ago
- Awesome Instruction Editing. Image and Media Editing with Human Instructions. Instruction-Guided Image and Media Editing.☆101Updated last month
- Open Image Curation Tools☆47Updated 8 months ago
- Video generation via code☆1,401Updated last month
- AI video editing☆289Updated last month
- MagicTryOn is a video virtual try-on framework based on a large-scale video diffusion Transformer.☆493Updated last week
- 💡 VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning☆292Updated 2 months ago
- Open-source alternative for crowdtest.ai. Simulate how users might react to different versions of your content☆158Updated 10 months ago
- Awesome-RAG-Vision: a curated list of advanced retrieval augmented generation (RAG) for Computer Vision☆291Updated 2 months ago
- Fetch arxiv data to LLM-friendly text☆128Updated 3 weeks ago