yujxx / PodAgent
PodAgent: A Comprehensive Framework for Podcast Generation
☆87Updated 3 weeks ago
Alternatives and similar repositories for PodAgent
Users who are interested in PodAgent are comparing it to the repositories listed below.
- FlexRAG: A RAG Framework for Information Retrieval and Generation.☆169Updated last month
- ☆65Updated 8 months ago
- A curated list of Video to Audio Generation☆45Updated last month
- "AI-Creator: Multi-Modal Agents for Video Production"☆143Updated this week
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆75Updated 2 weeks ago
- Implementation for the paper "ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems".☆167Updated 3 months ago
- Qwen DianJin: LLMs for the Financial Industry by Alibaba Cloud☆105Updated 2 weeks ago
- Repo for "VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforce…☆20Updated this week
- ☆158Updated 3 months ago
- SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems☆82Updated last year
- ☆83Updated 3 weeks ago
- Deep Reasoning Translation via Reinforcement Learning (arXiv preprint 2025); DRT: Deep Reasoning Translation via Long Chain-of-Thought (a…☆224Updated last week
- ☆16Updated 10 months ago
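- The simplest reproduction of R1 results on small models, illustrating the most important essence of O1-like models and DeepSeek R1. Think is all you need. Experiments support the claim that, for strong reasoning ability, the content of the thinking process is the core of AGI/ASI.☆45Updated 3 months ago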
- An LLM-based agent simulation framework that simulates human behavior and generates dynamic, text-based social graphs.☆75Updated last month
- The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.☆35Updated 8 months ago
- OpenOmni: Official implementation of Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Alignment and Rea…☆51Updated last week
- GLM Series Edge Models☆141Updated 3 months ago
- SkyScript-100M: 1,000,000,000 Pairs of Scripts and Shooting Scripts for Short Drama: https://arxiv.org/abs/2408.09333v2☆120Updated 6 months ago
- [ICML2025] The official implementation of "C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Gene…☆22Updated last month
- ☆77Updated 2 months ago
- [ICML 2025] "SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator"☆73Updated 5 months ago
- An easy-to-use, fast, and easily integrable tool for evaluating audio LLM☆102Updated 2 weeks ago
- ☆223Updated last week
- An open platform for enhancing the capability of LLMs in workflow orchestration.☆145Updated 2 months ago
- ☆48Updated 3 weeks ago
- GPT-4o-level, real-time spoken dialogue system.☆327Updated 4 months ago
- A project for tri-modal LLM benchmarking and instruction tuning.☆35Updated 2 months ago
- Collection of model-centric MCP servers☆17Updated 2 weeks ago
- Using APPL to reimplement popular algorithms for Large Language Models (LLMs) and prompts☆45Updated 4 months ago