yujxx / PodAgentLinks
PodAgent: A Comprehensive Framework for Podcast Generation
☆93Updated last month
Alternatives and similar repositories for PodAgent
Users that are interested in PodAgent are comparing it to the libraries listed below
Sorting:
- "AI-Creator: Multi-Modal Agents for Video Production"☆157Updated 2 weeks ago
- A curated list of Video to Audio Generation☆49Updated last week
- ☆65Updated 9 months ago
- SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems☆82Updated last year
- Deep Reasoning Translation (DRT) Project☆224Updated last month
- flow mirror models from JZX AI Labs☆44Updated 8 months ago
- FlexRAG: A RAG Framework for Information Retrieval and Generation.☆183Updated last week
- Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"☆125Updated last month
- ☆86Updated last month
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆78Updated last month
- An LLM-based agent simulation framework that simulates human behavior and generates dynamic, text-based social graphs.☆76Updated 2 months ago
- Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction☆198Updated 3 months ago
- ☆37Updated 2 weeks ago
- Designing Multi-Agent Systems with Zero Supervision☆73Updated this week
- ☆163Updated 4 months ago
- ☆251Updated 2 months ago
- ☆16Updated 11 months ago
- ☆77Updated 2 months ago
- 最简易的R1结果在小模型上的复现,阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证,对于强推理能力,think思考过程性内容是AGI/ASI的核心。☆45Updated 4 months ago
- MCP DeepResearch Server: 基于 LangGraph + Ollama + Tavily 的深度研究服务器,支持异步运行、超时控制与进度推送☆18Updated last week
- Qwen DianJin: LLMs for the Financial Industry by Alibaba Cloud☆113Updated last month
- The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.☆35Updated 9 months ago
- GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents☆256Updated this week
- The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"☆64Updated last month
- The official Soundwave repository☆209Updated 3 months ago
- RealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios☆54Updated 7 months ago
- Implementation for the paper "ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems".☆173Updated 3 months ago
- OpenOmni: Official implementation of Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Alignment and Rea…☆62Updated 3 weeks ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆35Updated 4 months ago
- ☆255Updated 10 months ago