agituts / gemini-2-podcast
A Python-based tool that generates engaging podcast conversations using Google's Gemini 2.0 Flash Experimental model for script generation and text-to-speech conversion.
☆105 Updated 3 months ago
Alternatives and similar repositories for gemini-2-podcast:
Users interested in gemini-2-podcast are comparing it to the libraries listed below:
- ☆54 Updated 3 weeks ago
- The AI Podcast Studio: generate podcast scripts and their audio version with a team of AI workers in a Podcast Studio ☆167 Updated last month
- Natural Language Application Development (NLAD) - A methodology for building applications by leveraging LLMs to create higher levels of a… ☆83 Updated 4 months ago
- PocketFlow's node-based workflow structure, with Manus' agents and tools! ☆178 Updated this week
- uses gpt-4o and gpt-4-mini to write books on topics while researching with perplexity API ☆89 Updated 3 months ago
- MCP server for enabling LLM applications to perform deep research via the MCP protocol ☆77 Updated 2 weeks ago
- Finally, an open source Youtube Summarizer extension ☆66 Updated 3 months ago
- MarinaBox is a toolkit for creating and managing secure, isolated environments for AI agents ☆118 Updated last month
- AI planner similar to OpenAI's deep research ☆137 Updated this week
- ☆86 Updated last month
- The open source chat powered by LLMs with RAG. Kollektiv makes it easy to sync your custom data sources and get accurate, contextual repl… ☆84 Updated last month
- Generates breakthrough ideas from a single prompt through an 8 stage walkthrough, with optional research proposal paper. ☆54 Updated last month
- MCP server that enhances Claude's reasoning capabilities by integrating DeepSeek R1's advanced reasoning engine ☆46 Updated 2 months ago
- Insanely Fast Transcription: A Python-based utility for rapid audio transcription from YouTube videos or local files. Leverages GPU accel… ☆80 Updated 8 months ago
- ☆27 Updated last week
- A Model Context Protocol (MCP) server for ATLAS, a Neo4j-powered task management system for LLM Agents - implementing a three-tier archit… ☆122 Updated last week
- A cutting-edge Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities. ☆57 Updated last week
- Tool for scraping and consolidating documentation websites into a single MD file. ☆144 Updated this week
- A tool that helps you build prompts and manage context with lots of code blocks in them. ☆99 Updated last week
- An automated machine learning system that leverages O1 and Claude to iteratively develop, improve, and optimize ML solutions. ☆88 Updated 3 months ago
- A fork of OpenAI Swarm that supports Groq and Anthropic ☆119 Updated 2 months ago
- ☆183 Updated 4 months ago
- ☆133 Updated 2 months ago
- ☆39 Updated 2 weeks ago
- A Model Context Protocol (MCP) server for research and documentation assistance using Perplexity AI. Won 1st @ Cline Hackathon ☆167 Updated last week
- An opinionated, Agentic Engineering toolbox powered by LLM Agents to solve problems autonomously. ☆127 Updated last year
- ☆70 Updated last month
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no… ☆117 Updated 5 months ago
- Waldzell AI's monorepo of MCP servers. Use in Claude Desktop, Cline, Roo Code, and more! ☆59 Updated 3 weeks ago
- Proposal for a flexible, tool-agnostic, codebase context system that helps teach AI coding tools about your codebase. Super easy to get … ☆117 Updated last week