yachty66 / gpt_pdf_md
gpt_pdf_md: Convert PDF to Markdown with GPT-4V & more. Extract images, upload to Google Cloud, & generate Markdown with images. Python, GPT-4V Vision, Scala. Ideal for developers, researchers. PDF to Markdown, GPT-4V, image extraction, Python package
★76 · Updated last year
Related projects
Alternatives and complementary repositories for gpt_pdf_md
- ★136 · Updated 11 months ago
- Turn a Github Repo's contents into a big prompt for long-context models like Claude 3 Opus. ★149 · Updated 7 months ago
- A backend API to perform search over Wikipedia using LangChain, Cohere and Weaviate. ★105 · Updated last year
- Build your Swarm of Internet Agents using MultiOn. ★77 · Updated 10 months ago
- A simple wrapper for OpenAI to log input/outputs. ★103 · Updated last year
- Personalized LLM Agents. ★106 · Updated last year
- The open-source autonomous agent LLM initiative. ★90 · Updated 9 months ago
- Demo of AI chatbot that predicts user message to generate response quickly. ★99 · Updated 8 months ago
- DIY simulacra: build and run your own simulation. ★25 · Updated last year
- ★81 · Updated 11 months ago
- build cognitive systems, pythonic. ★326 · Updated this week
- Open-source framework that gives you AI Agents that help you navigate decision-making, get personalized goals and execute them. ★150 · Updated 2 weeks ago
- Announcing Instructor Cloud. ★34 · Updated 3 months ago
- ★114 · Updated 5 months ago
- Fluid Database. ★114 · Updated 2 months ago
- Conduct consumer interviews with synthetic focus groups using LLMs and LangChain. ★43 · Updated last year
- Playground for various AI projects and demos. ★184 · Updated last year
- CLAIRe: Conversational Learning AI with Recall. ★68 · Updated last year
- A Personalised AI Assistant Inspired by 'Diamond Age', Powered by SMS. ★92 · Updated last year
- A Ruby on Rails style framework for the DSPy (Demonstrate, Search, Predict) project for Language Models like GPT, BERT, and LLama. ★111 · Updated last month
- A spotify playlist agent using CrewAI. ★81 · Updated 5 months ago
- A langchain app to visualise a debate using Tree-of-Thought reasoning. ★56 · Updated 8 months ago
- ★68 · Updated last year
- Globot is an agent that controls your browser using playwright and GPT-4V. ★132 · Updated 10 months ago
- Pull high-quality, efficient embeddings for PubMed, arXiv and Wikipedia from Huggingface and use for local LLM inference/Retrieval Augmen… ★39 · Updated 9 months ago
- A Python package to dynamically load functions for OpenAI Assistant. ★55 · Updated 11 months ago
- ★87 · Updated last year
- LUI: Autonomous Collective Decision Making via Large Language Models. ★104 · Updated last year
- Turns an Airtable base into a WebGL knowledge graph leveraging relational columns. ★35 · Updated 6 months ago
- self-improving user memory framework for conversational AI apps. ★145 · Updated last week