ElvisClaros / GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
☆14Updated last month
Related projects ⓘ
Alternatives and complementary repositories for GOT-OCR2.0
- Luann allows you to create a LLM agent,which has complete memory module (long-term memory, short-term memory) and knowledge module(Variou…☆16Updated this week
- Tutorials from AutoGen Basics to Use Cases☆27Updated 11 months ago
- AIPE (AI Pipeline Engine) is a flexible and powerful tool for creating and executing complex AI workflows☆19Updated 3 months ago
- g1: Using GPT-4o to create o1-like reasoning chains☆20Updated last month
- Chat with Phi 3.5/3 Vision LLMs. Phi-3.5-vision is a lightweight, state-of-the-art open multimodal model built upon datasets which includ…☆31Updated last month
- Cross Platform Open Sourced Chinese NoteBookLM app based on Electron, Use DeepSeek + Reecho.ai☆40Updated last week
- 通过该项目将Dify通过Pipeline接入OpenwebUI,可以兼并OpenwebUI的前端优势和相应生态以及Dify强大的模型可拓展性和Workflow的效益。☆9Updated 2 months ago
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with…☆22Updated 10 months ago
- ☆57Updated last month
- time based thinking and structure like OpenAI's o1 preview.☆11Updated last month
- ☆51Updated 3 months ago
- ☆12Updated 3 weeks ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆54Updated 2 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆20Updated 9 months ago
- This project breathes life into video characters by using AI to describe their personality and then chat with you as them.☆45Updated 8 months ago
- OpenMindedChatbot is a Proof Of Concept that leverages the power of Open source Large Language Models (LLM) with Function Calling capabil…☆28Updated 10 months ago
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆23Updated last month
- Open source and AI-powered web search engine: local, private, dockerized and supported by a fluffy llama🦙☆51Updated 3 months ago
- ☆29Updated 11 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆53Updated 2 weeks ago
- ☆19Updated 8 months ago
- 与 https://github.com/tonori/mem0ai-api 配合使用的非官方的 mem0ai provider.☆33Updated 3 months ago
- LLM reads a paper and produce a working prototype☆34Updated this week
- ☆12Updated 7 months ago
- High level tool use for LLMs☆34Updated 3 months ago
- A simple AI Agent Framework using table software like Excel/Google Sheets as GUI☆14Updated 4 months ago
- 🍳 AyaMCooking is a Voice-to-Voice Mutli-lingual RAG Agent that makes a perfect sous chef for your kitchen, in upto 10 Languages 🤌🧑🍳☆17Updated 2 weeks ago
- 🧩 / ● Open Interpreter - This plugin integrates Open Interpreter into LobeChat, allowing you to control your computer with a chat interf…☆19Updated 10 months ago
- ☆35Updated last year