Travvy88 / DocumentGenerator_DoGeLinks
Synthetic Document Generator for Document AI. Creates document images annotated with text and bounding boxes of each word. Images contain headings, tables, paragraphs with different formatting and fonts. Can be used in OCR, document transformers pretraining, text detection and more other tasks.
☆29Updated 5 months ago
Alternatives and similar repositories for DocumentGenerator_DoGe
Users that are interested in DocumentGenerator_DoGe are comparing it to the libraries listed below
Sorting:
- Tools and agents for automated research.☆47Updated last month
- Automatic Prompt Optimization Framework☆157Updated 2 weeks ago
- Aggregation framework for annotating datasets in computer vision tasks (detection, segmentation, video captioning etc.)☆11Updated last year
- Scripts and stuff☆18Updated 2 years ago
- ☆12Updated 2 years ago
- Hector RAG is a modular RAG framework built on PostgreSQL, offering advanced retrieval methods and fusion techniques for AI-driven applic…☆60Updated 10 months ago
- Библиотека распознавания документов удостоверяющих личность РФ☆38Updated 7 months ago
- Effective LLM Alignment Toolkit☆151Updated 6 months ago
- GigaChain telegram bot example for technical support☆37Updated last year
- An easy-to-run OCR model pipeline based on CRNN and CTC loss☆48Updated 3 months ago
- AI-powered text compression tool that condenses content while preserving meaning across multiple formats.☆23Updated last year
- LangChain-compatible integrations with YandexGPT and YandexGPT Embeddings☆44Updated 8 months ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating SOTA mode…☆38Updated 3 months ago
- Universal LLM Telegram chatbot in Python☆17Updated last year
- Talk to YouTube☆41Updated 2 years ago
- Dialoqbase Lite is a Chrome extension that offers a web-based UI and a side panel, Copilot, designed specifically for almost all AI provi…☆43Updated 8 months ago
- A set of scripts and configurations for pretraining of Large Language Models (LLM)☆36Updated 10 months ago
- XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning☆14Updated last year
- LM Studio: RAG (Retrieval-Augmented Generation) Local LLM vs GPT-4☆21Updated last year
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆45Updated 9 months ago
- GigaAgent — это универсальный агент-оркестратор для решения широкого круга задач (ReAct + REPL)☆140Updated 2 weeks ago
- OmniFusion — a multimodal model to communicate using text and images☆234Updated last year
- SirChatalot is a Telegram bot leveraging ChatGPT, Claude or YandexGPT. It uses Whisper for speech-to-text and DALL-E, Stability AI or Yan…☆72Updated 10 months ago
- ☆31Updated last year
- ☆47Updated 3 years ago
- Thin wrapper around OpenAI Whisper API with streaming support☆86Updated last month
- OpenAPI-like API-server for voice generation (TTS) based on fish-speech-1.5 model.☆28Updated 7 months ago
- Handwritten Text Generation☆17Updated 3 years ago
- Telegram bot that interacts with the local Ollama 🦙 to answer user messages☆17Updated last year
- Figma MCP server for accelerating layout development☆17Updated 4 months ago