Travvy88 / DocumentGenerator_DoGeLinks
Synthetic Document Generator for Document AI. Creates document images annotated with text and bounding boxes of each word. Images contain headings, tables, paragraphs with different formatting and fonts. Can be used in OCR, document transformers pretraining, text detection and more other tasks.
☆29Updated 6 months ago
Alternatives and similar repositories for DocumentGenerator_DoGe
Users that are interested in DocumentGenerator_DoGe are comparing it to the libraries listed below
Sorting:
- Tools and agents for automated research.☆48Updated last month
- Automatic Prompt Optimization Framework☆170Updated this week
- Scripts and stuff☆18Updated 2 years ago
- Проект языковой модели для проведения морфемного анализа, сегментации и токенизации слов русского языка.☆16Updated last year
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆45Updated 10 months ago
- LLM-based meme generator with templates☆13Updated 2 months ago
- Aggregation framework for annotating datasets in computer vision tasks (detection, segmentation, video captioning etc.)☆11Updated last year
- An easy-to-run OCR model pipeline based on CRNN and CTC loss☆49Updated 4 months ago
- Библиотека распознавания документов удостоверяющих личность РФ☆40Updated 8 months ago
- GigaChain telegram bot example for technical support☆36Updated last year
- Effective LLM Alignment Toolkit☆152Updated 7 months ago
- Примеры продвинутого RAG☆40Updated last year
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating SOTA mode…☆39Updated last week
- Training and data processing code for Saiga☆54Updated 3 weeks ago
- Бенчмарк сравнивает русские аналоги ChatGPT: Saiga, YandexGPT, Gigachat☆61Updated 2 years ago
- Handwritten Text Generation☆17Updated 3 years ago
- LangChain-compatible integrations with YandexGPT and YandexGPT Embeddings☆43Updated 9 months ago
- Russian Text Expansion based on ruGPT3Large☆25Updated 3 years ago
- ☆31Updated last year
- OmniFusion — a multimodal model to communicate using text and images☆234Updated last year
- Telegram MCP Server and HTTP-MTProto bridge | Multi-user auth, intelligent search, file sending, web setup | Docker & PyPI ready☆19Updated last week
- ☆22Updated 2 years ago
- ☆47Updated 3 years ago
- Framework for processing and filtering datasets☆31Updated last year
- GigaAgent — это универсальный агент-оркестратор для решения широкого круга задач (ReAct + REPL)☆145Updated 2 weeks ago
- Telegram bot for different language models. Supports system prompts and images☆63Updated 7 months ago
- Thin wrapper around OpenAI Whisper API with streaming support☆86Updated last month
- CLIP implementation for Russian language☆148Updated 2 years ago
- Boost your efficiency with Fish Speech Batch Inference. Easily process multiple texts and achieve consistently great results. 🗨️🐟☆25Updated 5 months ago
- По возможности актуальная информация по ИИ + ресерчи от ChatGPT☆32Updated last month