Travvy88 / DocumentGenerator_DoGeLinks
Synthetic Document Generator for Document AI. Creates document images annotated with text and bounding boxes of each word. Images contain headings, tables, paragraphs with different formatting and fonts. Can be used in OCR, document transformers pretraining, text detection and more other tasks.
☆25Updated 2 weeks ago
Alternatives and similar repositories for DocumentGenerator_DoGe
Users that are interested in DocumentGenerator_DoGe are comparing it to the libraries listed below
Sorting:
- Tools and agents for automated research.☆34Updated this week
- GigaChain telegram bot example for technical support☆34Updated 7 months ago
- Effective LLM Alignment Toolkit☆139Updated last month
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆43Updated 4 months ago
- Проект языковой модели для проведения морфемного анализа, сегментации и токенизации слов русского языка.☆16Updated 7 months ago
- Scripts and stuff☆18Updated 2 years ago
- Handwritten Text Generation☆17Updated 2 years ago
- Простой нормализатор текстов перед синтезом речи☆33Updated last year
- Boost your efficiency with Fish Speech Batch Inference. Easily process multiple texts and achieve consistently great results. 🗨️🐟☆19Updated last week
- Бенчмарк сравнивает русские аналоги ChatGPT: Saiga, YandexGPT, Gigachat☆60Updated last year
- Top ML papers of the week.☆38Updated this week
- Библиотека распознавания документов удостоверяющих личность РФ☆24Updated 2 months ago
- ☆11Updated 2 years ago
- CLIP implementation for Russian language☆146Updated last year
- ☆48Updated last month
- ☆22Updated last year
- Framework for processing and filtering datasets☆27Updated last year
- The project to find correlation between tweets and future stock prices☆12Updated 2 years ago
- ☆46Updated 2 years ago
- Russian Text Expansion based on ruGPT3Large☆25Updated 3 years ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating SOTA mode…☆27Updated 4 months ago
- Bunch of notebooks for pre-training custom Saiga-like LLM☆13Updated last year
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament…☆61Updated 10 months ago
- Augmentex — a library for augmenting texts with errors☆65Updated last year
- Thin wrapper around OpenAI Whisper API with streaming support☆89Updated 6 months ago
- A set of scripts and configurations for pretraining of Large Language Models (LLM)☆31Updated 5 months ago
- SAGE: Spelling correction, corruption and evaluation for multiple languages☆158Updated 7 months ago
- Yet another common Python wrapper for Alice and Salut skills and bots in Telegram, VK, and Facebook☆27Updated 2 years ago
- Private chat bot on aiogram for limited access to ChatGPT with custom personalities.☆30Updated 5 months ago
- Примеры продвинутого RAG☆37Updated 10 months ago