Travvy88 / DocumentGenerator_DoGe
Synthetic Document Generator for Document AI. Creates document images annotated with the text and bounding box of each word. Images contain headings, tables, and paragraphs with varied formatting and fonts. Can be used for OCR, pretraining document transformers, text detection, and other tasks.
☆25Updated last month
Alternatives and similar repositories for DocumentGenerator_DoGe
Users who are interested in DocumentGenerator_DoGe are comparing it to the libraries listed below
- Tools and agents for automated research.☆37Updated this week
- Handwritten Text Generation☆17Updated 2 years ago
- Scripts and stuff☆18Updated 2 years ago
- ☆46Updated 2 years ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating SOTA mode…☆32Updated 3 weeks ago
- GigaChain telegram bot example for technical support☆35Updated 8 months ago
- An easy-to-run OCR model pipeline based on CRNN and CTC loss☆47Updated 2 years ago
- Effective LLM Alignment Toolkit☆141Updated 2 months ago
- Framework for processing and filtering datasets☆27Updated last year
- Aggregation framework for annotating datasets in computer vision tasks (detection, segmentation, video captioning etc.)☆11Updated 10 months ago
- AI information kept as up to date as possible, plus research from ChatGPT☆21Updated 2 months ago
- SirChatalot is a Telegram bot leveraging ChatGPT, Claude or YandexGPT. It uses Whisper for speech-to-text and DALL-E, Stability AI or Yan…☆72Updated 6 months ago
- A language model project for morphemic analysis, segmentation, and tokenization of Russian words.☆15Updated 8 months ago
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆44Updated 6 months ago
- ☆11Updated 2 years ago
- LLM-based meme generator with templates☆13Updated 5 months ago
- Top ML papers of the week.☆40Updated this week
- A benchmark comparing Russian counterparts of ChatGPT: Saiga, YandexGPT, Gigachat☆60Updated last year
- A Python-based voice assistant integrating speech-to-text (STT), text-to-speech (TTS), and powerful AI capabilities using either a local …☆13Updated 6 months ago
- OmniFusion — a multimodal model to communicate using text and images☆233Updated last year
- A simple text normalizer for use before speech synthesis☆38Updated last year
- Bunch of notebooks for pre-training custom Saiga-like LLM☆12Updated last year
- CLIP implementation for Russian language☆146Updated last year
- Thin wrapper around OpenAI Whisper API with streaming support☆89Updated 8 months ago
- Telegram bot for different language models. Supports system prompts and images☆60Updated 2 months ago
- The project to find correlation between tweets and future stock prices☆12Updated 2 years ago
- Private chat bot on aiogram for limited access to ChatGPT with custom personalities.☆30Updated 6 months ago
- ☆31Updated 11 months ago
- Dialoqbase Lite is a Chrome extension that offers a web-based UI and a side panel, Copilot, designed specifically for almost all AI provi…☆43Updated 4 months ago
- Yet another common Python wrapper for Alice and Salut skills and bots in Telegram, VK, and Facebook☆28Updated 2 years ago