Travvy88 / DocumentGenerator_DoGeLinks
Synthetic Document Generator for Document AI. Creates document images annotated with text and bounding boxes of each word. Images contain headings, tables, paragraphs with different formatting and fonts. Can be used in OCR, document transformers pretraining, text detection and more other tasks.
☆28Updated 4 months ago
Alternatives and similar repositories for DocumentGenerator_DoGe
Users that are interested in DocumentGenerator_DoGe are comparing it to the libraries listed below
Sorting:
- Tools and agents for automated research.☆46Updated 3 weeks ago
- Scripts and stuff☆18Updated 2 years ago
- LLM-based meme generator with templates☆13Updated 7 months ago
- Russian Text Expansion based on ruGPT3Large☆25Updated 3 years ago
- Aggregation framework for annotating datasets in computer vision tasks (detection, segmentation, video captioning etc.)☆11Updated last year
- Handwritten Text Generation☆17Updated 3 years ago
- Effective LLM Alignment Toolkit☆150Updated 5 months ago
- Training and data processing code for Saiga☆53Updated 4 months ago
- OmniFusion — a multimodal model to communicate using text and images☆233Updated last year
- Universal LLM Telegram chatbot in Python☆17Updated last year
- Thin wrapper around OpenAI Whisper API with streaming support☆89Updated 10 months ago
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆45Updated 8 months ago
- GigaChain telegram bot example for technical support☆37Updated 11 months ago
- CLIP implementation for Russian language☆147Updated 2 years ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating SOTA mode…☆37Updated last month
- Embedding Studio is a framework which allows you transform your Vector Database into a feature-rich Search Engine.☆382Updated 7 months ago
- LangChain-compatible integrations with YandexGPT and YandexGPT Embeddings☆44Updated 6 months ago
- A set of scripts and configurations for pretraining of Large Language Models (LLM)☆34Updated 8 months ago
- SirChatalot is a Telegram bot leveraging ChatGPT, Claude or YandexGPT. It uses Whisper for speech-to-text and DALL-E, Stability AI or Yan…☆72Updated 8 months ago
- Примеры продвинутого RAG☆40Updated last year
- Бенчмарк сравнивает русские аналоги ChatGPT: Saiga, YandexGPT, Gigachat☆60Updated 2 years ago
- ☆22Updated 2 years ago
- Top ML papers of the week.☆41Updated 3 weeks ago
- ☆17Updated 2 years ago
- Dialoqbase Lite is a Chrome extension that offers a web-based UI and a side panel, Copilot, designed specifically for almost all AI provi…☆43Updated 7 months ago
- ExplainitAll — это библиотека для интерпретируемого ИИ, предназначенная для интерпретации генеративных моделей ( GPT-like), и векторизато…☆19Updated last year
- Automatic Prompt Optimization Framework☆52Updated this week
- LM Studio: RAG (Retrieval-Augmented Generation) Local LLM vs GPT-4☆22Updated last year
- Telegram bot for different language models. Supports system prompts and images☆63Updated 5 months ago
- 2D Positional Embeddings for Webpage Structural Understanding 🦙👀☆95Updated last year