phkhanhtrinh23 / translation_layoutrecoveryLinks
This is a project that translates a .pdf file, preserving the original layout of that .pdf file. [UPDATED] We have achieved the Second Prize of the Cinnamon AI Bootcamp 2023.
☆105Updated 10 months ago
Alternatives and similar repositories for translation_layoutrecovery
Users that are interested in translation_layoutrecovery are comparing it to the libraries listed below
Sorting:
- Question Answering in Vietnamese. In a nutshell, this project helps us answer a Question of a given Context in Vietnamese. [UPDATED] This…☆25Updated 2 years ago
- Object Detection Model for Scanned Documents☆94Updated 6 months ago
- JARVIS Chatbot: a local simple RAG assistant with PDF files☆27Updated 2 weeks ago
- This spelling correction project helps people fix English spelling mistakes. It includes two options: Option 1 uses the Transformer model…☆18Updated 2 years ago
- Translate large dataset to any language with google translation api and multithreads processing, no key required!☆72Updated 11 months ago
- Ollama Mistral with Langchain RAG Agent and Custom tools☆13Updated last year
- Japanese Spelling Correction - JSC☆14Updated 2 years ago
- ☆17Updated last year
- wav2graph: A Framework for Supervised Learning Knowledge Graph from Speech☆95Updated 2 months ago
- translate PDF files using GPT☆143Updated last year
- Dự án bao gồm: 1. Xây dựng bộ dữ Instructions Vietnamese (chất lượng, nhiều, và đa dạng). 2.LLM Training, Finetuning, Evaluating & Testin…☆263Updated 3 weeks ago
- Ai cũng có thể tự tạo chatbot bằng huấn luyện chỉ dẫn, với 12G GPU (RTX 3060) và khoảng vài chục MB dữ liệu☆113Updated 2 years ago
- ToRoLaMa: The Vietnamese Instruction-Following and Chat Model☆24Updated last year
- Checkbox Detection Model for Scanned Documents☆87Updated 6 months ago
- A Vietnamese Text-to-Speech library that provides high-quality speech synthesis with voice cloning capabilities☆86Updated 2 months ago
- "LAMPAT: Low-rank Adaptation Multilingual Paraphrasing using Adversarial Training" has been accepted at The 38th AAAI Conference on Artif…☆15Updated last year
- Xây dựng tập dữ liệu 500GB (20% done) văn bản tiếng Việt để huấn luyện mô hình ngôn ngữ lớn☆28Updated 2 years ago
- ☆66Updated 2 years ago
- ☆72Updated last year
- Bud500: A Comprehensive Vietnamese ASR Dataset☆67Updated last year
- TF-ID: Table/Figure IDentifier for academic papers☆240Updated last year
- Codebase for fine-tuning / evaluating nougat-based image2latex generation models☆157Updated last year
- ☆13Updated last year
- Baseline for ZaloAI Challenge 2023 Elementary Math Solving☆70Updated last year
- ☆68Updated last year
- Easy-Translate is a script for translating large text files with a SINGLE COMMAND. Easy-Translate is designed to be as easy as possible f…☆220Updated 10 months ago
- This is a project about Optical Character Recognition (OCR) in Vietnamese texts by using PaddleOCR and VietOCR.☆22Updated last year
- This is our solution dealing with BKAI challenge☆65Updated 3 years ago
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.☆278Updated last month
- A toolbox for Vietnamese Optical Character Recognition.☆125Updated 2 years ago