MapariAbdullah / Llama2-Custom-document-QA
Using open-source LLM Llama2 by Meta on local CPU inference for document question-and-answer
☆15Updated 2 years ago
Alternatives and similar repositories for Llama2-Custom-document-QA
Users who are interested in Llama2-Custom-document-QA are comparing it to the libraries listed below
- The largest VQA dataset for Vietnamese. Related to the text content in the image.☆21Updated 6 months ago
- MagFace Triton Inference Server using TensorRT☆17Updated 3 years ago
- ☆11Updated last year
- An unofficial Implementation of DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents☆37Updated 2 years ago
- General template for most Pytorch projects☆35Updated 6 months ago
- Advanced inference pipeline using NVIDIA Triton Inference Server for CRAFT Text detection (Pytorch), included converter from Pytorch -> O…☆33Updated 4 years ago
- Synthetic identity documents dataset☆28Updated 7 months ago
- Create TensorRT-runtime for vietocr☆12Updated 4 years ago
- ToRoLaMa: The Vietnamese Instruction-Following and Chat Model☆24Updated last year
- Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta☆16Updated 11 months ago
- ☆54Updated 2 years ago
- 👨🏻‍💻 Code release for Vietnamese chatbot from scratch [Published in IEEE IMCOM 2022]☆17Updated last year
- DaisyKit is an easy AI toolkit with face mask detection, pose detection, background matting, barcode detection, face recognition and more…☆109Updated 2 years ago
- Set up CI in DL/ cuda/ cudnn/ TensorRT/ onnx2trt/ onnxruntime/ onnxsim/ Pytorch/ Triton-Inference-Server/ Bazel/ Tesseract/ PaddleOCR/ NV…☆43Updated 2 years ago
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model…☆37Updated 2 years ago
- Implementation of the DocLLM paper for Llama models.☆13Updated 6 months ago
- Cross-lingual learning in scene text recognition (ICASSP2024)☆17Updated last year
- This repo shows how to build a full working example that serves your model using asynchronous Celery tasks and FastAPI. 🔥 …☆31Updated last year
- The task aims to extract required fields from receipts captured by mobile devices☆33Updated 2 years ago
- Our idea is to combine the power of computer vision model and LLMs. We use YOLO, CLIP and DINOv2 to extract high-level features from imag…☆116Updated 2 years ago
- Count GitHub Stars ⭐☆33Updated this week
- Implementation for the paper 'YOLO-ReT: Towards High Accuracy Real-time Object Detection on Edge GPUs'☆99Updated 2 years ago
- Wanwu models release, code will be released soon☆24Updated 3 years ago
- High-Performance Transformers for Table Structure Recognition Need Early Convolutions☆44Updated last year
- Key information extraction from invoice document with Graph Convolution Network☆55Updated 2 years ago
- Composition of Multimodal Language Models From Scratch☆15Updated last year
- WikiTableSet: The largest publicly available image-based table recognition dataset in three languages, built from Wikipedia☆31Updated 4 months ago
- Exploration of Adept's multimodal Fuyu-8B model. 🤓 🔍☆27Updated last year
- An SDK for Transformers + YOLO and other SSD family models☆64Updated 9 months ago
- Torchserve + TensorRT + Detection☆19Updated 3 years ago