huridocs / pdf-reading-order
☆11Updated 6 months ago
Related projects
Alternatives and complementary repositories for pdf-reading-order
- Large-scale query-focused multi-document Summarization dataset☆10Updated 3 years ago
- NLG Best Practices for Data-Efficient Modeling: How to Train Production-Ready Models with Little Data☆11Updated 3 years ago
- Segmenting a given document using recursive xy-cut algorithm.☆12Updated 6 years ago
- Using short models to classify long texts☆20Updated last year
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated last week
- The collection of building blocks for building fine-tunable metric learning models☆32Updated last month
- Lightweight Non-Parametric Embedding Fine-Tuning☆17Updated last month
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & its caption derived f…☆15Updated 3 years ago
- ☆14Updated 3 weeks ago
- No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrieval☆27Updated 2 years ago
- ☆27Updated last year
- Small python package to measure OCR quality and other related metrics.☆20Updated 8 months ago
- Official repository for RAGVIZ: Diagnose and Visualize Retrieval-Augmented Generation☆21Updated 3 weeks ago
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆33Updated last year
- ☆19Updated last year
- Tools for merging pretrained large language models.☆19Updated 5 months ago
- Presents an optimized Apache Beam pipeline for generating sentence embeddings (runnable on Cloud Dataflow).☆20Updated 2 years ago
- Code for the paper "Accessing higher dimensions for unsupervised word translation"☆21Updated last year
- ☆14Updated last month
- A dashboard for exploring timm learning rate schedulers☆18Updated last year
- Benchmarks for Business Document Foundation Models☆11Updated 7 months ago
- Comparing PyTorch, JIT and ONNX for inference with Transformers☆17Updated 3 years ago
- ☆12Updated 3 weeks ago
- 🦖 X—LLM: Simple & Cutting Edge LLM Finetuning☆11Updated 11 months ago
- Index of URLs to pdf files all over the internet and scripts☆21Updated last year
- Zeta implementation of a reusable, plug-and-play feedforward from the paper "Exponentially Faster Language Modeling"☆15Updated this week
- ☆20Updated this week