docugami / DFM-benchmarks
Benchmarks for Business Document Foundation Models
☆11Updated 7 months ago
Related projects ⓘ
Alternatives and complementary repositories for DFM-benchmarks
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆68Updated 3 weeks ago
- Official repository for RAGVIZ: Diagnose and Visualize Retrieval-Augmented Generation☆21Updated this week
- Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents☆22Updated 2 years ago
- This is the official PyTorch repo for "UNIREX: A Unified Learning Framework for Language Model Rationale Extraction" (ICML 2022).☆23Updated last year
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval☆14Updated 9 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated 8 months ago
- ☆41Updated last month
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated last year
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆36Updated 7 months ago
- Entailment self-training☆25Updated last year
- PyTorch implementation for MRL☆18Updated 8 months ago
- ☆24Updated last year
- Download, parse, and filter data from Phil Papers. Data-ready for The-Pile.☆12Updated last year
- ☆40Updated this week
- Embedding Recycling for Language models☆38Updated last year
- Multi-Modal Language Modeling with Image, Audio and Text Integration, included multi-images and multi-audio in a single multiturn.☆13Updated 8 months ago
- ☆14Updated 11 months ago
- Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval☆24Updated last week
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆48Updated 4 months ago
- The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers"☆19Updated 8 months ago
- Index of URLs to pdf files all over the internet and scripts☆21Updated last year
- Using short models to classify long texts☆20Updated last year
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated last week
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆34Updated last year
- Ranking of fine-tuned HF models as base models.☆35Updated last year
- Finding semantically meaningful and accurate prompts.☆46Updated last year
- Tools for merging pretrained large language models.☆19Updated 5 months ago
- PyTorch code for System-1.x: Learning to Balance Fast and Slow Planning with Language Models☆19Updated 3 months ago