elicit / fave-dataset

Paper dataset for "Factored Verification: Detecting and Reducing Hallucination in Summaries of Academic Papers"

☆13

Alternatives and similar repositories for fave-dataset

Users who are interested in fave-dataset are comparing it to the repositories listed below.

LLM360 / crystalcoder-data-prep
Data preparation code for CrystalCoder 7B LLM
☆45 Updated last year
kyegomez / LM-Infinite
Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"
☆40 Updated last year
recursal / GoldFinch-paper
GoldFinch and other hybrid transformer components
☆45 Updated last year
ContextualAI / CLAIR_and_APO
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆60 Updated last year
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆58 Updated last month
TRI-ML / linear_open_lm
A repository for research on medium-sized language models.
☆78 Updated last year
arcee-ai / DAM
☆55 Updated last year
HazyResearch / aioli
Aioli: A unified optimization framework for language model data mixing
☆31 Updated 10 months ago
catid / lllm
Latent Large Language Models
☆19 Updated last year
scottlogic-alex / prm800k-denorm
Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format
☆27 Updated 2 years ago
da03 / WildVisualizer
☆25 Updated last week
RobertCsordas / moe
Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"
☆38 Updated 5 months ago
likenneth / persona_drift
Measuring and Controlling Persona Drift in Language Model Dialogs
☆20 Updated last year
argilla-io / distilabel-spin-dibt
Repository containing the SPIN experiments on the DIBT 10k ranked prompts
☆24 Updated last year
Alex-Gurung / ReasoningNCP
Official repo for Learning to Reason for Long-Form Story Generation
☆72 Updated 7 months ago
tanyuqian / cappy
NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer
☆44 Updated last year
Zyphra / Zyda_processing
☆39 Updated last year
ElleLeonne / Lightning-ReLoRA
A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite.
☆35 Updated last year
Aleph-Alpha-Research / trigrams
☆58 Updated last week
VITA-Group / o1-planning
[NeurIPS'24 LanGame workshop] On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability
☆41 Updated 4 months ago
para-lost / ReBase
ReBase: Training Task Experts through Retrieval Based Distillation
☆29 Updated 9 months ago
zhangir-azerbayev / proof-pile
Scripts for downloading and pre-processing the `proof-pile`, a high-quality dataset of mathematical text and code.
☆21 Updated 3 years ago
nec-research / agentquest
☆28 Updated 7 months ago
kyegomez / OpenStrawberry
An open-source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO
☆29 Updated this week
prateeky2806 / ComPEFT
☆26 Updated 2 years ago
austrian-code-wizard / c3po
☆29 Updated 3 months ago
ctlllll / understanding_llm_benchmarks
Understanding the correlation between different LLM benchmarks
☆29 Updated last year
kaiokendev / cutoff-len-is-context-len
Demonstration that finetuning a RoPE model on sequences longer than the pre-trained model's context length adapts the model's context limit
☆63 Updated 2 years ago
huggingface / feel
☆14 Updated 5 months ago
automix-llm / automix
Mixing Language Models with Self-Verification and Meta-Verification
☆110 Updated 11 months ago