Upaya07 / NeurIPS-llm-efficiency-challenge
Code for NeurIPS LLM Efficiency Challenge
β57Updated last year
Alternatives and similar repositories for NeurIPS-llm-efficiency-challenge:
Users that are interested in NeurIPS-llm-efficiency-challenge are comparing it to the libraries listed below
- β47Updated 7 months ago
- Small and Efficient Mathematical Reasoning LLMsβ71Updated last year
- Lightweight demos for finetuning LLMs. Powered by π€ transformers and open-source datasets.β73Updated 5 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignmentβ55Updated 7 months ago
- β24Updated last year
- β15Updated last year
- β48Updated 5 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.β33Updated last year
- Codebase accompanying the Summary of a Haystack paper.β77Updated 6 months ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning Pβ¦β34Updated last year
- PyTorch implementation for MRLβ18Updated last year
- A library for squeakily cleaning and filtering language datasets.β46Updated last year
- β33Updated 9 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β49Updated 9 months ago
- β34Updated 9 months ago
- Set of scripts to finetune LLMsβ37Updated last year
- β40Updated 2 months ago
- QLoRA with Enhanced Multi GPU Supportβ37Updated last year
- β44Updated 4 months ago
- Adversarial Training and SFT for Bot Safety Modelsβ39Updated last year
- β67Updated 7 months ago
- QLoRA for Masked Language Modelingβ22Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorerβ41Updated last year
- A repository for research on medium sized language models.β76Updated 10 months ago
- Aioli: A unified optimization framework for language model data mixingβ23Updated 2 months ago
- β46Updated 5 months ago
- Collection of autoregressive model implementationβ85Updated last month
- ReBase: Training Task Experts through Retrieval Based Distillationβ28Updated 2 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked promptsβ24Updated last year
- Implementation of "SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models"β27Updated last month