craffel / llm-seminarView external linksLinks
Seminar on Large Language Models (COMP790-101 at UNC Chapel Hill, Fall 2022)
☆314Nov 21, 2022Updated 3 years ago
Alternatives and similar repositories for llm-seminar
Users that are interested in llm-seminar are comparing it to the libraries listed below
Sorting:
- Course repository for the Spring 2023 COMP664 course "Deep Learning" at UNC☆14Apr 17, 2023Updated 2 years ago
- ReCross: Unsupervised Cross-Task Generalization via Retrieval Augmentation☆24May 1, 2022Updated 3 years ago
- Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"☆457Sep 6, 2023Updated 2 years ago
- ☆72May 22, 2023Updated 2 years ago
- Course repository for the Spring 2022 COMP790 course "Deep Learning" at UNC☆19Apr 13, 2022Updated 3 years ago
- ☆15Aug 18, 2022Updated 3 years ago
- 🤖ConvRe🤯: An Investigation of LLMs’ Inefficacy in Understanding Converse Relations (EMNLP 2023)☆24Oct 10, 2023Updated 2 years ago
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning☆98Apr 26, 2023Updated 2 years ago
- ☆99Jul 25, 2023Updated 2 years ago
- This repository contains the official code for the paper: "Prompt Injection: Parameterization of Fixed Inputs"☆32Sep 13, 2024Updated last year
- The original implementation of Min et al. "Nonparametric Masked Language Modeling" (paper https//arxiv.org/abs/2212.01349)☆158Jan 6, 2023Updated 3 years ago
- ☆290Dec 2, 2022Updated 3 years ago
- [EMNLP 2021] Improving and Simplifying Pattern Exploiting Training☆152Jun 10, 2022Updated 3 years ago
- git extension for {collaborative, communal, continual} model development☆217Nov 14, 2024Updated last year
- Pipeline for pulling and processing online language model pretraining data from the web☆176Jul 31, 2023Updated 2 years ago
- ☆77Apr 29, 2024Updated last year
- SILO Language Models code repository☆83Feb 23, 2024Updated last year
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …☆11Mar 18, 2023Updated 2 years ago
- [TACL 2024] Improving Probability-based Prompt Selection Through Unified Evaluation and Analysis☆11Nov 14, 2024Updated last year
- Toolkit for creating, sharing and using natural language prompts.☆2,997Oct 23, 2023Updated 2 years ago
- This repo contains code to reproduce some of the results presented in the paper "SentenceMIM: A Latent Variable Language Model"☆28Jun 22, 2022Updated 3 years ago
- [NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation☆475Mar 7, 2024Updated last year
- The official implemetation of "Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks" (NAACL 2022).☆44Dec 25, 2022Updated 3 years ago
- ☆91May 21, 2022Updated 3 years ago
- ☆184May 26, 2023Updated 2 years ago
- Scaling Data-Constrained Language Models☆340Jun 28, 2025Updated 7 months ago
- https://arxiv.org/abs/2404.10917☆14Mar 18, 2025Updated 10 months ago
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Models☆15Mar 8, 2023Updated 2 years ago
- Reading list of Instruction-tuning. A trend starts from Natrural-Instruction (ACL 2022), FLAN (ICLR 2022) and T0 (ICLR 2022).☆766Jul 20, 2023Updated 2 years ago
- EMNLP 2021 - Frustratingly Simple Pretraining Alternatives to Masked Language Modeling☆34Nov 21, 2021Updated 4 years ago
- Course repository for the Spring COMP790 course "Deep Learning" at UNC☆23Feb 2, 2022Updated 4 years ago
- ☆24Dec 2, 2023Updated 2 years ago
- [EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674☆195Jun 14, 2023Updated 2 years ago
- [SUKI'22] Table Retrieval May Not Necessitate Table-Specific Model Design☆22Sep 23, 2022Updated 3 years ago
- ☆2,947Jan 15, 2026Updated last month
- Course materials for 11-767☆13Nov 10, 2022Updated 3 years ago
- IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…☆32Jun 13, 2024Updated last year
- [EACL'23] MCoNaLa: A Benchmark for Code Generation from Multiple Natural Languages☆23Feb 13, 2023Updated 3 years ago
- [ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalization☆29Sep 12, 2024Updated last year