neubig / minllama-assignment
☆47 · Updated this week
Related projects:
- An assignment for building an NLP system from scratch. ☆16 · Updated 6 months ago
- ☆139 · Updated 8 months ago
- CS 224N Winter 2023 Default Final Project: Multitask BERT ☆24 · Updated last year
- Minimalist BERT implementation assignment for CS11-711 ☆73 · Updated last year
- ☆115 · Updated 3 months ago
- ☆87 · Updated 3 months ago
- ☆170 · Updated last month
- ☆74 · Updated this week
- Website ☆47 · Updated last year
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets ☆174 · Updated last week
- datasets from the paper "Towards Understanding Sycophancy in Language Models" ☆59 · Updated 10 months ago
- ☆45 · Updated 7 months ago
- A Survey on Data Selection for Language Models ☆148 · Updated 3 months ago
- Data and code for the preprint "In-Context Learning with Long-Context Models: An In-Depth Exploration" ☆25 · Updated last month
- ☆105 · Updated this week
- Evaluating LLMs with fewer examples ☆131 · Updated 5 months ago
- A Multilingual Replicable Instruction-Following Model ☆91 · Updated last year
- Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers". ☆48 · Updated 6 months ago
- Project 2 (Building Large Language Models) for Stanford CS324: Understanding and Developing Large Language Models (Winter 2022) ☆101 · Updated last year
- This project studies the performance and robustness of language models and task-adaptation methods. ☆141 · Updated 4 months ago
- ☆38 · Updated 5 months ago
- ☆114 · Updated 2 weeks ago
- open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality ☆135 · Updated last month
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl… ☆52 · Updated last month
- Codebase accompanying the Summary of a Haystack paper. ☆65 · Updated 2 months ago
- Supercharge huggingface transformers with model parallelism. ☆72 · Updated 6 months ago
- An Apache 2.0 fork of HuggingFace's Large Language Model Text Generation Inference ☆19 · Updated 6 months ago
- Function Vectors in Large Language Models (ICLR 2024) ☆107 · Updated last month
- Official implementation of DPFM @ ICLR 2024 paper "Autonomous Data Selection with Language Models for Mathematical Texts" (Huggingface Da… ☆73 · Updated this week
- This repository provides an original implementation of Detecting Pretraining Data from Large Language Models by *Weijia Shi, *Anirudh Aji… ☆198 · Updated 10 months ago