microsoft / deep-language-networks
We view Large Language Models as stochastic language layers in a network, where the learnable parameters are the natural language prompts at each layer. We stack two such layers, feeding the output of one layer to the next. We call the stacked architecture a Deep Language Network (DLN).
☆94Updated last year
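The stacking idea in the description above can be sketched in a few lines of Python. This is a minimal illustration only, not the repository's code: `LanguageLayer`, `TwoLayerDLN`, and `toy_llm` are hypothetical names, and the toy model is a deterministic stand-in for a real (stochastic) LLM call. How the prompts are learned is not shown.

```python
from dataclasses import dataclass
from typing import Callable

# Hypothetical stand-in for an LLM call: full prompt string in, completion out.
LLM = Callable[[str], str]


@dataclass
class LanguageLayer:
    """One language layer: its only parameter is a natural language prompt."""
    prompt: str
    llm: LLM

    def forward(self, text: str) -> str:
        # Prepend the layer's prompt to the input and query the model.
        return self.llm(f"{self.prompt}\n\n{text}")


@dataclass
class TwoLayerDLN:
    """Two stacked layers: the first layer's output is the second layer's input."""
    first: LanguageLayer
    second: LanguageLayer

    def forward(self, text: str) -> str:
        hidden = self.first.forward(text)
        return self.second.forward(hidden)


def toy_llm(full_prompt: str) -> str:
    # Assumption for demonstration: a deterministic fake model that only
    # wraps its input, so the data flow through the layers stays visible.
    instruction, text = full_prompt.split("\n\n", 1)
    return f"<{instruction}: {text}>"


dln = TwoLayerDLN(
    first=LanguageLayer("Think step by step", toy_llm),
    second=LanguageLayer("Give the final answer", toy_llm),
)
print(dln.forward("2+2"))
# prints "<Give the final answer: <Think step by step: 2+2>>"
```

Swapping `toy_llm` for a real model client would make each layer stochastic, which matches the description's framing. The two prompt strings are the parameters a training procedure would update.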
Alternatives and similar repositories for deep-language-networks
Users who are interested in deep-language-networks are comparing it to the libraries listed below.
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"☆107Updated 2 years ago
- SILO Language Models code repository☆82Updated last year
- ☆44Updated 10 months ago
- Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI☆94Updated 2 years ago
- ☆149Updated last year
- RL algorithm: Advantage induced policy alignment☆65Updated 2 years ago
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"☆71Updated last year
- ☆76Updated last year
- A repository for transformer critique learning and generation☆90Updated last year
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆59Updated last year
- ☆31Updated last year
- This repository includes code for the paper "Does Localization Inform Editing? Surprising Differences in Where Knowledge Is Stored vs. Ca…☆61Updated 2 years ago
- Scripts for generating synthetic finetuning data for reducing sycophancy.☆116Updated 2 years ago
- Code accompanying the paper Pretraining Language Models with Human Preferences☆180Updated last year
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆43Updated last week
- Language models scale reliably with over-training and on downstream tasks☆100Updated last year
- ☆54Updated 2 years ago
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location.☆82Updated last year
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation☆49Updated last year
- PASTA: Post-hoc Attention Steering for LLMs☆123Updated 10 months ago
- Self-Alignment with Principle-Following Reward Models☆165Updated 3 weeks ago
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions☆71Updated 2 years ago
- ☆78Updated 6 months ago
- Few-shot Learning with Auxiliary Data☆31Updated last year
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆64Updated 2 years ago
- Scalable Meta-Evaluation of LLMs as Evaluators☆42Updated last year
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models☆47Updated last year
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆48Updated last year
- Aioli: A unified optimization framework for language model data mixing☆27Updated 8 months ago
- Code for paper "LEVER: Learning to Verify Language-to-Code Generation with Execution" (ICML'23)☆90Updated 2 years ago