microsoft / deep-language-networks
We view large language models (LLMs) as stochastic language layers in a network, where the learnable parameters are the natural language prompts at each layer. We stack two such layers, feeding the output of one layer to the next, and call the stacked architecture a Deep Language Network (DLN).
☆91 Updated 3 months ago
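The stacking idea in the description can be illustrated with a minimal sketch. This is not the repository's code: `LanguageLayer`, `dln_forward`, and `echo_llm` are hypothetical names, and a deterministic stub stands in for a real (stochastic) LLM call so the example runs on its own. Prompt training, which is what makes the prompts "learnable", is not shown.

```python
from dataclasses import dataclass
from typing import Callable, List

# Hypothetical stand-in for an LLM call: takes a full prompt, returns generated text.
LLM = Callable[[str], str]


@dataclass
class LanguageLayer:
    """One language layer: its only parameter is a natural language prompt."""
    prompt: str
    llm: LLM

    def forward(self, text: str) -> str:
        # The layer's prompt is prepended to whatever the previous layer produced.
        return self.llm(f"{self.prompt}\n\n{text}")


def dln_forward(layers: List[LanguageLayer], x: str) -> str:
    """Feed the output of each layer to the next, as in a stacked network."""
    for layer in layers:
        x = layer.forward(x)
    return x


def echo_llm(prompt: str) -> str:
    """Deterministic stub (assumption): a real LLM would sample its output."""
    instruction, _, payload = prompt.partition("\n\n")
    return f"[{instruction}] {payload}"


# Two stacked layers, matching the two-layer setup in the description.
layer1 = LanguageLayer("Summarize the input.", echo_llm)
layer2 = LanguageLayer("Classify the summary.", echo_llm)
output = dln_forward([layer1, layer2], "some input text")
print(output)
```

In this framing, "training" would mean searching over the `prompt` strings of each layer rather than over numeric weights; swapping `echo_llm` for a real model client is left as an assumption.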
Related projects
Alternatives and complementary repositories for deep-language-networks
- SILO Language Models code repository ☆80 Updated 8 months ago
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions" ☆61 Updated 4 months ago
- ☆71 Updated 6 months ago
- datasets from the paper "Towards Understanding Sycophancy in Language Models" ☆62 Updated last year
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs ☆48 Updated 7 months ago
- Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging ☆96 Updated last year
- ☆23 Updated 3 months ago
- A repository for transformer critique learning and generation ☆85 Updated 11 months ago
- Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI ☆91 Updated last year
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions ☆68 Updated last year
- ☆46 Updated last month
- Self-Alignment with Principle-Following Reward Models ☆148 Updated 8 months ago
- Building modular LMs with parameter-efficient fine-tuning. ☆80 Updated this week
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation ☆43 Updated 10 months ago
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location. ☆72 Updated 3 months ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model ☆41 Updated 9 months ago
- Code for paper "LEVER: Learning to Verify Language-to-Code Generation with Execution" (ICML'23) ☆79 Updated last year
- AI Logging for Interpretability and Explainability🔬 ☆87 Updated 5 months ago
- ☆44 Updated last year
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models" ☆109 Updated last year
- RL algorithm: Advantage induced policy alignment ☆62 Updated last year
- Codebase for Inference-Time Policy Adapters ☆21 Updated last year
- Inspecting and Editing Knowledge Representations in Language Models ☆107 Updated last year
- Scalable Meta-Evaluation of LLMs as Evaluators ☆41 Updated 8 months ago
- Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning". ☆156 Updated 6 months ago
- [NeurIPS 2023] Learning Transformer Programs ☆157 Updated 5 months ago
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl… ☆56 Updated 2 months ago
- ☆28 Updated 7 months ago
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning ☆97 Updated last year
- Dataset collection and preprocessing framework for NLP extreme multitask learning ☆147 Updated 3 months ago