microsoft / deep-language-networks
We view Large Language Models as stochastic language layers in a network, where the learnable parameters are the natural language prompts at each layer. We stack two such layers, feeding the output of one layer to the next. We call the stacked architecture a Deep Language Network - DLN
☆94Updated 9 months ago
Alternatives and similar repositories for deep-language-networks
Users that are interested in deep-language-networks are comparing it to the libraries listed below
Sorting:
- SILO Language Models code repository☆81Updated last year
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"☆70Updated 10 months ago
- A unified benchmark for math reasoning☆88Updated 2 years ago
- Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI☆94Updated 2 years ago
- A framework for few-shot evaluation of autoregressive language models.☆103Updated 2 years ago
- Building modular LMs with parameter-efficient fine-tuning.☆104Updated last week
- Scalable Meta-Evaluation of LLMs as Evaluators☆42Updated last year
- A repository for transformer critique learning and generation☆90Updated last year
- ☆60Updated last year
- ☆46Updated last year
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆47Updated last year
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"☆108Updated last year
- Inspecting and Editing Knowledge Representations in Language Models☆116Updated last year
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs☆84Updated 6 months ago
- ☆72Updated last year
- PASTA: Post-hoc Attention Steering for LLMs☆117Updated 5 months ago
- This repository includes code for the paper "Does Localization Inform Editing? Surprising Differences in Where Knowledge Is Stored vs. Ca…☆61Updated 2 years ago
- the instructions and demonstrations for building a formal logical reasoning capable GLM☆53Updated 8 months ago
- Code repository for the c-BTM paper☆106Updated last year
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆54Updated last year
- Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge"☆76Updated 2 years ago
- ☆38Updated last year
- ☆82Updated 9 months ago
- ☆28Updated 10 months ago
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions☆69Updated 2 years ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Updated last year
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]☆30Updated 3 months ago
- Self-Alignment with Principle-Following Reward Models☆161Updated last week
- ☆120Updated 7 months ago
- ☆45Updated last year