microsoft / deep-language-networks
We view Large Language Models as stochastic language layers in a network, where the learnable parameters are the natural language prompts at each layer. We stack two such layers, feeding the output of one layer to the next. We call the stacked architecture a Deep Language Network - DLN
☆93Updated 8 months ago
Alternatives and similar repositories for deep-language-networks:
Users that are interested in deep-language-networks are comparing it to the libraries listed below
- SILO Language Models code repository☆81Updated last year
- RL algorithm: Advantage induced policy alignment☆65Updated last year
- ☆26Updated 8 months ago
- ☆73Updated 11 months ago
- Building modular LMs with parameter-efficient fine-tuning.☆98Updated last week
- This repository includes code for the paper "Does Localization Inform Editing? Surprising Differences in Where Knowledge Is Stored vs. Ca…☆59Updated last year
- ☆38Updated 5 months ago
- Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge"☆77Updated last year
- Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI☆94Updated 2 years ago
- ☆159Updated 2 years ago
- Scalable Meta-Evaluation of LLMs as Evaluators☆42Updated last year
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation☆47Updated last year
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"☆108Updated last year
- ☆44Updated 4 months ago
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆46Updated last year
- PyTorch code for the RetoMaton paper: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022)☆71Updated 2 years ago
- [AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following☆79Updated 6 months ago
- A repository for transformer critique learning and generation☆89Updated last year
- ☆46Updated last year
- ☆38Updated 11 months ago
- A unified benchmark for math reasoning☆87Updated 2 years ago
- Reproduction of "RLCD Reinforcement Learning from Contrast Distillation for Language Model Alignment☆67Updated last year
- the instructions and demonstrations for building a formal logical reasoning capable GLM☆53Updated 7 months ago
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆52Updated last year
- Code release for Dataless Knowledge Fusion by Merging Weights of Language Models (https://openreview.net/forum?id=FCnohuR6AnM)☆87Updated last year
- Self-Alignment with Principle-Following Reward Models☆156Updated last year
- ☆116Updated 8 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Updated last year
- ☆34Updated last year
- Code for our paper: "GrIPS: Gradient-free, Edit-based Instruction Search for Prompting Large Language Models"☆53Updated last year