We view Large Language Models as stochastic language layers in a network, where the learnable parameters are the natural language prompts at each layer. We stack two such layers, feeding the output of one layer to the next. We call the stacked architecture a Deep Language Network - DLN
☆95Jul 25, 2024Updated last year
Alternatives and similar repositories for deep-language-networks
Users that are interested in deep-language-networks are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR'24 Spotlight] DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer☆47May 30, 2024Updated last year
- MLOps Model Factory is an end to end workflow that supports generating multiple models and used for deployment to any target.☆10May 9, 2024Updated last year
- ☆12Aug 30, 2021Updated 4 years ago
- EMNLP'2022: BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation☆41Oct 19, 2022Updated 3 years ago
- Official code for FAccT'21 paper "Fairness Through Robustness: Investigating Robustness Disparity in Deep Learning" https://arxiv.org/abs…☆13Mar 9, 2021Updated 5 years ago
- Building modular LMs with parameter-efficient fine-tuning.☆115Jan 18, 2026Updated 2 months ago
- ☆51Jul 7, 2025Updated 8 months ago
- Template-DQN and DRRN agent implementations☆22Jun 12, 2023Updated 2 years ago
- A Data Source for Reasoning Embodied Agents☆19Sep 18, 2023Updated 2 years ago
- ☆18Oct 12, 2022Updated 3 years ago
- [ICLR 2025] Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching☆20Apr 21, 2025Updated 11 months ago
- NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning Tasks☆20May 10, 2022Updated 3 years ago
- [ICML 2024 Spotlight] Differentially Private Synthetic Data via Foundation Model APIs 2: Text☆57Jan 11, 2025Updated last year
- Project for SNARE benchmark☆11Jun 5, 2024Updated last year
- source code for ICLR'24 paper "How does unlabeled data provably help OOD detection?"☆13Feb 1, 2024Updated 2 years ago
- ☆30Jun 19, 2023Updated 2 years ago
- Accompanying repo for the RLPrompt paper☆361Jun 6, 2024Updated last year
- This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"☆27Mar 30, 2023Updated 2 years ago
- ☆46Apr 10, 2023Updated 2 years ago
- Counterfactual Evaluation and Learning for Interactive Systems: Foundations, Implementations, and Recent Advances☆12Aug 14, 2022Updated 3 years ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Aug 25, 2023Updated 2 years ago
- awesome-LLM-controlled-constrained-generation☆55Aug 16, 2024Updated last year
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper☆59Oct 29, 2023Updated 2 years ago
- Data and info for the paper "ParaDetox: Text Detoxification with Parallel Data"☆33Apr 2, 2025Updated 11 months ago
- [ICML 2023] "Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?" by Ruisi Cai, Zhenyu Zhang, Zhangyang Wang☆16May 4, 2023Updated 2 years ago
- Graph diagram component for use in Azure Data Studio and mssql for VS Code tools☆16Updated this week
- Boosting Natural Language Generation from Instructions with Meta-Learning☆11Dec 20, 2022Updated 3 years ago
- A learning environment for man-made Interactive Fiction games.☆322Nov 11, 2025Updated 4 months ago
- Platform to run interactive Reinforcement Learning agents in a Minecraft Server☆57Feb 20, 2026Updated last month
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy"☆102Sep 11, 2024Updated last year
- Simple and extensible hypergradient for PyTorch☆18Feb 23, 2023Updated 3 years ago
- Python API for interactive fiction games☆35May 16, 2022Updated 3 years ago
- Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023.☆64Nov 27, 2024Updated last year
- ☆31Feb 26, 2026Updated 3 weeks ago
- Mixed integer programming for computing lipschitz constants of ReLU Networks☆17Feb 10, 2023Updated 3 years ago
- Ask Me Anything language model prompting☆547Jul 5, 2023Updated 2 years ago
- ☆38Jul 17, 2024Updated last year
- Code for ACL 2023 paper "BOLT: Fast Energy-based Controlled Text Generation with Tunable Biases".☆22Sep 7, 2023Updated 2 years ago
- Code for EMNLP2021 paper “Transductive Learning for Unsupervised Text Style Transfer”☆12Sep 19, 2021Updated 4 years ago