We view Large Language Models as stochastic language layers in a network, where the learnable parameters are the natural language prompts at each layer. We stack two such layers, feeding the output of one layer to the next. We call the stacked architecture a Deep Language Network - DLN
☆95Jul 25, 2024Updated last year
Alternatives and similar repositories for deep-language-networks
Users that are interested in deep-language-networks are comparing it to the libraries listed below
Sorting:
- MLOps Model Factory is an end to end workflow that supports generating multiple models and used for deployment to any target.☆10May 9, 2024Updated last year
- Byte-sized text games for code generation tasks on virtual environments☆20Jul 8, 2024Updated last year
- [ICLR'24 Spotlight] DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer☆46May 30, 2024Updated last year
- A Data Source for Reasoning Embodied Agents☆19Sep 18, 2023Updated 2 years ago
- NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning Tasks☆20May 10, 2022Updated 3 years ago
- ☆24Apr 17, 2024Updated last year
- A framework for human-readable prompt-based method with large language models. Specially designed for researchers. (Deprecated, check out…☆131Feb 25, 2023Updated 3 years ago
- ☆30Jun 19, 2023Updated 2 years ago
- This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"☆27Mar 30, 2023Updated 2 years ago
- EMNLP'2022: BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation☆41Oct 19, 2022Updated 3 years ago
- The goal of this project was to develop a chat-bot based data collection tool. It asks users questions through a validated alignment surv…☆13Feb 20, 2026Updated last week
- CliniDeID automatically de-identifies clinical text notes according to the HIPAA Safe Harbor method. It accurately finds identifiers and …☆10Aug 13, 2023Updated 2 years ago
- Prompt-Guided Retrieval For Non-Knowledge-Intensive Tasks☆12Sep 1, 2023Updated 2 years ago
- Project for SNARE benchmark☆11Jun 5, 2024Updated last year
- Boosting Natural Language Generation from Instructions with Meta-Learning☆11Dec 20, 2022Updated 3 years ago
- Graph diagram component for use in Azure Data Studio and mssql for VS Code tools☆14Feb 14, 2026Updated 2 weeks ago
- Collections of Actions for Custom GPTs (some created by Captain Action)☆11Jan 7, 2024Updated 2 years ago
- ☆11Oct 2, 2023Updated 2 years ago
- ☆10Feb 6, 2025Updated last year
- Official code for FAccT'21 paper "Fairness Through Robustness: Investigating Robustness Disparity in Deep Learning" https://arxiv.org/abs…☆13Mar 9, 2021Updated 4 years ago
- ☆12Aug 30, 2021Updated 4 years ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Aug 25, 2023Updated 2 years ago
- [ICML 2023] "Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?" by Ruisi Cai, Zhenyu Zhang, Zhangyang Wang☆16May 4, 2023Updated 2 years ago
- Collecting news articles for all the companies in the R1000, for a pre-defined set of news outlets, using Diffbot's Knowledge Graph☆13Jan 26, 2023Updated 3 years ago
- 🏠🔍 Auto check for new apartments in Hamburg from various real estate provides☆16Jun 2, 2024Updated last year
- [ICML 2024 Spotlight] Differentially Private Synthetic Data via Foundation Model APIs 2: Text☆55Jan 11, 2025Updated last year
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper☆59Oct 29, 2023Updated 2 years ago
- Accompanying repo for the RLPrompt paper☆361Jun 6, 2024Updated last year
- Code for ACL2024 paper - Adversarial Preference Optimization (APO).☆56Jun 3, 2024Updated last year
- Add ability to interrupt own message☆14Apr 21, 2024Updated last year
- Extension to connect OpenPAI clusters, submit AI jobs, simulate jobs locally, manage files, and so on.☆15Dec 10, 2022Updated 3 years ago
- ☆18Oct 12, 2022Updated 3 years ago
- ☆22Dec 8, 2025Updated 2 months ago
- Code and Data for: Reading Between the Lines: Modeling User Behavior and Costs in AI-Assisted Programming☆33Feb 23, 2024Updated 2 years ago
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se…☆65Feb 13, 2023Updated 3 years ago
- Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023.☆64Nov 27, 2024Updated last year
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy"☆102Sep 11, 2024Updated last year
- Code Repository for "A Causal Framework to Quantify the Robustness of Mathematical Reasoning with Language Models".☆15Oct 14, 2022Updated 3 years ago
- Pytorch implementations of Co-teaching for noisy label learning☆13Jun 28, 2022Updated 3 years ago