microsoft / automated-brain-explanations
Generating and validating natural-language explanations for the brain.
☆62 · Updated last week
Alternatives and similar repositories for automated-brain-explanations
Users who are interested in automated-brain-explanations are comparing it to the libraries listed below.
- We view Large Language Models as stochastic language layers in a network, where the learnable parameters are the natural language prompts… ☆95 · Updated last year
- Code and data from the paper 'Human Feedback is not Gold Standard' ☆19 · Updated last year
- [NeurIPS 2023 Main Track] This is the repository for the paper titled "Don’t Stop Pretraining? Make Prompt-based Fine-tuning Powerful Lea… ☆76 · Updated last year
- Minimum Description Length probing for neural network representations ☆20 · Updated last year
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model ☆45 · Updated 3 months ago
- Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents ☆24 · Updated 3 years ago
- The official repo of our research work "Interactive Editing for Text Summarization". ☆23 · Updated 2 years ago
- ☆29 · Updated 3 months ago
- Python package for generating datasets to evaluate reasoning and retrieval of large language models ☆19 · Updated 4 months ago
- SILO Language Models code repository ☆83 · Updated last year
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he… ☆31 · Updated 2 years ago
- 👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions" ☆58 · Updated last year
- Universal Neurons in GPT2 Language Models ☆30 · Updated last year
- PyTorch code for System-1.x: Learning to Balance Fast and Slow Planning with Language Models ☆24 · Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆61 · Updated last year
- Repo for: When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment ☆38 · Updated 2 years ago
- Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible. ☆44 · Updated 2 months ago
- ☆44 · Updated 4 years ago
- ☆23 · Updated last year
- ☆29 · Updated 11 months ago
- Entailment self-training ☆26 · Updated 2 years ago
- ☆11 · Updated 2 years ago
- ☆24 · Updated last month
- Linear Relational Embeddings (LREs) and Linear Relational Concepts (LRCs) for LLMs in PyTorch ☆10 · Updated last year
- ☆17 · Updated 2 years ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format ☆27 · Updated 2 years ago
- Repo for "Smart Word Suggestions" (SWS) task and benchmark ☆20 · Updated 2 years ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models… ☆40 · Updated last year
- This repository includes code for the paper "Does Localization Inform Editing? Surprising Differences in Where Knowledge Is Stored vs. Ca… ☆60 · Updated 2 years ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions. ☆20 · Updated last year