Weixin-Liang / Mapping-the-Increasing-Use-of-LLMs-in-Scientific-Papers
☆30Updated 4 months ago
Related projects: ⓘ
- The Prism Alignment Project☆32Updated 4 months ago
- Data and code for the Corr2Cause paper (ICLR 2024)☆79Updated 5 months ago
- CausalGym: Benchmarking causal interpretability methods on linguistic tasks☆28Updated 6 months ago
- Codebase accompanying the Summary of a Haystack paper.☆65Updated 2 months ago
- Code/data for MARG (multi-agent review generation)☆24Updated 4 months ago
- datasets from the paper "Towards Understanding Sycophancy in Language Models"☆59Updated 10 months ago
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆54Updated last month
- ☆27Updated last year
- ☆45Updated 7 months ago
- ☆92Updated 4 months ago
- ☆16Updated last year
- Scalable Meta-Evaluation of LLMs as Evaluators☆39Updated 7 months ago
- ☆31Updated last year
- ☆56Updated 9 months ago
- A Retrieval Benchmark for Scientific Literature Search☆53Updated 2 months ago
- ReBase: Training Task Experts through Retrieval Based Distillation☆27Updated 2 months ago
- Repository for the ACL 2024 conference website☆17Updated last month
- Repository for paper Tools Are Instrumental for Language Agents in Complex Environments☆32Updated 8 months ago
- ☆17Updated 2 months ago
- We develop benchmarks and analysis tools to evaluate the causal reasoning abilities of LLMs.☆87Updated 3 months ago
- ☆31Updated 3 months ago
- Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data☆28Updated last month
- Finding semantically meaningful and accurate prompts.☆45Updated 10 months ago
- Describing changes in LLM research trends in 2023. https://arxiv.org/abs/2307.10700☆15Updated 7 months ago
- ☆32Updated last month
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆29Updated 7 months ago
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)☆32Updated this week
- Data and code for the preprint "In-Context Learning with Long-Context Models: An In-Depth Exploration"☆25Updated last month
- Repo for: When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment☆37Updated last year
- ☆12Updated 6 months ago