cloudygoose / MiniAgents
The MiniAgents visualization tool for simulacra.
☆13Updated 7 months ago
Related projects
Alternatives and complementary repositories for MiniAgents
- ☆22Updated last year
- Safety-J: Evaluating Safety with Critique☆13Updated 3 months ago
- ☆29Updated 6 months ago
- Baseline implementation for the course project of Computational Linguistics, fall semester of the 2022-23 academic year☆37Updated last year
- Code and data for "ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM" (NeurIPS 2024 Track Datasets and…☆28Updated last month
- ☆65Updated 6 months ago
- Information on NLP PhD applications worldwide.☆35Updated 2 months ago
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions☆102Updated 2 months ago
- Evaluating Mathematical Reasoning Beyond Accuracy☆37Updated 7 months ago
- ☆10Updated 4 months ago
- [ACL 2024] Unveiling Linguistic Regions in Large Language Models☆25Updated 5 months ago
- BeHonest: Benchmarking Honesty in Large Language Models☆30Updated 3 months ago
- A curated list of awesome resources dedicated to Scaling Laws for LLMs☆63Updated last year
- A Survey on the Honesty of Large Language Models☆46Updated last month
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models☆53Updated 3 months ago
- The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LA…☆28Updated 4 months ago
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨☆103Updated 6 months ago
- Feeling confused about super alignment? Here is a reading list☆43Updated 10 months ago
- [NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*☆78Updated last month
- ☆33Updated 9 months ago
- Dive-into-LLMs Tutorial for Beginners☆6Updated 6 months ago
- [NAACL 2024] A Synthetic, Scalable and Systematic Evaluation Suite for Large Language Models☆33Updated 5 months ago
- ☆25Updated last month
- The repo for In-context Autoencoder☆89Updated 6 months ago
- The Paper List on Data Contamination for Large Language Models Evaluation.☆76Updated this week
- [ICML'2024] Can AI Assistants Know What They Don't Know?☆70Updated 9 months ago
- Curation of resources for LLM research, screened by @tongyx361 to ensure high quality and accompanied with elaborately-written concise de…☆37Updated 4 months ago
- Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model☆65Updated 2 years ago
- [ICML'24] TroVE: Inducing Verifiable and Efficient Toolboxes for Solving Programmatic Tasks☆22Updated 2 months ago
- Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".☆58Updated 8 months ago