flowersteam / LLM-Culture
Code for the "Cultural evolution in populations of Large Language Models" paper
☆32Updated 6 months ago
Alternatives and similar repositories for LLM-Culture:
Users that are interested in LLM-Culture are comparing it to the libraries listed below
- a benchmark to evaluate the situated inductive reasoning☆15Updated 4 months ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year
- List of papers on Self-Correction of LLMs.☆72Updated 4 months ago
- Official repo for BOOKWORLD: From Novels to Interactive Agent Societies for Story Creation☆30Updated last week
- Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval☆33Updated 6 months ago
- The official implementation of the paper "Read to Play (R2-Play): Decision Transformer with Multimodal Game Instruction".☆34Updated last year
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆41Updated 10 months ago
- ToMATO: Verbalizing the Mental States of Role-Playing LLMs for Benchmarking Theory of Mind (AAAI2025)☆12Updated 3 weeks ago
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"☆37Updated last year
- CodeUltraFeedback: aligning large language models to coding preferences☆71Updated 10 months ago
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆47Updated last year
- Code for Discovering Preference Optimization Algorithms with and for Large Language Models☆61Updated 10 months ago
- ☆27Updated last month
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆17Updated 2 months ago
- implementation of dualformer☆16Updated 2 months ago
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆12Updated 3 weeks ago
- Plug in & Play Pytorch Implementation of the paper: "Evolutionary Optimization of Model Merging Recipes" by Sakana AI☆30Updated 5 months ago
- OMNI: Open-endedness via Models of human Notions of Interestingness☆45Updated 3 months ago
- CycleQD is a framework for parameter space model merging.☆39Updated 3 months ago
- ☆36Updated 7 months ago
- LLM Dynamic Planner - Combining LLM with PDDL Planners to solve an embodied task☆42Updated 4 months ago
- SCREWS: A Modular Framework for Reasoning with Revisions☆27Updated last year
- ARLC, a probabilistic abductive reasoner for solving Raven's progressive matrices.☆18Updated last month
- Triton Implementation of HyperAttention Algorithm☆47Updated last year
- Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024)☆29Updated 9 months ago
- ☆14Updated last year
- Repository for Skill Set Optimization☆12Updated 9 months ago
- Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience"☆60Updated last month
- ☆27Updated 9 months ago
- Explorations into adversarial losses on top of autoregressive loss for language modeling☆35Updated 2 months ago