WhitzardIndex / self-replication-research
A preprint version of our recent research on the capability of frontier AI systems to do self-replication
☆59Updated 4 months ago
Alternatives and similar repositories for self-replication-research
Users that are interested in self-replication-research are comparing it to the libraries listed below
Sorting:
- Public Goods Game (PGG) Benchmark: Contribute & Punish is a multi-agent benchmark that tests cooperative and self-interested strategies a…☆36Updated last month
- ☆12Updated last week
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆90Updated 3 months ago
- ☆97Updated 7 months ago
- A tree-based prefix cache library that allows rapid creation of looms: hierarchal branching pathways of LLM generations.☆68Updated 3 months ago
- Open collaboration infrastructure that enables communication, coordination, trust and payments for The Internet of Agents.☆57Updated this week
- ☆72Updated last week
- look how they massacred my boy☆63Updated 7 months ago
- ☆28Updated 5 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆33Updated last week
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆75Updated 2 weeks ago
- Modify Entropy Based Sampling to work with Mac Silicon via MLX☆50Updated 6 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆64Updated 6 months ago
- The original BabyAGI, updated with LiteLLM and no vector database reliance (csv instead)☆21Updated 7 months ago
- Keeping my personal experiments separate from the main repo☆65Updated 3 months ago
- Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …☆49Updated 3 months ago
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.☆26Updated 10 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆65Updated last month
- Thematic Generalization Benchmark: measures how effectively various LLMs can infer a narrow or specific "theme" (category/rule) from a sm…☆51Updated last week
- A simple experiment on letting two local LLM have a conversation about anything!☆109Updated 10 months ago
- LLM Divergent Thinking Creativity Benchmark. LLMs generate 25 unique words that start with a given letter with no connections to each oth…☆32Updated last month
- entropix style sampling + GUI☆26Updated 6 months ago
- The next evolution of Agents☆48Updated 3 weeks ago
- ☆65Updated 2 months ago
- OpenPipe Reinforcement Learning Experiments☆24Updated 2 months ago
- Automated Capability Discovery via Foundation Model Self-Exploration☆47Updated 3 months ago
- Simple examples using Argilla tools to build AI☆52Updated 5 months ago
- Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode…☆42Updated last month
- Entropy Based Sampling and Parallel CoT Decoding☆17Updated 7 months ago
- ☆77Updated 6 months ago