epoch-research / data-stockLinks
Models for data stocks and training dataset sizes
☆18Updated last year
Alternatives and similar repositories for data-stock
Users that are interested in data-stock are comparing it to the libraries listed below
Sorting:
- ☆19Updated last month
- The application is a end-user training and evaluation system for standard knowledge graph embedding models. It was developed to optimise …☆18Updated 3 months ago
- Code for Benchmarking Language Model Agents for Data-Driven Science☆31Updated 11 months ago
- Code for "Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs"☆55Updated 7 months ago
- OLMost every training recipe you need to perform data interventions with the OLMo family of models.☆48Updated this week
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆94Updated this week
- Analysis code for paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆52Updated last month
- ☆41Updated last year
- Public repository containing METR's DVC pipeline for eval data analysis☆110Updated 5 months ago
- Official repository of the 2025 paper, LLM Economist: Large Population Models and Mechanism Design in Multi-Agent Generative Simulacra.☆40Updated 2 months ago
- ☆97Updated last month
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆36Updated last year
- [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery☆103Updated last month
- ☆53Updated last year
- Discovering Data-driven Hypotheses in the Wild☆111Updated 3 months ago
- Learning to route instances for Human vs AI Feedback (ACL Main '25)☆24Updated 2 months ago
- Minimum Description Length probing for neural network representations☆18Updated 8 months ago
- 🤝 The code for "Can Large Language Model Agents Simulate Human Trust Behaviors?"☆99Updated 5 months ago
- Open Source Replication of Anthropic's Alignment Faking Paper☆50Updated 5 months ago
- Code for the paper "What's the Magic Word? A Control Theory of LLM Prompting"☆110Updated last year
- ☆174Updated 2 months ago
- Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience"☆61Updated 5 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆20Updated last year
- 🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks!☆54Updated 2 months ago
- Self-Taught Optimizer (STOP): Recursively Self-Improving Code Generation☆46Updated last year
- ☆81Updated this week
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.☆15Updated last year
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆41Updated last year
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆75Updated 9 months ago
- A Python Library for Learning Non-Euclidean Representations☆63Updated last month