epoch-research / data-stockLinks
Models for data stocks and training dataset sizes
☆18Updated last year
Alternatives and similar repositories for data-stock
Users that are interested in data-stock are comparing it to the libraries listed below
Sorting:
- Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience"☆63Updated 9 months ago
- ☆20Updated last month
- Landing page + leaderboard for SWE-Bench benchmark☆10Updated this week
- Fluid Language Model Benchmarking☆26Updated 4 months ago
- The application is a end-user training and evaluation system for standard knowledge graph embedding models. It was developed to optimise …☆18Updated 8 months ago
- Self-Taught Optimizer (STOP): Recursively Self-Improving Code Generation☆49Updated 2 years ago
- Analysis code for Neurips 2025 paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆56Updated 5 months ago
- Forecasting high-impact research topics via machine learning on evolving knowledge graphs☆47Updated 2 months ago
- OLMost every training recipe you need to perform data interventions with the OLMo family of models.☆64Updated last week
- ☆105Updated 6 months ago
- Public repository containing METR's DVC pipeline for eval data analysis☆186Updated last week
- Code for "Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs"☆87Updated 11 months ago
- Open Source Replication of Anthropic's Alignment Faking Paper☆54Updated 9 months ago
- ☆55Updated last year
- ☆186Updated last week
- Download, parse, and filter data from Phil Papers. Data-ready for The-Pile.☆19Updated 2 years ago
- Functional Benchmarks and the Reasoning Gap☆89Updated last year
- ☆25Updated 2 months ago
- A virtual environment for developing and evaluating automated scientific discovery agents.☆199Updated 10 months ago
- Official repository of the 2025 paper, LLM Economist: Large Population Models and Mechanism Design in Multi-Agent Generative Simulacra.☆63Updated last week
- ☆91Updated last month
- A collection of lightweight interpretability scripts to understand how LLMs think☆89Updated last week
- ☆52Updated 10 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated last year
- ☆94Updated last week
- ☆62Updated 2 years ago
- Learning to route instances for Human vs AI Feedback (ACL Main '25)☆26Updated 6 months ago
- Minimum Description Length probing for neural network representations☆20Updated last year
- Discovering Data-driven Hypotheses in the Wild☆127Updated 7 months ago
- Simple repository for training small reasoning models☆48Updated 11 months ago