marcelbinz / Llama-3.1-Centaur-70BLinks
☆187Updated 5 months ago
Alternatives and similar repositories for Llama-3.1-Centaur-70B
Users that are interested in Llama-3.1-Centaur-70B are comparing it to the libraries listed below
Sorting:
- ☆276Updated 8 months ago
- ☆36Updated last year
- ☆44Updated last year
- Largest, cross-domain data set of human behavior.☆85Updated 5 months ago
- Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning☆235Updated 9 months ago
- large population models☆547Updated 3 weeks ago
- This is the repository for brain state prediction using fMRI data and transformer.☆81Updated last year
- CodeScientist: An automated scientific discovery system for code-based experiments☆304Updated 3 weeks ago
- A virtual environment for developing and evaluating automated scientific discovery agents.☆193Updated 9 months ago
- 🧬 The Huxley-Gödel Machine☆314Updated 3 weeks ago
- Source code for <Large language models surpass human experts in predicting neuroscience results>☆82Updated last year
- ☆146Updated last year
- Automated Research Assistant☆70Updated 3 weeks ago
- Public repository containing METR's DVC pipeline for eval data analysis☆144Updated 8 months ago
- Analysis code for Neurips 2025 paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆55Updated 4 months ago
- Source code for the collaborative reasoner research project at Meta FAIR.☆111Updated 8 months ago
- Code for "Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs"☆85Updated 9 months ago
- Specification for creating reliable LLM-based conversational agents☆64Updated last month
- Training and evaluating encoding models to predict fMRI brain responses to naturalistic video stimuli☆292Updated 2 months ago
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning☆87Updated 2 weeks ago
- ☆473Updated last year
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results☆211Updated 3 months ago
- Code for ExploreTom☆89Updated 5 months ago
- Open source interpretability artefacts for R1.☆165Updated 7 months ago
- An agent orchestration framework for economic agents☆109Updated 4 months ago
- Automated Hypothesis Testing with Agentic Sequential Falsifications☆237Updated 7 months ago
- Tutor CoPilot: A Human-AI Approach for Scaling Real-Time Expertise☆32Updated last year
- ☆79Updated 2 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆124Updated 10 months ago
- MIRIAD is a million-scale Medical Instruction and Retrieval Datatset☆135Updated 3 weeks ago