CosineAI / experiments
Open-sourced predictions, execution logs, trajectories, and results from model inference and evaluation runs on the SWE-bench task.
☆15 Updated 8 months ago
Alternatives and similar repositories for experiments
Users who are interested in experiments are comparing it to the repositories listed below.
- ☆14 Updated last month
- ☆20 Updated last week
- The world's first fully automated VC fund. ☆23 Updated 3 weeks ago
- ☆21 Updated 2 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera… ☆33 Updated last week
- ☆21 Updated 6 months ago
- ☆13 Updated last month
- Streamlit app for recommending eval functions using prompt diffs ☆27 Updated last year
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta ☆13 Updated 6 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset ☆16 Updated last year
- OLMost every training recipe you need to perform data interventions with the OLMo family of models. ☆26 Updated this week
- ☆22 Updated last year
- ☆9 Updated 3 weeks ago
- ☆20 Updated 2 months ago
- A repository of projects and datasets under active development by Alignment Lab AI ☆22 Updated last year
- ☆41 Updated 5 months ago
- Everything for the Paper: 'Evoke: Evoking Critical Thinking Abilities in LLMs via Reviewer-Author Prompt Editing' ☆16 Updated last year
- Contains the model patches and the eval logs from the passing swe-bench-lite run. ☆10 Updated 10 months ago
- Repository containing dataset, models and code associated with the CHIME project ☆15 Updated 8 months ago
- ☆18 Updated 7 months ago
- The Swarm Ecosystem ☆20 Updated 9 months ago
- ☆50 Updated 5 months ago
- ☆48 Updated 6 months ago
- ☆24 Updated last year
- ☆16 Updated 2 months ago
- Knowledge Graph Generator app ☆31 Updated last year
- FMS Model Optimizer is a framework for developing reduced precision neural network models. ☆18 Updated this week
- AI_Powered_Dev_Search_Engine ☆12 Updated last year
- Interactive Textbook Demo ☆42 Updated last year
- ☆27 Updated last month