eliahuhorwitz / Model-Atlas
☆30Updated 5 months ago
Alternatives and similar repositories for Model-Atlas
Users that are interested in Model-Atlas are comparing it to the libraries listed below
- Public repository containing METR's DVC pipeline for eval data analysis☆117Updated 6 months ago
- Transformer GPU VRAM estimator☆66Updated last year
- 👩🤝🤖 A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated)☆24Updated 2 years ago
- SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?☆181Updated last week
- Aana SDK is a powerful framework for building AI enabled multimodal applications.☆52Updated last month
- Pivotal Token Search☆127Updated 2 months ago
- OLMost every training recipe you need to perform data interventions with the OLMo family of models.☆50Updated last week
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆111Updated 6 months ago
- LLM plugin for models hosted by Anyscale Endpoints☆35Updated last year
- ScalarLM - a unified training and inference stack☆85Updated last week
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆33Updated 5 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆96Updated this week
- A preprint version of our recent research on the capability of frontier AI systems to do self-replication☆57Updated 9 months ago
- Example implementation of Iteration of Thought - Give a star if you like the project☆41Updated 9 months ago
- Granite 3.1 Language Models☆127Updated 3 months ago
- This repository contains code to generate and preprocess Learning with Errors (LWE) data and implementations of four LWE attacks uSVP, SA…☆53Updated 4 months ago
- ☆25Updated 2 months ago
- Small, simple agent task environments for training and evaluation☆18Updated 11 months ago
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)☆79Updated 7 months ago
- The Granite Guardian models are designed to detect risks in prompts and responses.☆118Updated this week
- A Python library to orchestrate LLMs in a neural network-inspired structure☆50Updated last year
- An introduction to DSPy☆32Updated last month
- lossily compress representation vectors using product quantization☆59Updated 5 months ago
- Benchmark and optimize LLM inference across frameworks with ease☆117Updated last month
- Run SWE-bench evaluations remotely☆41Updated last month
- Your buddy in the (L)LM space.☆64Updated last year
- ☆55Updated 3 months ago
- ☆61Updated 4 months ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆41Updated last year
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆75Updated 10 months ago