eliahuhorwitz / Model-AtlasLinks
☆34Updated 9 months ago
Alternatives and similar repositories for Model-Atlas
Users that are interested in Model-Atlas are comparing it to the libraries listed below
Sorting:
- Public repository containing METR's DVC pipeline for eval data analysis☆199Updated last week
- Aana SDK is a powerful framework for building AI enabled multimodal applications.☆56Updated 5 months ago
- ☆21Updated 2 weeks ago
- Digital Red Queen: Adversarial Program Evolution in Core War with LLMs☆176Updated 3 weeks ago
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you…☆37Updated last year
- Data generation and training repository for SERA: Soft-Verified Efficient Repository Agents.☆117Updated last week
- 👩🤝🤖 A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated)☆24Updated 2 years ago
- SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?☆259Updated last month
- Example implementation of Iteration of Tought - Gives a star if you like the project☆41Updated last year
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated last year
- ScalarLM - a unified training and inference stack☆97Updated 2 months ago
- lossily compress representation vectors using product quantization☆59Updated 3 months ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆43Updated 2 years ago
- Transformer GPU VRAM estimator☆68Updated last year
- LLM plugin for models hosted by Anyscale Endpoints☆35Updated last year
- A Python library to orchestrate LLMs in a neural network-inspired structure☆52Updated last year
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆34Updated 9 months ago
- Lego for GRPO☆30Updated 8 months ago
- Pivotal Token Search☆145Updated last month
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper.☆58Updated 11 months ago
- Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience"☆62Updated 10 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆92Updated last year
- A preprint version of our recent research on the capability of frontier AI systems to do self-replication☆59Updated last year
- Granite 3.1 Language Models☆137Updated 7 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆115Updated 10 months ago
- Agent fixing SWE bench issues☆19Updated last year
- OLMost every training recipe you need to perform data interventions with the OLMo family of models.☆64Updated last week
- Train, tune, and infer Bamba model☆137Updated 8 months ago
- A novel approach for transformer model introspection that enables saving, compressing, and manipulating internal thought states for advan…☆29Updated 10 months ago
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆61Updated 9 months ago