dnakov / llm-asi-archLinks
🤖 Complete reproduction of 'AlphaGo Moment for Model Architecture Discovery' using MLX-LM instead of GPT-4. Autonomous neural architecture discovery with local LLM inference on Apple Silicon.
☆26Updated 4 months ago
Alternatives and similar repositories for llm-asi-arch
Users that are interested in llm-asi-arch are comparing it to the libraries listed below
Sorting:
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆62Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Updated 2 months ago
- look how they massacred my boy☆63Updated last year
- ☆68Updated 6 months ago
- An AI character interaction system with emotional modeling and advanced memory management☆17Updated last year
- A tree-based prefix cache library that allows rapid creation of looms: hierarchal branching pathways of LLM generations.☆77Updated 10 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 10 months ago
- ☆90Updated 11 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆84Updated 4 months ago
- A mcp server that uses the Osmosis-Apply-1.7B model to apply code merges☆53Updated 5 months ago
- Deep research agents using MiniMax-M2 interleaved thinking☆143Updated 3 weeks ago
- alternative way to calculating self attention☆18Updated last year
- Very minimal (and stateless) agent framework☆44Updated 11 months ago
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.☆35Updated last year
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆72Updated last month
- The State Of The Art, intelligence☆157Updated 4 months ago
- OpenAI GPT hosted Agent Framework for Windows and MacOS☆36Updated last year
- ☆62Updated 5 months ago
- Modify Entropy Based Sampling to work with Mac Silicon via MLX☆49Updated last year
- ☆17Updated 10 months ago
- Simple GRPO scripts and configurations.☆59Updated 10 months ago
- Simple Graph Memory for AI applications☆89Updated 7 months ago
- Really quick-and-dirty example of AI recursive learning☆30Updated last year
- Marketplace ML experiment - training without backprop☆27Updated 3 months ago
- tiny_fnc_engine is a minimal python library that provides a flexible engine for calling functions extracted from a LLM.☆38Updated last year
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.☆99Updated 5 months ago
- Plotting (entropy, varentropy) for small LMs☆99Updated 7 months ago
- ☆107Updated last month
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you…☆37Updated last year
- Train your own SOTA deductive reasoning model☆107Updated 9 months ago