dnakov / llm-asi-archLinks
🤖 Complete reproduction of 'AlphaGo Moment for Model Architecture Discovery' using MLX-LM instead of GPT-4. Autonomous neural architecture discovery with local LLM inference on Apple Silicon.
☆25Updated 2 months ago
Alternatives and similar repositories for llm-asi-arch
Users that are interested in llm-asi-arch are comparing it to the libraries listed below
Sorting:
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆62Updated 11 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆83Updated last month
- look how they massacred my boy☆63Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆55Updated 8 months ago
- A tree-based prefix cache library that allows rapid creation of looms: hierarchal branching pathways of LLM generations.☆72Updated 8 months ago
- A pure MLX-based training pipeline for fine-tuning LLMs using GRPO on Apple Silicon.☆44Updated 8 months ago
- ☆36Updated 8 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 8 months ago
- An AI character interaction system with emotional modeling and advanced memory management☆16Updated 11 months ago
- ☆68Updated 4 months ago
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆69Updated last year
- ☆89Updated 9 months ago
- Modify Entropy Based Sampling to work with Mac Silicon via MLX☆49Updated 11 months ago
- Very minimal (and stateless) agent framework☆45Updated 9 months ago
- Plotting (entropy, varentropy) for small LMs☆98Updated 4 months ago
- Verbosity control for AI agents☆65Updated last year
- ☆22Updated 4 months ago
- OpenAI GPT hosted Agent Framework for Windows and MacOS☆36Updated last year
- An intelligent code optimization system leveraging AI analysis, automated refactoring, and test generation. Built with DSPy and Gradio, i…☆20Updated 8 months ago
- ☆62Updated 3 months ago
- entropix style sampling + GUI☆27Updated 11 months ago
- A Python library to orchestrate LLMs in a neural network-inspired structure☆50Updated last year
- ☆45Updated last month
- ☆15Updated 2 months ago
- Simple GRPO scripts and configurations.☆59Updated 8 months ago
- ☆104Updated 4 months ago
- OpenPipe Reinforcement Learning Experiments☆31Updated 7 months ago
- Lego for GRPO☆30Updated 4 months ago
- Train your own SOTA deductive reasoning model☆108Updated 7 months ago
- Example implementation of Iteration of Tought - Gives a star if you like the project☆41Updated 9 months ago