mrconter1 / human-level-agi-definition
☆30 Updated 11 months ago
Alternatives and similar repositories for human-level-agi-definition
Users that are interested in human-level-agi-definition are comparing it to the libraries listed below
- ☆112 Updated last year
- A collection of lightweight interpretability scripts to understand how LLMs think ☆66 Updated this week
- run paligemma in real time ☆133 Updated last year
- GRDN.AI app for garden optimization ☆70 Updated last year
- ☆89 Updated last year
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace. ☆33 Updated last year
- Editor with LLM generation tree exploration ☆79 Updated 9 months ago
- alternative way of calculating self-attention ☆18 Updated last year
- Lightweight Llama 3 8B Inference Engine in CUDA C ☆52 Updated 8 months ago
- look how they massacred my boy ☆63 Updated last year
- A simple MLX implementation for pretraining LLMs on Apple Silicon. ☆84 Updated 3 months ago
- PyTorch script hot swap: Change code without unloading your LLM from VRAM ☆124 Updated 7 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆58 Updated last month
- ScalarLM - a unified training and inference stack ☆93 Updated this week
- explore token trajectory trees on instruct and base models ☆148 Updated 5 months ago
- This repository contains a simple llama3 implementation in pure JAX. ☆70 Updated 9 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation) ☆62 Updated last year
- ☆38 Updated 8 months ago
- Lego for GRPO ☆30 Updated 5 months ago
- 🤖 Complete reproduction of 'AlphaGo Moment for Model Architecture Discovery' using MLX-LM instead of GPT-4. Autonomous neural architectu… ☆25 Updated 3 months ago
- Official CLI and Python SDK for Prime Intellect - access GPU compute, remote sandboxes, RL environments, and distributed training infrast… ☆110 Updated this week
- Transformer GPU VRAM estimator ☆66 Updated last year
- The original BabyAGI, updated with LiteLLM and no vector database reliance (csv instead) ☆21 Updated last year
- lossily compress representation vectors using product quantization ☆59 Updated 3 weeks ago
- Mistral7B playing DOOM ☆138 Updated last year
- Lightweight toolkit package to train and fine-tune 1.58bit Language models ☆98 Updated 6 months ago
- Cerule - A Tiny Mighty Vision Model ☆67 Updated 2 weeks ago
- Tensor-Slayer: Manipulate weights and tensors of LLMs to achieve performance upgrades and introduce a novel inferenceless mechanistic in… ☆26 Updated 5 months ago
- A character-level language diffusion model trained on Tiny Shakespeare ☆565 Updated last week
- Aana SDK is a powerful framework for building AI enabled multimodal applications. ☆53 Updated 3 months ago