tairov / QStarLearning.mojo
☆112 Updated 2 years ago
Alternatives and similar repositories for QStarLearning.mojo
Users interested in QStarLearning.mojo are comparing it to the libraries listed below.
- run paligemma in real time ☆133 Updated last year
- Cerule - A Tiny Mighty Vision Model ☆68 Updated 3 months ago
- Port of Andrej Karpathy's nanoGPT to Apple MLX framework. ☆118 Updated last year
- inference code for mixtral-8x7b-32kseqlen ☆105 Updated 2 years ago
- ☆125 Updated last year
- Full finetuning of large language models without large memory requirements ☆94 Updated 4 months ago
- smolLM with Entropix sampler on pytorch ☆149 Updated last year
- ☆45 Updated 2 years ago
- ☆90 Updated last year
- ☆119 Updated last year
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user… ☆183 Updated 3 months ago
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊 ☆136 Updated last week
- MLX port for xjdr's entropix sampler (mimics jax implementation) ☆61 Updated last year
- Maybe the new state of the art vision model? we'll see 🤷‍♂️ ☆171 Updated 2 years ago
- A strongly typed Python DSL for developing message passing multi agent systems ☆53 Updated last year
- A simple MLX implementation for pretraining LLMs on Apple Silicon. ☆85 Updated 5 months ago
- look how they massacred my boy ☆63 Updated last year
- Command-line script for inferencing from models such as falcon-7b-instruct ☆75 Updated 2 years ago
- Ongoing research training transformer models at scale ☆38 Updated 2 years ago
- Scripts to create your own moe models using mlx ☆90 Updated last year
- Drop in replacement for OpenAI, but with Open models. ☆154 Updated 2 years ago
- an implementation of Self-Extend, to expand the context window via grouped attention ☆119 Updated 2 years ago
- Command-line script for inferencing from models such as MPT-7B-Chat ☆100 Updated 2 years ago
- ☆22 Updated 2 years ago
- Plotting (entropy, varentropy) for small LMs ☆99 Updated 8 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free ☆232 Updated last year
- GRDN.AI app for garden optimization ☆69 Updated 2 months ago
- ☆135 Updated 2 years ago
- Simplex Random Feature attention, in PyTorch ☆75 Updated 2 years ago
- run embeddings in MLX ☆97 Updated last year