awni / picochatLinks
Smaller and faster nanochat in MLX
☆36Updated 2 months ago
Alternatives and similar repositories for picochat
Users that are interested in picochat are comparing it to the libraries listed below
Sorting:
- OLMost every training recipe you need to perform data interventions with the OLMo family of models.☆64Updated this week
- Benchmarks comparing PyTorch and MLX on Apple Silicon GPUs☆93Updated last week
- A collection of lightweight interpretability scripts to understand how LLMs think☆89Updated last week
- ☆50Updated 3 months ago
- mlx image models for Apple Silicon machines☆91Updated 2 months ago
- Lego for GRPO☆30Updated 8 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆85Updated 5 months ago
- ☆68Updated last year
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 11 months ago
- A collection of optimizers for MLX☆54Updated last month
- Seemless interface of using PyTOrch distributed with Jupyter notebooks☆57Updated 4 months ago
- Implementation of nougat that focuses on processing pdf locally.☆84Updated last year
- NanoGPT-speedrunning for the poor T4 enjoyers☆73Updated 9 months ago
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)☆12Updated last year
- Tensor-Slayer : Manipulate weights and tensors of LLMs to achieve performance upgrades and introduce a novel inferenceless mechanistic in…☆27Updated 8 months ago
- EXO Gym is an open-source Python toolkit that facilitates distributed AI research.☆94Updated 2 months ago
- Public reports detailing responses to sets of prompts by Large Language Models.☆32Updated last year
- ☆21Updated 7 months ago
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆109Updated 8 months ago
- Montelimar - Extract text from anywhere☆87Updated 4 months ago
- ☆27Updated last year
- ☆39Updated 6 months ago
- smolbox of recipies☆29Updated 9 months ago
- Rust Implementation of micrograd☆53Updated last year
- KAN (Kolmogorov–Arnold Networks) in the MLX framework for Apple Silicon☆31Updated 7 months ago
- Cerule - A Tiny Mighty Vision Model☆68Updated 2 months ago
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Updated last week
- Efficient non-uniform quantization with GPTQ for GGUF☆58Updated 4 months ago
- CLaMR: Contextualized Late-Interaction for Multimodal Content Retrieval☆23Updated 7 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Updated 3 months ago