sumo43 / loopvlm
run paligemma in real time
☆122Updated 4 months ago
Related projects: ⓘ
- ☆101Updated 5 months ago
- On-device intelligence.☆136Updated last week
- Full finetuning of large language models without large memory requirements☆94Updated 8 months ago
- Fast parallel LLM inference for MLX☆118Updated 2 months ago
- An automated tool for discovering insights from research papaer corpora☆132Updated 3 months ago
- Aidan Bench attempts to measure <big_model_smell> in LLMs.☆64Updated this week
- ☆114Updated 9 months ago
- Cerule - A Tiny Mighty Vision Model☆67Updated last week
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆223Updated 4 months ago
- Scripts to create your own moe models using mlx☆86Updated 6 months ago
- Start a server from the MLX library.☆157Updated last month
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆217Updated 6 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆117Updated 8 months ago
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.☆101Updated last week
- Embed arbitrary modalities (images, audio, documents, etc) into large language models.☆170Updated 5 months ago
- ☆89Updated 11 months ago
- An mlx project to train a base model on your whatsapp chats using (Q)Lora finetuning☆156Updated 8 months ago
- Video+code lecture on building nanoGPT from scratch☆64Updated 3 months ago
- Command-line script for inferencing from models such as MPT-7B-Chat☆102Updated last year
- Run GGML models with Kubernetes.☆172Updated 9 months ago
- GPT-2 (124M) quality in 5B tokens☆227Updated last week
- A reinforcement learning framework based on MLX.☆215Updated 7 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆192Updated 4 months ago
- Simple Transformer in Jax☆100Updated 2 months ago
- run embeddings in MLX☆68Updated last month
- WIP - Allows you to create DSPy pipelines using ComfyUI☆170Updated last month
- 1.58 Bit LLM on Apple Silicon using MLX☆96Updated 4 months ago
- Long context evaluation for large language models☆148Updated this week
- ☆109Updated last month
- inference code for mixtral-8x7b-32kseqlen☆97Updated 9 months ago