Laz4rz / matryoshka
Implementation of "Matryoshka-Adaptor: Unsupervised and Supervised Tuning for Smaller Embedding Dimensions"
☆16Updated 2 months ago
Related projects
Alternatives and complementary repositories for matryoshka
- An automated tool for discovering insights from research paper corpora☆135Updated 5 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆56Updated 3 weeks ago
- smolLM with Entropix sampler on pytorch☆141Updated 3 weeks ago
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊☆112Updated last week
- Minimal, clean code implementation of RAG with mlx using gguf model weights☆43Updated 6 months ago
- Graph Neural Network library made for Apple Silicon☆171Updated last month
- MLX implementation of xLSTM model by Beck et al. (2024)☆25Updated 5 months ago
- This repository contains a fork from "language-models-trajectory-generators", the goal is to test the same functionality with Mistrals LL…☆19Updated last month
- run paligemma in real time☆123Updated 6 months ago
- A really tiny autograd engine☆87Updated 7 months ago
- ☆70Updated this week
- ☆95Updated last month
- ☆36Updated 3 months ago
- A reinforcement learning framework based on MLX.☆221Updated 9 months ago
- Routing on Random Forest (RoRF)☆84Updated 2 months ago
- look how they massacred my boy☆58Updated last month
- ☆27Updated 4 months ago
- ☆81Updated 2 months ago
- Computer Vision and Machine Learning Jupyter Notebooks for Educational Purposes☆75Updated 8 months ago
- Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience"☆48Updated last year
- Official homepage for "Self-Harmonized Chain of Thought"☆83Updated 2 months ago
- Generate 3Blue1Brown style videos for teaching and visualizing any concept☆53Updated 5 months ago
- Simple Transformer in Jax☆119Updated 5 months ago
- ☆106Updated 3 months ago
- ☆39Updated 9 months ago
- ComplexTensor: Machine Learning By Bridging Classical and Quantum Computation☆73Updated this week
- LLM training in simple, raw C/CUDA☆12Updated last month
- ☆16Updated last month
- Andrej Karpathy's micrograd implemented in C☆29Updated 3 months ago
- Alternative way of calculating self-attention☆18Updated 6 months ago