JirkaKlimes / jit-implementation
JIT Implementation: Code That Writes Itself
★100, updated last month
Related projects
Alternatives and complementary repositories for jit-implementation
- An introduction to LLM Sampling (★62, updated this week)
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens (★105, updated 2 weeks ago)
- (★116, updated 2 months ago)
- PyTorch implementation of models from the Zamba2 series. (★158, updated this week)
- Alice in Wonderland code base for experiments and raw experiments data (★108, updated last month)
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients" (★84, updated 2 months ago)
- The history files when recording human interaction while solving ARC tasks (★94, updated this week)
- (★94, updated last month)
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients. (★172, updated 3 months ago)
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto… (★201, updated 6 months ago)
- look how they massacred my boy (★54, updated 3 weeks ago)
- (★223, updated 3 weeks ago)
- An interactive HTML pretty-printer for machine learning research in IPython notebooks. (★333, updated last week)
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr… (★46, updated 2 weeks ago)
- prime (previously called ZeroBand) is a framework for efficient, globally distributed training of AI models over the internet. (★207, updated this week)
- A tool to analyze and debug neural networks in pytorch. Use a GUI to traverse the computation graph and view the data from many different… (★270, updated 2 weeks ago)
- Muon optimizer for neural networks: >30% extra sample efficiency, <3% wallclock overhead (★69, updated this week)
- Reasoning Computers. Lambda Calculus, Fully Differentiable. Also Neural Stacks, Queues, Arrays, Lists, Trees, and Latches. (★233, updated last week)
- TensorHue is a Python library that allows you to visualize tensors right in your console, making understanding and debugging tensor conte… (★106, updated last month)
- code for training & evaluating Contextual Document Embedding models (★93, updated this week)
- (★100, updated 3 months ago)
- Video+code lecture on building nanoGPT from scratch (★64, updated 4 months ago)
- Training Models Daily (★17, updated 10 months ago)
- Unsloth Studio (★29, updated last week)
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters (★104, updated last month)
- smolLM with Entropix sampler on pytorch (★137, updated last week)
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user… (★151, updated this week)
- Visualize the intermediate output of Mistral 7B (★312, updated 8 months ago)
- Gpu benchmark (★43, updated last month)
- Headless IDE for AI agents (★129, updated this week)