valine / NeuralFlow
Visualize the intermediate output of Mistral 7B
☆313Updated 9 months ago
Related projects ⓘ
Alternatives and complementary repositories for NeuralFlow
- A library for making RepE control vectors☆481Updated last month
- LLM Analytics☆615Updated last month
- An implementation of bucketMul LLM inference☆214Updated 4 months ago
- Stop messing around with finicky sampling parameters and just use DRµGS!☆318Updated 5 months ago
- Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few l…☆271Updated this week
- a curated list of data for reasoning ai☆112Updated 3 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆203Updated 6 months ago
- Open weights language model from Google DeepMind, based on Griffin.☆607Updated 4 months ago
- Mistral7B playing DOOM☆122Updated 4 months ago
- Neural Search☆344Updated 5 months ago
- The repository for the code of the UltraFastBERT paper☆514Updated 7 months ago
- This is our own implementation of 'Layer Selective Rank Reduction'☆232Updated 5 months ago
- Stateful load balancer custom-tailored for llama.cpp☆563Updated this week
- Low-Rank adapter extraction for fine-tuned transformers model☆162Updated 6 months ago
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆282Updated last month
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆246Updated 2 weeks ago
- Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens☆333Updated 5 months ago
- ☆149Updated 4 months ago
- Inference code for Persimmon-8B☆416Updated last year
- Absolute minimalistic implementation of a GPT-like transformer using only numpy (<650 lines).☆250Updated last year
- a small code base for training large models☆266Updated 3 weeks ago
- Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping".☆322Updated 5 months ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆221Updated 6 months ago
- Fast parallel LLM inference for MLX☆149Updated 4 months ago
- An efficent implementation of the method proposed in "The Era of 1-bit LLMs"☆154Updated last month
- Fine-tune LLM agents with online reinforcement learning☆995Updated 8 months ago
- Finetune llama2-70b and codellama on MacBook Air without quantization☆447Updated 7 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆221Updated 2 weeks ago
- A pure NumPy implementation of Mamba.☆216Updated 4 months ago
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …☆193Updated this week