samvher / bert-for-laptops
A BERT that you can train on a (gaming) laptop.
☆212 · Updated last year
Related projects:
- Absolute minimalistic implementation of a GPT-like transformer using only numpy (<650 lines). ☆249 · Updated 9 months ago
- Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few l… ☆269 · Updated last month
- A Detailed Introduction to My Favorite Statistical Measure, Hoeffding's D ☆94 · Updated 6 months ago
- Revealing example of self-attention, the building block of transformer AI models ☆130 · Updated last year
- The Fast Vector Similarity Library is designed to provide efficient computation of various similarity measures between vectors. ☆344 · Updated this week
- throwaway GPT inference ☆139 · Updated 3 months ago
- A pure NumPy implementation of Mamba. ☆212 · Updated 2 months ago
- A copy of ONNX models, datasets, and code all in one GitHub repository. Follow the README to learn more. ☆104 · Updated 9 months ago
- Visualize the intermediate output of Mistral 7B ☆300 · Updated 7 months ago
- Official codebase for the paper "Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping". ☆288 · Updated 3 months ago
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and … ☆190 · Updated 3 months ago
- Stateful load balancer custom-tailored for llama.cpp ☆523 · Updated this week
- LLM Analytics ☆593 · Updated last month
- This project collects GPU benchmarks from various cloud providers and compares them to fixed per token costs. Use our tool for efficient … ☆196 · Updated 2 weeks ago
- Deep learning accelerator architectures requiring half the multipliers ☆259 · Updated 5 months ago
- An implementation of bucketMul LLM inference ☆212 · Updated 2 months ago
- A tool to analyze and debug neural networks in pytorch. Use a GUI to traverse the computation graph and view the data from many different… ☆228 · Updated this week
- Neural Search ☆333 · Updated 3 months ago
- a small code base for training large models ☆261 · Updated this week
- DiscoGrad - automatically differentiate across conditional branches in C++ programs ☆202 · Updated last week
- Mistral7B playing DOOM ☆117 · Updated 2 months ago
- Implement recursion using English as the programming language and an LLM as the runtime. ☆125 · Updated last year
- Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more. ☆260 · Updated last month
- An interactive HTML pretty-printer for machine learning research in IPython notebooks. ☆256 · Updated this week
- Autograd to GPT-2 completely from scratch ☆104 · Updated last month
- Lamport's Bakery Algorithm Demonstrated in Python ☆94 · Updated 8 months ago