PBDESG / nnViewerLinks
☆10Updated last year
Alternatives and similar repositories for nnViewer
Users that are interested in nnViewer are comparing it to the libraries listed below
Sorting:
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆85Updated last year
- Lego for GRPO☆30Updated 8 months ago
- Easy to use, High Performant Knowledge Distillation for LLMs☆97Updated 9 months ago
- An introduction to LLM Sampling☆79Updated last year
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆103Updated last year
- A collection of lightweight interpretability scripts to understand how LLMs think☆89Updated 2 weeks ago
- OmegaViT (ΩViT) is a cutting-edge vision transformer architecture that combines multi-query attention, rotary embeddings, state space mod…☆14Updated last week
- Pokedex for LLMs☆14Updated 9 months ago
- ☆137Updated last year
- ☆39Updated 6 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆47Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆34Updated last year
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind☆59Updated 8 months ago
- A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability.☆98Updated last year
- Cerule - A Tiny Mighty Vision Model☆68Updated 3 months ago
- The code repository for the CURLoRA research paper. Stable LLM continual fine-tuning and catastrophic forgetting mitigation.☆53Updated last year
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)☆12Updated last year
- Arxflix turns your boring Arxiv research paper into a captivating video.☆58Updated 4 months ago
- Torch-activation, a library of activation functions for PyTorch library☆25Updated 9 months ago
- code for training and using chess embeddings models☆13Updated last year
- ☆63Updated last year
- ☆68Updated last year
- entropix style sampling + GUI☆27Updated last year
- Tensor-Slayer : Manipulate weights and tensors of LLMs to achieve performance upgrades and introduce a novel inferenceless mechanistic in…☆27Updated 8 months ago
- Set of scripts to finetune LLMs☆38Updated last year
- Collection of autoregressive model implementation☆85Updated 3 weeks ago
- ☆56Updated last year
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆150Updated last month
- Fine tune Gemma 3 on an object detection task☆97Updated 6 months ago
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆103Updated last year