PBDESG / nnViewerLinks
☆10Updated 8 months ago
Alternatives and similar repositories for nnViewer
Users that are interested in nnViewer are comparing it to the libraries listed below
Sorting:
- Cerule - A Tiny Mighty Vision Model☆67Updated last year
- A collection of lightweight interpretability scripts to understand how LLMs think☆56Updated last week
- Lego for GRPO☆29Updated 4 months ago
- The code repository for the CURLoRA research paper. Stable LLM continual fine-tuning and catastrophic forgetting mitigation.☆51Updated last year
- ☆136Updated last year
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 7 months ago
- ☆36Updated 2 months ago
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind☆57Updated 4 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆103Updated 9 months ago
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28Updated 5 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆34Updated last year
- Torch-activation, a library of activation functions for PyTorch library☆26Updated 5 months ago
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆102Updated 9 months ago
- ☆59Updated last year
- ☆119Updated last year
- OpenPipe Reinforcement Learning Experiments☆31Updated 6 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆46Updated last year
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆147Updated last week
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆84Updated last year
- KV Cache Steering for Inducing Reasoning in Small Language Models☆40Updated 2 months ago
- LLM training in simple, raw C/CUDA☆15Updated 10 months ago
- Create topological graph for image segments.☆22Updated last year
- ☆67Updated last year
- Fun project: LLM powered RAG Discord Bot that works seamlessly on CPU☆32Updated last year
- Train an adapter for any embedding model in under a minute☆127Updated 6 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆105Updated 7 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆55Updated 8 months ago
- ICLR 2025 - official implementation for "I-Con: A Unifying Framework for Representation Learning"☆111Updated 3 months ago
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)☆34Updated 7 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated last year