nreHieW / loss
Visualising Losses in Deep Neural Networks
☆15 Updated 4 months ago
Related projects
Alternatives and complementary repositories for loss
- Implementation of a holodeck, written in PyTorch ☆17 Updated last year
- Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens) ☆43 Updated last month
- Utilities for PyTorch distributed ☆23 Updated last year
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data ☆19 Updated 3 months ago
- Local Attention - Flax module for Jax ☆20 Updated 3 years ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing ☆47 Updated 2 years ago
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k. ☆22 Updated last year
- Engineering the state of RNN language models (Mamba, RWKV, etc.) ☆32 Updated 5 months ago
- Load any CLIP model with a standardized interface ☆21 Updated 6 months ago
- Describe the format of image/text datasets ☆11 Updated 2 years ago
- A dashboard for exploring timm learning rate schedulers ☆18 Updated last year
- DiCE: The Infinitely Differentiable Monte-Carlo Estimator ☆30 Updated last year
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format. ☆14 Updated 2 years ago
- An implementation of the Llama architecture, to instruct and delight ☆21 Updated 3 months ago
- Implementation of the proposed Spline-Based Transformer from Disney Research ☆77 Updated 2 weeks ago
- ☆29 Updated 2 years ago
- Hacks for PyTorch ☆17 Updated last year
- Collection of autoregressive model implementations ☆67 Updated this week
- Official repository for the paper "End-to-End Visual Editing with a Generatively Pre-Trained Artist", which is accepted at ECCV 2022. Her… ☆29 Updated last year
- Directed masked autoencoders ☆14 Updated last year
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training" ☆23 Updated last week
- Little article showing how to load PyTorch models with linear memory consumption ☆34 Updated 2 years ago
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google DeepMind ☆53 Updated 2 months ago
- Exploration into the Firefly algorithm in PyTorch ☆35 Updated 2 months ago
- GoldFinch and other hybrid transformer components ☆40 Updated 4 months ago
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8. ☆35 Updated 4 months ago