loeweX / Custom-ConvLayers-PytorchLinks

A reimplementation of 2D Convolutional and Transposed Convolutional Layers in PyTorch, designed for easy modifications and analysis. Includes comprehensive explanations and testing.

☆20

Alternatives and similar repositories for Custom-ConvLayers-Pytorch

Users that are interested in Custom-ConvLayers-Pytorch are comparing it to the libraries listed below

Sorting:

taufeeque9 / codebook-features
Sparse and discrete interpretability tool for neural networks
☆63Updated last year
amirzandieh / HyperAttention
Triton Implementation of HyperAttention Algorithm
☆48Updated last year
augustwester / transformer-xl
A lightweight PyTorch implementation of the Transformer-XL architecture proposed by Dai et al. (2019)
☆37Updated 2 years ago
shikaiqiu / compute-better-spent
☆53Updated 9 months ago
EleutherAI / features-across-time
Understanding how features learned by neural networks evolve throughout training
☆36Updated 8 months ago
KaiNylund / lm-weights-encode-time
☆68Updated 11 months ago
joey00072 / microjax
Jax like function transformation engine but micro, microjax
☆33Updated 8 months ago
HomebrewML / HomebrewNLP-torch
A case study of efficient training of large language models using commodity hardware.
☆68Updated 2 years ago
lucaslingle / mu_transformer
Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.
☆31Updated last month
tech-srl / layer_norm_expressivity_role
Code for the paper "On the Expressivity Role of LayerNorm in Transformers' Attention" (Findings of ACL'2023)
☆56Updated 9 months ago
UKPLab / on-emergence
Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning
☆33Updated 6 months ago
EleutherAI / mdl
Minimum Description Length probing for neural network representations
☆18Updated 5 months ago
EleutherAI / rnngineering
Engineering the state of RNN language models (Mamba, RWKV, etc.)
☆32Updated last year
google-research / precondition
☆31Updated 3 weeks ago
HomebrewML / Olmax
HomebrewNLP in JAX flavour for maintable TPU-Training
☆50Updated last year
srush / Tensor-Puzzles-Penzai
☆20Updated last year
smearle / autoverse
Generative cellular automaton-like learning environments for RL.
☆19Updated 5 months ago
tyler-romero / microR1
Simple repository for training small reasoning models
☆33Updated 5 months ago
ethancaballero / broken_neural_scaling_laws
Code Release for "Broken Neural Scaling Laws" (BNSL) paper
☆59Updated last year
rovle / gpt3-in-context-fitting
Experiments on GPT-3's ability to fit numerical models in-context.
☆14Updated 2 years ago
teddykoker / tinyloader
☆66Updated 3 months ago
irhum / hyena
JAX/Flax implementation of the Hyena Hierarchy
☆34Updated 2 years ago
ChrisHayduk / QLoRA-for-MLM
QLoRA for Masked Language Modeling
☆22Updated last year
luyug / magix
Supercharge huggingface transformers with model parallelism.
☆77Updated 9 months ago
apoorvkh / torchrunx
Easily run PyTorch on multiple GPUs & machines
☆46Updated 3 weeks ago
Sea-Snell / CALM-Dialogue
Official code for the paper "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems"
☆34Updated 2 years ago
KhoomeiK / complexity-scaling
gzip Predicts Data-dependent Scaling Laws
☆35Updated last year
allenai / EmbeddingRecycling
Embedding Recycling for Language models
☆39Updated 2 years ago
google-deepmind / enn_acme
☆31Updated 2 years ago
johnrobinsn / redpajama
Training and Inference Notebooks for the RedPajama (OpenLlama) models
☆18Updated 2 years ago