huggingface / paper-style-guideLinks

☆72

Alternatives and similar repositories for paper-style-guide

Users that are interested in paper-style-guide are comparing it to the libraries listed below

Sorting:

ThomasRobertFr / deep-learning-figures
Figures I made during my PhD in Deep Learning, for my models and for context
☆84Updated 4 years ago
EfficientDL / book
PDFs and Codelabs for the Efficient Deep Learning book.
☆203Updated 2 years ago
causalNLP / AI-Scholar
☆23Updated 2 years ago
enochkan / torch-metrics
Metrics for model evaluation in pytorch
☆110Updated 4 years ago
thegregyang / LossUpAccUp
Loss and accuracy go opposite ways...right?
☆95Updated 5 years ago
epfml / collaborative-attention
Code for Multi-Head Attention: Collaborate Instead of Concatenate
☆151Updated 2 years ago
lucidrains / learning-to-expire-pytorch
An implementation of Transformer with Expire-Span, a circuit for learning which memories to retain
☆34Updated 5 years ago
dair-ai / awesome-research-proposals-guide
A guide to improve your research proposals.
☆202Updated 5 years ago
MathInf / toroidal
a lightweight transformer library for PyTorch
☆72Updated 4 years ago
shreyansh26 / ML-Optimizers-JAX
Toy implementations of some popular ML optimizers using Python/JAX
☆44Updated 4 years ago
stanislavfort / adversaries_to_convnext
Adversarial examples to the new ConvNeXt architecture
☆20Updated 3 years ago
dlg4nlp / dlg4nlp.github.io
This website is to host a series of tutorials on Deep Learning on Graphs for Natural Language Processing.
☆13Updated 3 years ago
sharonzhou / ICLR2021-Stats
ICLR 2021 Stats & Graphs
☆31Updated 3 years ago
ahthie7u / cockpit
Code for the anonymous submission "Cockpit: A Practical Debugging Tool for Training Deep Neural Networks"
☆31Updated 5 years ago
pkuzengqi / Skyformer
Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr\"om Method (NeurIPS 2021)
☆63Updated 3 years ago
sayakpaul / robustness-foundation-models
This repository holds code and other relevant files for the NeurIPS 2022 tutorial: Foundational Robustness of Foundation Models.
☆72Updated 2 years ago
CyndxAI / QKNorm
Code for the paper "Query-Key Normalization for Transformers"
☆49Updated 4 years ago
vra / flopth
A simple program to calculate and visualize the FLOPs and Parameters of Pytorch models, with handy CLI and easy-to-use Python API.
☆131Updated last year
xtinkt / editable
A supplementary code for Editable Neural Networks, an ICLR 2020 submission.
☆46Updated 5 years ago
leaderj1001 / Synthesizer-Rethinking-Self-Attention-Transformer-Models
Implementing SYNTHESIZER: Rethinking Self-Attention in Transformer Models using Pytorch
☆70Updated 5 years ago
Lightning-Universe / lightning-ColossalAI
Large Scale Distributed Model Training strategy with Colossal AI and Lightning AI
☆56Updated 2 years ago
lucidrains / multistream-transformers
Implementation of Multistream Transformers in Pytorch
☆54Updated 4 years ago
rasbt / faster-pytorch-blog
Outlining techniques for improving the training performance of your PyTorch model without compromising its accuracy
☆129Updated 2 years ago
MaxHalford / pytorch-resample
🎲 Iterable dataset resampling in PyTorch
☆92Updated 3 years ago
wilile26811249 / Fastformer-PyTorch
Unofficial PyTorch implementation of Fastformer based on paper "Fastformer: Additive Attention Can Be All You Need"."
☆133Updated 4 years ago
NVIDIA / transformer-ls
Official PyTorch Implementation of Long-Short Transformer (NeurIPS 2021).
☆228Updated 3 years ago
MAC-AutoML / YOCO-BERT
The official implementation of You Only Compress Once: Towards Effective and Elastic BERT Compression via Exploit-Explore Stochastic Natu…
☆48Updated 4 years ago
google-research / head2toe
☆81Updated last year
lucidrains / remixer-pytorch
Implementation of the Remixer Block from the Remixer paper, in Pytorch
☆36Updated 4 years ago
sIncerass / powernorm
[ICML 2020] code for "PowerNorm: Rethinking Batch Normalization in Transformers" https://arxiv.org/abs/2003.07845
☆120Updated 4 years ago