Implémentation of the article **Deep Learning CUDA Memory Usage and Pytorch optimization tricks**
☆43Jan 13, 2020Updated 6 years ago
Alternatives and similar repositories for article-memory-log
Users that are interested in article-memory-log are comparing it to the libraries listed below
Sorting:
- DEREK (Domain Entities and Relations Extraction Kit)☆10May 22, 2023Updated 2 years ago
- Implementation of OpenAI paper with Simple Noise Scale on Fastai V2☆19Apr 16, 2021Updated 4 years ago
- this repository is created to accumulate all LaTeX templates needed at Skoltech☆20Nov 27, 2018Updated 7 years ago
- Detecting gibberish as a type of sentiment analysis with GPT2☆25Nov 10, 2020Updated 5 years ago
- Investigating multilingual language models (BERT) by using them for NER in German and English☆14Apr 30, 2019Updated 6 years ago
- Reasoning-based Evaluation and Ranking of Translations.☆20Jul 18, 2025Updated 8 months ago
- manipulating cointegrated pairs to achieve a market-neutral strategy that outperforms indices☆12Jan 12, 2021Updated 5 years ago
- Official repository for the paper "Exploring the Promise and Limits of Real-Time Recurrent Learning" (ICLR 2024)☆13Jun 11, 2025Updated 9 months ago
- An AI agent that solves Raven's Progressive Matrices☆17Aug 27, 2016Updated 9 years ago
- This is a sample implementation of "TIMERS: Error-Bounded SVD Restart on Dynamic Networks"(AAAI 2018).☆12Jul 4, 2018Updated 7 years ago
- Russian FrameBank offline resources☆13Mar 27, 2020Updated 5 years ago
- Using synthetically generated video data to learn how/when R-CNNs can outperform CNNs.☆17Mar 18, 2018Updated 8 years ago
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)☆12Oct 11, 2024Updated last year
- ☆11Oct 31, 2021Updated 4 years ago
- A few models converted from caffe to CoreMLs format.☆15Jun 6, 2017Updated 8 years ago
- Implements MLP-Mixer (https://arxiv.org/abs/2105.01601) with the CIFAR-10 dataset.☆59Jun 26, 2022Updated 3 years ago
- Instant Graph Neural Networks for Dynamic Graphs☆11Dec 28, 2022Updated 3 years ago
- code and data for Improving Temporal Link Prediction via Temporal Walk Matrix Projection, NeurIPS 2024☆13Oct 5, 2024Updated last year
- Official repo for FunkNN: Neural Interpolation for Functional Generation☆11May 12, 2023Updated 2 years ago
- Sequence tagger based on BERT☆20Apr 28, 2022Updated 3 years ago
- Scratchpad/Chain-of-Thought Prompts☆12Jun 6, 2022Updated 3 years ago
- Combining SOAP and MUON☆19Feb 11, 2025Updated last year
- Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models"☆13Jul 18, 2024Updated last year
- [ICLRW'26] EoRA: Fine-tuning-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation☆28Jul 30, 2025Updated 7 months ago
- ☆45Nov 1, 2025Updated 4 months ago
- This github contains the implementation of the method proposed in MDGNN_BS paper☆12May 9, 2024Updated last year
- A curated list for interpretable machine learning☆18Jan 4, 2019Updated 7 years ago
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.☆22Jan 16, 2023Updated 3 years ago
- ☆15Mar 2, 2025Updated last year
- Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward☆43Nov 18, 2025Updated 4 months ago
- An approximate implementation of the OpenAI paper - An Empirical Model of Large-Batch Training for MNIST☆11Nov 19, 2022Updated 3 years ago
- A Statistical Arbitrage Strategy to trade Cryptocurrency Pairs☆13Nov 6, 2020Updated 5 years ago
- A Python Implementation of GLAD☆24Jan 18, 2021Updated 5 years ago
- HALO: Hadamard-Assisted Low-Precision Optimization and Training method for finetuning LLMs. 🚀 The official implementation of https://arx…☆29Feb 17, 2025Updated last year
- A Python library that integrates the Curvelet transform into differentiable programming pipelines, such as PyTorch and JAX.☆12Jun 22, 2023Updated 2 years ago
- RuSimpleSentEval (RSSE) shared task repo☆21Apr 26, 2021Updated 4 years ago
- Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation"☆14May 26, 2025Updated 9 months ago
- This is the code which powers the Twitter Bot https://twitter.com/RGB_Colours☆15Apr 14, 2017Updated 8 years ago
- ☆15Jul 18, 2024Updated last year