RobertCsordas / linear_layer_as_attentionView external linksLinks
The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns via Spotlights of Attention".
☆16Jun 11, 2025Updated 8 months ago
Alternatives and similar repositories for linear_layer_as_attention
Users that are interested in linear_layer_as_attention are comparing it to the libraries listed below
Sorting:
- Spectral Attention Autoregressive Model (SAAM)☆16Oct 27, 2022Updated 3 years ago
- ☆14Nov 20, 2022Updated 3 years ago
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".☆30Nov 12, 2024Updated last year
- [ICLR 2022] Pretraining Text Encoders with Adversarial Mixture of Training Signal Generators☆26Jul 26, 2023Updated 2 years ago
- This repository will contain code for the paper "CLIP meets GamePhysics: Towards bug identification in gameplay videos using zero-shot tr…☆26Dec 23, 2023Updated 2 years ago
- The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization".☆34Jun 11, 2025Updated 8 months ago
- ☆26Feb 27, 2022Updated 3 years ago
- The official repo of continuous speculative decoding☆31Mar 28, 2025Updated 10 months ago
- Code for Cross-Modal 3D Shape Generation and Manipulation (ECCV 2022)☆29May 23, 2023Updated 2 years ago
- Finetuning & extending DiffusionDet to video & pedestrian multi-object-tracking☆13Apr 12, 2023Updated 2 years ago
- Style Transfer by Rigid Alignment in Neural Net Feature Space☆11Jan 23, 2021Updated 5 years ago
- 新词发现/新词挖掘/自由度/凝固度/python3☆10May 28, 2019Updated 6 years ago
- Martingale posterior neural networks for fast sequential decision making @ Neurips 2025☆22Nov 13, 2025Updated 3 months ago
- Assessing spectral estimation methods for Electric Network Frequency (ENF) Extraction☆10Jan 10, 2020Updated 6 years ago
- [ICLR 2022] "Sparsity Winning Twice: Better Robust Generalization from More Efficient Training" by Tianlong Chen*, Zhenyu Zhang*, Pengjun…☆40Mar 20, 2022Updated 3 years ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & it's caption derrived f…☆15Apr 22, 2021Updated 4 years ago
- ☆16Feb 22, 2025Updated 11 months ago
- Detects scene change or cuts in a video file☆11Oct 23, 2017Updated 8 years ago
- 豆瓣电影评论可视化☆10May 19, 2016Updated 9 years ago
- ☆12Aug 30, 2022Updated 3 years ago
- ☆12May 30, 2023Updated 2 years ago
- ☆11Feb 28, 2022Updated 3 years ago
- ☆10Nov 17, 2022Updated 3 years ago
- Multi-labels anime image classification in rust☆12Mar 10, 2023Updated 2 years ago
- Python + Octave code for measuring surface height using fringe deflectometry. Probably not useful to anyone else at this point.☆11Jan 6, 2014Updated 12 years ago
- Official code for "Reinforced Sequential Decision-Making for Sepsis Treatment: The POSNEGDM Framework with Mortality Classifier and Trans…☆10Sep 11, 2024Updated last year
- Autoencoder for multi-label classification using Google's Tensorflow framework and MDMR for feature selection.☆10Aug 31, 2017Updated 8 years ago
- Semi-Markov Afterstate Actor-Critic (SMAAC) with Maze☆11Nov 16, 2021Updated 4 years ago
- ☆11Jan 16, 2025Updated last year
- Residual vector quantization for KV cache compression in large language model☆11Oct 22, 2024Updated last year
- The implement of LLMTreeRec☆14Dec 9, 2024Updated last year
- A Tool for Intersecting Context-Free Grammars☆10Dec 19, 2017Updated 8 years ago
- 3D Scene Flow Estimation☆15Sep 24, 2025Updated 4 months ago
- I'm trying to learn calculus before taking a calculus course☆24Dec 11, 2024Updated last year
- ☆11Nov 23, 2020Updated 5 years ago
- A feishu bot daily push arxiv latest articles.☆10Nov 28, 2021Updated 4 years ago
- Video Summarization Transformer: Implementation in PyTorch of the Transformer model for video summarisation☆10Oct 27, 2020Updated 5 years ago
- Fairness-Aware Representation Learning by Suppressing Attribute-Class Associations☆12Dec 10, 2024Updated last year
- A "gym" style toolkit for building lightweight NAS systems.☆13Jun 13, 2022Updated 3 years ago