This repository contains the code for the paper "TaylorShift: Shifting the Complexity of Self-Attention from Squared to Linear (and Back) using Taylor-Softmax"
☆13 Feb 25, 2026 Updated last week
Alternatives and similar repositories for TaylorShift
Users that are interested in TaylorShift are comparing it to the libraries listed below
- GitHub repository for the paper Which Transformer to Favor: A Comparative Analysis of Efficiency in Vision Transformers. ☆33 Feb 25, 2026 Updated last week
- A token pruning method that accelerates ViTs for various tasks while maintaining high performance. ☆27 Jul 21, 2025 Updated 7 months ago
- Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024) ☆24 Jun 6, 2024 Updated last year
- NRLMFβ: Beta-distribution-rescored Neighborhood Regularized Logistic Matrix Factorization for Improving Performance of Drug–Target Intera… ☆11 Oct 12, 2021 Updated 4 years ago
- Using tensorflow object detection api and openCV to calculate real world coordinates from top view with fixed height of the camera. ☆10 Jun 19, 2021 Updated 4 years ago
- Smart Waste Management System using IoT ☆12 Jan 9, 2023 Updated 3 years ago
- Fine Grained Image Classification with Class Imbalance using Bilinear EfficientNet with Focal Loss and Label Smoothing ☆10 May 28, 2020 Updated 5 years ago
- Code for "Inducer-tuning: Connecting Prefix-tuning and Adapter-tuning" (EMNLP 2022) and "Empowering Parameter-Efficient Transfer Learning… ☆11 Feb 6, 2023 Updated 3 years ago
- Code for reproducing the results presented in the paper 'Predify: Augmenting deep neural networks with brain-inspired predictive coding dy… ☆10 Jun 19, 2022 Updated 3 years ago
- This repository provides an implementation of the DTi2Vec tool, to identify Drug-Target interaction using network embedding and ensemble … ☆12 Sep 28, 2021 Updated 4 years ago
- A simple script to add pdf-files to Zotero via CLI ☆12 May 17, 2020 Updated 5 years ago
- Chameleon: A MatMul-Free TCN Accelerator for End-to-End Few-Shot and Continual Learning from Sequential Data ☆26 Jun 6, 2025 Updated 8 months ago
- Hi, I'm Harmony the Hummingbird! Let's work together on whatever you care about. ☆12 May 3, 2024 Updated last year
- [ICML-2025] We introduce Lie group Relative position Encodings (LieRE) that goes beyond RoPE in supporting n-dimensional inputs. ☆14 Aug 8, 2025 Updated 6 months ago
- To mitigate position bias in LLMs, especially in long-context scenarios, we scale only one dimension of LLMs, reducing position bias and … ☆11 Jun 18, 2024 Updated last year
- KAF: Kolmogorov-Arnold Fourier Networks ☆20 Feb 19, 2025 Updated last year
- Official codebase for our paper "Do Language Models Use Their Depth Efficiently?" ☆29 Jun 25, 2025 Updated 8 months ago
- Improving transparency of large language models' reasoning ☆14 Nov 25, 2025 Updated 3 months ago
- (Siggraph Asia 2023) Project Page of "HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image" ☆10 Dec 9, 2023 Updated 2 years ago
- MoE-Visualizer is a tool designed to visualize the selection of experts in Mixture-of-Experts (MoE) models. ☆16 Apr 8, 2025 Updated 10 months ago
- [NeurIPS '25] FastDINOv2: Frequency Based Curriculum Learning Improves Robustness and Training Speed ☆25 Jul 26, 2025 Updated 7 months ago
- Everything you need to reproduce "Better plain ViT baselines for ImageNet-1k" in PyTorch, and more ☆12 Feb 16, 2026 Updated 2 weeks ago
- DOneLogin Android: Facial verification for Two-Factors Authentication (2FA) on Android platform ☆11 Mar 30, 2021 Updated 4 years ago
- 🚀 Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models. ☆13 Jan 30, 2026 Updated last month
- The official implementation for SETA (TIP 2024). ☆11 Feb 17, 2025 Updated last year
- Chinese translation and study notes for "Causality: Models, Reasoning, and Inference" (Judea Pearl, 2009) ☆15 Feb 18, 2022 Updated 4 years ago
- Transformer and Neural Operator for solving Stochastic PDE ☆12 May 22, 2022 Updated 3 years ago
- A server/client approach to face recognition. Aims to be fast, secure and iot friendly. Uses dlib. ☆11 May 7, 2021 Updated 4 years ago
- Track points in a video using the SIFT algorithm and OpenCV. ☆13 Jan 21, 2020 Updated 6 years ago
- Integration test of Verilog AXI modules (https://github.com/alexforencich/verilog-axi) with LiteX. ☆17 Dec 19, 2022 Updated 3 years ago
- Data and code for EACL'24 paper: Over-Reasoning and Redundant Calculation of Large Language Models ☆11 Jan 23, 2024 Updated 2 years ago
- HSViT: Horizontally Scalable Vision Transformer ☆13 Nov 6, 2024 Updated last year
- Reference implementation of the HEAT algorithm described in https://link.springer.com/chapter/10.1007/978-3-030-62362-3_4 ☆11 Mar 24, 2023 Updated 2 years ago