bojone / softtopk

differentiable top-k operator

☆21

Alternatives and similar repositories for softtopk

Users who are interested in softtopk are comparing it to the repositories listed below.

megvii-research / IntLLaMA
IntLLaMA: A fast and light quantization solution for LLaMA
☆18 Updated last year
HubHop / vit-attention-benchmark
Benchmarking Attention Mechanism in Vision Transformers.
☆18 Updated 2 years ago
BBuf / flash-rwkv
☆31 Updated last year
vedaldi / micro_llama
A tiny, didactical implementation of LLAMA 3
☆41 Updated 6 months ago
fla-org / flash-bidirectional-linear-attention
Triton implementation of bi-directional (non-causal) linear attention
☆50 Updated 4 months ago
alihassanijr / TorchKMeans
A torch-based implementation of K-Means and K-Means++
☆17 Updated 4 years ago
bojone / FSQ
Keras implementation of Finite Scalar Quantization
☆75 Updated last year
savadikarc / gift
GIFT: Generative Interpretable Fine-Tuning
☆20 Updated 8 months ago
yangjackie / Topics-on-diffusion-generative-models
☆26 Updated 2 months ago
lucasjinreal / LLaVA-Magvit2
LLaVA combined with the Magvit image tokenizer, training an MLLM without a vision encoder. Unifying image understanding and generation.
☆37 Updated last year
bojone / tiger
A Tight-fisted Optimizer
☆48 Updated 2 years ago
HelmholtzAI-FZJ / flex_gen
☆17 Updated 5 months ago
OpenGVLab / DiffAgent
[CVPR 2024] DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model
☆17 Updated last year
cheneydon / efficient-bert
This repository contains the code for the paper in Findings of EMNLP 2021: "EfficientBERT: Progressively Searching Multilayer Perceptron …
☆33 Updated 2 years ago
megvii-research / basedet
An object detection codebase based on MegEngine.
☆28 Updated 2 years ago
yuweihao / LV-BERT
LV-BERT: Exploiting Layer Variety for BERT (Findings of ACL 2021)
☆18 Updated 2 years ago
CyndxAI / QKNorm
Code for the paper "Query-Key Normalization for Transformers"
☆43 Updated 4 years ago
donglixp / ICL_PaperList
Paper List for In-context Learning 🌷
☆20 Updated 2 years ago
ziplab / SN-Netv2
[ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".
☆27 Updated last year
twinkle0331 / LGTM
[ACL 2023] Code for paper “Tailoring Instructions to Student’s Learning Levels Boosts Knowledge Distillation”(https://arxiv.org/abs/2305.…
☆38 Updated 2 years ago
shaoshitong / diffusion-model-learning
Demos and a series of documents for learning about diffusion models.
☆39 Updated last year
ilur98 / DGQ
Official Code For Dual Grained Quantization: Efficient Fine-Grained Quantization for LLM
☆14 Updated last year
lartpang / RunIt
A simple program scheduler for your code on different devices.
☆11 Updated 10 months ago
MarkXCloud / CSpD
The official repo of continuous speculative decoding
☆27 Updated 3 months ago
layumi / ICME2022SS
ICME2022 Special Session “Beyond Accuracy: Responsible, Responsive, and Robust Multimedia Retrieval”
☆12 Updated last year
HDETR / H-PETR-Pose
[CVPR2023] This is an official implementation of paper "DETRs with Hybrid Matching".
☆14 Updated 2 years ago
zhixuan-lin / forgetting-transformer
[ICLR 2025] Official PyTorch implementation of "Forgetting Transformer: Softmax Attention with a Forget Gate"
☆108 Updated last month
MingSun-Tse / Why-the-State-of-Pruning-so-Confusing
[Preprint] Why is the State of Neural Network Pruning so Confusing? On the Fairness, Comparison Setup, and Trainability in Network Prunin…
☆40 Updated 2 years ago
LAION-AI / Conditional-Pretraining-of-Large-Language-Models
☆37 Updated 2 years ago
Xianchao-Wu / perceiver-pytorch
☆42 Updated 4 years ago