hoangthangta / ThangKAN
KANs for text classification on GLUE tasks
☆9Updated 9 months ago
Alternatives and similar repositories for ThangKAN
Users that are interested in ThangKAN are comparing it to the libraries listed below
Sorting:
- ☆8Updated 6 months ago
- C++ and Cuda ops for fused FourierKAN☆78Updated last year
- This code implements a Radial Basis Function (RBF) based Kolmogorov-Arnold Network (KAN) for function approximation.☆28Updated 11 months ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆16Updated last year
- ☆88Updated 11 months ago
- FC-KAN: Function Combinations in Kolmogorov-Arnold Networks☆33Updated this week
- efficient query encoding for dense retrieval☆11Updated 9 months ago
- Combine B-Splines (BS) and Radial Basis Functions (RBF) in Kolmogorov-Arnold Networks (KANs)☆26Updated last week
- ☆14Updated 11 months ago
- MQAG: Multiple-choice Question Answering and Generation for Assessing Information Consistency☆30Updated last year
- ☆10Updated last year
- ☆45Updated last year
- We study toy models of skill learning.☆26Updated 4 months ago
- ☆11Updated 9 months ago
- An implementation of mLSTM and sLSTM in PyTorch.☆28Updated 11 months ago
- Non-official implementation of "Attention as an RNN" from https://arxiv.org/pdf/2405.13956, efficient associative parallel prefix scan an…☆26Updated 9 months ago
- Pytorch (Lightning) implementation of the Mamba model☆28Updated last month
- Source code for the paper "Positional Attention: Out-of-Distribution Generalization and Expressivity for Neural Algorithmic Reasoning"☆14Updated 3 months ago
- Kolmogorov–Arnold Networks (KAN) in PyTorch☆23Updated last year
- ☆25Updated last year
- Official implementation of "BERTs are Generative In-Context Learners"☆27Updated 2 months ago
- Code for the paper "Cottention: Linear Transformers With Cosine Attention"☆17Updated 7 months ago
- Pytorch Implementation of the paper: "Learning to (Learn at Test Time): RNNs with Expressive Hidden States"☆24Updated 3 weeks ago
- Variations of Kolmogorov-Arnold Networks☆114Updated last year
- ☆33Updated last year
- [ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalization☆29Updated 8 months ago
- Generative Reranker PyTerrier☆14Updated last month
- Implementation of MambaFormer in Pytorch ++ Zeta from the paper: "Can Mamba Learn How to Learn? A Comparative Study on In-Context Learnin…☆21Updated 3 weeks ago
- ☆22Updated 3 months ago
- Serial Contrastive Knowledge Distillation for Continual Few-shot Relation Extraction, Findings of ACL 2023☆11Updated 2 years ago