IDSIA/lmtool-fwp

IDSIA / lmtool-fwp

PyTorch Language Modeling Toolkit for Fast Weight Programmers

☆22

Alternatives and similar repositories for lmtool-fwp

Users who are interested in lmtool-fwp are comparing it to the libraries listed below.

lucidrains / fast-weight-attention
Implementation of Fast Weight Attention
☆33 · Jun 3, 2026 · Updated last month
merope82 / JWIms
☆12 · Jul 17, 2022 · Updated 4 years ago
ischlag / fast-weight-transformers
Official code repository of the paper Linear Transformers Are Secretly Fast Weight Programmers.
☆115 · Jun 10, 2021 · Updated 5 years ago
showgood / onlisp
Paul Graham's onlisp book in org mode format
☆22 · May 11, 2024 · Updated 2 years ago
quantified-uncertainty / ai-safety-papers
☆22 · Sep 9, 2021 · Updated 4 years ago
makokal / MDPN
Unified notation for Markov Decision Processes PO(MDP)s
☆24 · Apr 27, 2018 · Updated 8 years ago
ischlag / Fast-Weight-Memory-public
Official code repository of the paper Learning Associative Inference Using Fast Weight Memory by Schlag et al.
☆30 · Feb 25, 2021 · Updated 5 years ago
MicroSTM / AGENT-synthesis
Data synthesis code for "AGENT: A Benchmark for Core Psychological Reasoning"
☆24 · Mar 3, 2022 · Updated 4 years ago
IDSIA / recurrent-fwp
Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers" (NeurIPS 2021)
☆52 · Jun 11, 2025 · Updated last year
fuatkina / Advanced-Data-Analysis-with-Python
☆16 · Jan 5, 2023 · Updated 3 years ago
JONNY-ME / Microsoft-Rice-Disease-Classification-Challenge
Identifying disease types in images of rice grown in Egypt.
☆25 · Aug 15, 2022 · Updated 3 years ago
codebender828 / miro-landing-chakra
Clone of Miro's landing page made with Chakra UI Vue
☆12 · Sep 8, 2020 · Updated 5 years ago
mmrezaee / VRTM
"A Discrete Variational Recurrent Topic Model without the Reparametrization Trick" (NeurIPS 2020)
☆11 · Apr 26, 2021 · Updated 5 years ago
Smerity / pytorch-lamb
Implementation of https://arxiv.org/abs/1904.00962
☆15 · Aug 30, 2019 · Updated 6 years ago
deep-spin / OpenNMT-entmax
☆15 · May 14, 2019 · Updated 7 years ago
GATECH-EIC / Linearized-LLM
[ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models
☆35 · Jun 12, 2024 · Updated 2 years ago
RobertCsordas / onion_representations
☆13 · Aug 19, 2024 · Updated last year
proger / uk4b
GPT-2 Metadata Pretraining Towards Instruction Finetuning for Ukrainian
☆20 · Aug 6, 2023 · Updated 2 years ago
rloganiv / kglm-data
Code used to create the Linked WikiText-2 dataset
☆16 · May 22, 2023 · Updated 3 years ago
danhett / algoraveconduct
A concise open code of conduct for live Algorave events
☆16 · Jun 14, 2019 · Updated 7 years ago
dmitrymailk / mt_bench_ru
☆10 · Jan 16, 2024 · Updated 2 years ago
BICLab / MetaLA
Official implementation of "MetaLA: Unified Optimal Linear Approximation to Softmax Attention Map" (NeurIPS 2024 Oral)
☆36 · Jan 18, 2025 · Updated last year
serdarozsoy / corinfomax-ssl
PyTorch implementation of CorInfoMax
☆23 · Dec 26, 2022 · Updated 3 years ago
allenai / staged-training
Staged Training for Transformer Language Models
☆33 · Mar 31, 2022 · Updated 4 years ago
victordibia / cocoafrica
A Curation Tool and Dataset of Common Objects in the Context of Africa
☆18 · May 1, 2023 · Updated 3 years ago
divyanshj16 / SPADE
"Semantic Image Synthesis with Spatially-Adaptive Normalization" paper implementation
☆69 · Jul 19, 2019 · Updated 7 years ago
njuzrs / dialogue_distillation
☆15 · Nov 3, 2022 · Updated 3 years ago
nunogois / game-search-expo
Mobile app that lets you search for games. Using IGDB and built with Expo (React Native)
☆17 · Jul 6, 2021 · Updated 5 years ago
Yuanhy1997 / HyPe
HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]
☆14 · Jul 11, 2023 · Updated 3 years ago
samrat / ecstatic
ecstatic creates static web pages and blog posts from Hiccup templates and Markdown.
☆21 · Jan 28, 2014 · Updated 12 years ago
unlp-workshop / unlp-2025-shared-task
UNLP 2025 Shared Task on Detecting Social Media Manipulation
☆23 · Aug 4, 2025 · Updated 11 months ago
philschmid / multilingual-serverless-qa-aws-lambda
☆10 · Dec 17, 2020 · Updated 5 years ago
SongYanSDU / AugANFIS
Single-Source Domain Generalization for Bearing Fault Diagnosis Using Feature-Augmented Adaptive Neuro-Fuzzy Inference System
☆12 · Apr 13, 2024 · Updated 2 years ago
turtle261 / infotheory
An Algorithmic Information Theory Library (And Information-theory broadly); Implements numerous approximations, estimations, as well as a…
☆17 · Jun 30, 2026 · Updated 3 weeks ago
NiyunZhou / The21-dayExpendables
We are the 21-day expandables of a kaggle competition.
☆15 · Jul 23, 2017 · Updated 9 years ago
philipturner / apm-roadmap
A Nanofactory Roadmap 2: Improved Proposal for a Comprehensive Diamondoid Nanofactory Development Program
☆18 · Jul 24, 2025 · Updated 11 months ago
RUCAIBox / MPOP
☆13 · Jun 16, 2021 · Updated 5 years ago
lsj2408 / URPE
[NeurIPS 2022] Your Transformer May Not be as Powerful as You Expect (official implementation)
☆35 · Aug 6, 2023 · Updated 2 years ago
KamikaziZen / RunLoRA
Faster and Lighter LoRA Implementations
☆13 · Nov 21, 2024 · Updated last year