facebookresearch / cycle_gan_for_complementary_item_recommendationsLinks
Research code of Cycle Generative Adversarial Networks for Complementary Item Recommendations.
☆19Updated 2 years ago
Alternatives and similar repositories for cycle_gan_for_complementary_item_recommendations
Users that are interested in cycle_gan_for_complementary_item_recommendations are comparing it to the libraries listed below
Sorting:
- Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorch☆39Updated 3 years ago
- Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012☆49Updated 3 years ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆24Updated this week
- ☆37Updated 2 years ago
- A dashboard for exploring timm learning rate schedulers☆19Updated 9 months ago
- Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction"☆58Updated last year
- Some personal experiments around routing tokens to different autoregressive attention, akin to mixture-of-experts☆121Updated 11 months ago
- [NeurIPS 2022] Your Transformer May Not be as Powerful as You Expect (official implementation)☆33Updated 2 years ago
- Explorations into adversarial losses on top of autoregressive loss for language modeling☆37Updated 6 months ago
- An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols☆16Updated 4 years ago
- PyTorch implementation of Soft MoE by Google Brain in "From Sparse to Soft Mixtures of Experts" (https://arxiv.org/pdf/2308.00951.pdf)☆78Updated last year
- ☆31Updated 3 months ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆50Updated 3 years ago
- Triton Implementation of HyperAttention Algorithm☆48Updated last year
- Code for COMET: Cardinality Constrained Mixture of Experts with Trees and Local Search☆11Updated 2 years ago
- My explorations into editing the knowledge and memories of an attention network☆35Updated 2 years ago
- Experimental scripts for researching data adaptive learning rate scheduling.☆22Updated last year
- Implementation of Metaformer, but in an autoregressive manner☆27Updated 3 years ago
- codebase for the SIMAT dataset and evaluation☆38Updated 3 years ago
- A python library for highly configurable transformers - easing model architecture search and experimentation.☆49Updated 3 years ago
- ☆16Updated last year
- Implementation of Infini-Transformer in Pytorch☆111Updated 8 months ago
- Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr\"om Method (NeurIPS 2021)☆63Updated 3 years ago
- ResiDual: Transformer with Dual Residual Connections, https://arxiv.org/abs/2304.14802☆95Updated 2 years ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32Updated last year
- Implementation of the Remixer Block from the Remixer paper, in Pytorch☆36Updated 3 years ago
- Implementation of Memory-Compressed Attention, from the paper "Generating Wikipedia By Summarizing Long Sequences"☆70Updated 2 years ago
- ☆32Updated last year
- Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group☆36Updated 11 months ago
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind☆68Updated last year