A Pytorch implementation of "Rare Tokens Degenerate All Tokens: Improving Neural Text Generation via Adaptive Gradient Gating for Rare Token Embeddings"
☆10Apr 20, 2022Updated 3 years ago
Alternatives and similar repositories for AGG
Users that are interested in AGG are comparing it to the libraries listed below
Sorting:
- [ICLR'23] New Insights for the Stability-Plasticity Dilemma in Online Continual Learning☆20Feb 14, 2023Updated 3 years ago
- ☆31Oct 15, 2021Updated 4 years ago
- PyTorch implementation of the paper "NanoFlow: Scalable Normalizing Flows with Sublinear Parameter Complexity." (NeurIPS 2020)☆67Dec 28, 2020Updated 5 years ago
- Official PyTorch implementation of "Paralinguistics-Aware Speech-Empowered LLMs for Natural Conversation" (NeurIPS 2024)☆94Dec 3, 2024Updated last year
- Code for CIKM 2021 best short paper nomination "Modeling Sequences as Distributions with Uncertainty for Sequential Recommendation" https…☆16Jun 11, 2021Updated 4 years ago
- ☆18Jun 16, 2025Updated 8 months ago
- Simple ranking metrics for PyTorch on CPU or GPU☆15Nov 20, 2020Updated 5 years ago
- Contrastive Learning with Model Augmentation☆18Aug 3, 2022Updated 3 years ago
- [ WSDM '22 ] On Sampling Collaborative Filtering Datasets☆20Jan 13, 2022Updated 4 years ago
- ☆21Nov 23, 2021Updated 4 years ago
- Official code for "Probabilistic Concept Bottleneck Models (ICML 2023)"☆19Aug 14, 2023Updated 2 years ago
- An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data"☆137Aug 17, 2023Updated 2 years ago
- HexaGAN: Generative Adversarial Nets for Real World Classification (ICML 2019)☆21Aug 26, 2021Updated 4 years ago
- ☆25Jul 24, 2020Updated 5 years ago
- Transfer Learning of Graph Neural Networks with Ego-graph Information Maximization (NeurIPS 21')☆23Dec 9, 2021Updated 4 years ago
- Graph Trend Filtering Networks for Recommendations, SIGIR'2022☆27Apr 5, 2022Updated 3 years ago
- Official repository of "Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach" (ACL 2024 Oral)☆34Mar 24, 2025Updated 11 months ago
- Addressing the problem of predicting crime occurrence based on historic records☆11Nov 27, 2019Updated 6 years ago
- Official code for "Bridging the Gap between Classification and Localization for Weakly Supervised Object Localization (CVPR 2022)"☆29Aug 14, 2023Updated 2 years ago
- Reducing Information Bottleneck for Weakly Supervised Semantic Segmentation (NeurIPS 2021)☆88Oct 22, 2021Updated 4 years ago
- ☆13Jun 18, 2025Updated 8 months ago
- ☆80Nov 28, 2022Updated 3 years ago
- Mitigating the Filter Bubble while Maintaining Relevance: Targeted Diversification with VAE-based Recommender Systems☆10Mar 15, 2023Updated 2 years ago
- This is the code of paper: Robust Mid-Pass Filtering Graph Convolutional Networks.(paper accepted by WWW2023)☆13Feb 17, 2023Updated 3 years ago
- This repository reproduces the results in the paper "How expressive are transformers in spectral domain for graphs?"(published in TMLR)☆12Jul 10, 2022Updated 3 years ago
- Graphical intuition to MOSFET square-law☆11Jan 5, 2021Updated 5 years ago
- Code for AAAI21 paper "Scalable and Explainable 1-Bit Matrix Completion via Graph Signal Learning"☆11Feb 15, 2022Updated 4 years ago
- Code for ICML21 paper "Learning Self-Modulating Attention in Continuous Time Space with Applications to Sequential Recommendation"☆12Feb 8, 2023Updated 3 years ago
- ☆11Jul 20, 2021Updated 4 years ago
- ☆11Jan 7, 2025Updated last year
- "SCONE: A Novel Stochastic Sampling to Generate Contrastive Views and Hard Negative Samples for Recommendation", WSDM 2025☆14Nov 25, 2025Updated 3 months ago
- The official PyTorch implementation of "An Attentional Multi-scale Co-evolving Model for Dynamic Link Prediction" (TheWebConf'23)☆11May 4, 2023Updated 2 years ago
- ☆41Dec 7, 2022Updated 3 years ago
- TensorFlow implementation of our paper: Cross Pairwise Ranking for Unbiased Item Recommendation (WWW'22)☆44Nov 8, 2023Updated 2 years ago
- ☆59Mar 22, 2025Updated 11 months ago
- ☆12Apr 2, 2025Updated 11 months ago
- Material for the course of "Mathematics of Transformer"☆19Aug 3, 2025Updated 7 months ago
- In this project, Basic Machine Learning concepts were built on Desharnais dataset to built a software effort estimation model using a lin…☆10Nov 5, 2018Updated 7 years ago
- Code for "ParaGuide: Guided Diffusion Paraphrasers for Plug-and-Play Textual Style Transfer"☆15Jul 17, 2024Updated last year