Learning to Encode Position for Transformer with Continuous Dynamical Model
☆59Aug 3, 2020Updated 5 years ago
Alternatives and similar repositories for FLOATER
Users that are interested in FLOATER are comparing it to the libraries listed below
Sorting:
- Contact-Aware Symplectic Integrator Network☆16Mar 22, 2023Updated 2 years ago
- [ICML 2025 Spotlight] RAPID: Long-Context Inference with Retrieval-Augmented Speculative Decoding☆19Mar 2, 2025Updated last year
- [EMNLP 2023] Once Upon a *Time* in *Graph*: Relative-Time Pretraining for Complex Temporal Reasoning☆17Oct 31, 2023Updated 2 years ago
- Sparse Attention with Linear Units☆20Apr 21, 2021Updated 4 years ago
- Meta learning for generative models.☆16Jul 24, 2019Updated 6 years ago
- u-MPS implementation and experimentation code used in the paper Tensor Networks for Probabilistic Sequence Modeling (https://arxiv.org/ab…☆19Jul 2, 2020Updated 5 years ago
- Code for the paper "A Fully Hyperbolic Neural Model for Hierarchical Multi-class Classification"☆17Nov 17, 2020Updated 5 years ago
- A huge dataset for Document Visual Question Answering☆20Jul 29, 2024Updated last year
- ☆25Jan 22, 2024Updated 2 years ago
- Equivariant Mesh Attention Networks☆20Aug 30, 2022Updated 3 years ago
- ☆21Jun 20, 2019Updated 6 years ago
- Transformer with Untied Positional Encoding (TUPE). Code of paper "Rethinking Positional Encoding in Language Pre-training". Improve exis…☆252Nov 8, 2021Updated 4 years ago
- Neural Graph Differential Equations (Neural GDEs)☆213Apr 15, 2021Updated 4 years ago
- Official code repository of the paper Linear Transformers Are Secretly Fast Weight Programmers.☆114Jun 10, 2021Updated 4 years ago
- Trading Positional Complexity vs Deepness in Coordinate Networks☆31Sep 2, 2023Updated 2 years ago
- ☆39Feb 7, 2025Updated last year
- ☆13Dec 9, 2020Updated 5 years ago
- Long Range Arena for Benchmarking Efficient Transformers☆782Dec 16, 2023Updated 2 years ago
- ☆27Jun 23, 2020Updated 5 years ago
- ODE2VAE: Deep generative second order ODEs with Bayesian neural networks☆132Aug 27, 2024Updated last year
- DisCo Transformer for Non-autoregressive MT☆77Jul 28, 2022Updated 3 years ago
- ☆38May 20, 2021Updated 4 years ago
- ☆10Sep 29, 2023Updated 2 years ago
- Source code for the "Computationally Tractable Riemannian Manifolds for Graph Embeddings" paper☆37Jun 11, 2020Updated 5 years ago
- Custom Keras layers for implementing multi-dimensional recurrent neural networks (MDRNNs) described in Alex Graves's paper https://arxiv.…☆10Apr 27, 2020Updated 5 years ago
- Code for efficiently sampling functions from GP(flow) posteriors☆74Nov 13, 2020Updated 5 years ago
- The official implementation of ''Can Graph Neural Networks Count Substructures?'' NeurIPS 2020☆35Mar 3, 2021Updated 5 years ago
- This contains the source code of Paper "Connecting Embeddings for Knowledge Graph Entity Typing", which is accepted in ACL 2020.☆42Jul 22, 2020Updated 5 years ago
- Code for the paper "Improving Missing Data Imputation with Deep Generative Models"☆32May 8, 2019Updated 6 years ago
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization☆43Feb 27, 2025Updated last year
- [EMNLP 2020] Discern: Discourse-Aware Entailment Reasoning Network for Conversational Machine Reading☆38Nov 22, 2022Updated 3 years ago
- Solving High Dimensional Partial Differential Equations with Deep Neural Networks☆34Dec 21, 2021Updated 4 years ago
- Foundation Model for Probabilistic Electricity Price Forecasting☆19Sep 29, 2025Updated 5 months ago
- ☆10Dec 11, 2021Updated 4 years ago
- Modified version of fairseq, including new implementations for criterions using reinforcement learning methods.☆11Aug 14, 2019Updated 6 years ago
- Fortifying Toxic Speech Detectors Against Veiled Toxicity☆11Oct 21, 2020Updated 5 years ago
- ☆15Mar 20, 2025Updated 11 months ago
- Water wave models in one dimension☆10Feb 24, 2026Updated last week
- ☆11May 24, 2024Updated last year