nate-gillman / fourier-head
Official implementation of "Fourier Head: Helping Large Language Models Learn Complex Probability Distributions" (ICLR 2025)
☆64 Updated 3 months ago
Alternatives and similar repositories for fourier-head
Users who are interested in fourier-head compare it to the libraries listed below.
- Kolmogorov–Arnold Networks with modified activation (using MLP to represent the activation) ☆105 Updated 8 months ago
- ☆60 Updated 3 months ago
- Gradient Boosting Reinforcement Learning (GBRL) ☆116 Updated last month
- Code for SpaceTime 🌌⏱️. Proposed in Effectively Modeling Time Series with Simple Discrete State Spaces, ICLR 2023. ☆175 Updated 2 years ago
- Code repository for Trajectory Flow Matching ☆72 Updated 8 months ago
- The official code 👩‍💻 for - TOTEM: TOkenized Time Series EMbeddings for General Time Series Analysis ☆331 Updated 4 months ago
- PyTorch Code for Energy-Based Transformers paper -- generalizable reasoning and scalable learning ☆224 Updated last week
- This code implements a Radial Basis Function (RBF) based Kolmogorov-Arnold Network (KAN) for function approximation. ☆29 Updated last year
- This repository contains a better implementation of Kolmogorov-Arnold networks ☆62 Updated last month
- A simple example of VAEs with KANs ☆12 Updated last year
- PyTorch implementation of Structured State Space for Sequence Modeling (S4), based on Annotated S4. ☆82 Updated last year
- ☆31 Updated last year
- Official source code for "Graph Neural Networks for Learning Equivariant Representations of Neural Networks". In ICLR 2024 (oral). ☆81 Updated 11 months ago
- A State-Space Model with Rational Transfer Function Representation. ☆79 Updated last year
- my attempts at implementing various bits of Sepp Hochreiter's new xLSTM architecture ☆129 Updated last year
- Tabular In-Context Learning ☆78 Updated 4 months ago
- ☆78 Updated last year
- The official repository for HyperZ⋅Z⋅W Operator Connects Slow-Fast Networks for Full Context Interaction. ☆38 Updated 3 months ago
- A More Fair and Comprehensive Comparison between KAN and MLP ☆171 Updated 10 months ago
- Kolmogorov-Arnold Networks with various basis functions like B-Splines, Fourier, Chebyshev, Wavelets etc ☆35 Updated last year
- Graph neural networks in JAX. ☆67 Updated last year
- Training small GPT-2 style models using Kolmogorov-Arnold networks. ☆120 Updated last year
- Conformal Prediction for Time Series with Modern Hopfield Networks ☆79 Updated last month
- Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple r… ☆86 Updated this week
- The Gaussian Histogram Loss (HL-Gauss) proposed by Imani et al. with a few convenient wrappers for regression, in Pytorch ☆65 Updated last month
- Patched Attention for Nonlinear Dynamics ☆150 Updated 2 weeks ago
- Context is Key: A Benchmark for Forecasting with Essential Textual Information ☆66 Updated this week
- Diffusion model derived evolutionary algorithm ☆214 Updated last month
- Official JAX implementation of xLSTM including fast and efficient training and inference code. 7B model available at https://huggingface.… ☆97 Updated 6 months ago
- Explorations into whether a transformer with RL can direct a genetic algorithm to converge faster ☆70 Updated last month