Implementation of the dilated self attention as described in "LongNet: Scaling Transformers to 1,000,000,000 Tokens"
☆13Jul 23, 2023Updated 2 years ago
Alternatives and similar repositories for dilated-self-attention
Users that are interested in dilated-self-attention are comparing it to the libraries listed below
Sorting:
- 🚂 Fine-tune OpenAI models for text classification, question answering, and more☆17May 1, 2023Updated 2 years ago
- Model implementation for the contextual embeddings project☆41Jun 2, 2025Updated 9 months ago
- 🤝 Trade any tensors over the network☆31Sep 27, 2023Updated 2 years ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆32Sep 19, 2025Updated 5 months ago
- Library for Excel-like calculations with some additional features like Calculation Graph and Custom Functions.☆10Jan 21, 2016Updated 10 years ago
- ☆18Updated this week
- ☆15Oct 24, 2023Updated 2 years ago
- ☆10Oct 2, 2024Updated last year
- A RAG that can scale 🧑🏻💻☆11May 28, 2024Updated last year
- Python wrapper for the energy system optimization framework IESopt.☆18Mar 2, 2026Updated last week
- Multitouch gestures on X11, Linux☆10Nov 22, 2015Updated 10 years ago
- ☆10May 9, 2016Updated 9 years ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆47Jul 25, 2023Updated 2 years ago
- ☆14Nov 12, 2025Updated 3 months ago
- Bridging Large Language Models with Scala 3 Functions☆11Aug 31, 2024Updated last year
- Repository for sample Windows applications and tools that use eye tracking☆21Aug 18, 2022Updated 3 years ago
- A Kafka mirroring service based on Akka Streams Kafka☆10Jan 30, 2022Updated 4 years ago
- ☆10Jan 25, 2022Updated 4 years ago
- ☆11Apr 17, 2023Updated 2 years ago
- A visualization tool to support reviewing the scientific literature☆14Jun 2, 2018Updated 7 years ago
- Ruby wrapper for the arXiv API☆27Apr 30, 2024Updated last year
- Almost SOTA LLM architecture, with O(n) time complexity☆11Jan 19, 2025Updated last year
- Debug as an Effect (DaaE)☆10Apr 22, 2025Updated 10 months ago
- Omgrofl interpreter☆16Oct 1, 2020Updated 5 years ago
- A small framework for web apps using http4s+tapir+laminar. Currently for personal use but may grow into a thing later down the line.☆11Updated this week
- A universal adapter including zero-copy Python bindings for Philip Turner's metal flash attention library.☆24Dec 15, 2025Updated 2 months ago
- Data used in Climate Indicator Project figures and tables☆15Jun 26, 2025Updated 8 months ago
- s_mach.datadiff is an open-source data difference engine for Scala. Implementations of the DataDiff type-class are provided which can com…☆10Feb 3, 2017Updated 9 years ago
- Developing, training, and assessing the performance of a Proximal Policy Optimization (PPO) Stock Trading Agent.☆14Aug 20, 2025Updated 6 months ago
- Efficient kernel for RMS normalization with fused operations, includes both forward and backward passes, compatibility with PyTorch.☆12Jun 5, 2024Updated last year
- ☆10Jun 15, 2018Updated 7 years ago
- ☆10Nov 29, 2018Updated 7 years ago
- Experiments on Linking the Nodes of a Music Notation Graph (MuNG) with Deep Learning.☆12May 16, 2021Updated 4 years ago
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆36Oct 16, 2025Updated 4 months ago
- ☆11Feb 4, 2024Updated 2 years ago
- LCA as Code - Domain-Specific Language for Life-Cycle Analysis☆15Oct 1, 2025Updated 5 months ago
- Generate back-links in Evernote notes☆10Feb 5, 2018Updated 8 years ago
- Rust widget toolkit built on Reclutch☆11Mar 25, 2020Updated 5 years ago
- Resources used by all of the autometrics implementations☆14Dec 5, 2023Updated 2 years ago