Implementation of the dilated self attention as described in "LongNet: Scaling Transformers to 1,000,000,000 Tokens"
β13Jul 23, 2023Updated 2 years ago
Alternatives and similar repositories for dilated-self-attention
Users that are interested in dilated-self-attention are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π Fine-tune OpenAI models for text classification, question answering, and moreβ17May 1, 2023Updated 3 years ago
- Model implementation for the contextual embeddings projectβ47Jun 2, 2025Updated last year
- β10Oct 2, 2024Updated last year
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numbaβ38Oct 16, 2025Updated 8 months ago
- YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddingsβ13May 22, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- π€ Trade any tensors over the networkβ31Sep 27, 2023Updated 2 years ago
- π€ HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)β17Mar 20, 2024Updated 2 years ago
- Sparse Embedding Compression for Scalable Retrieval in Recommender Systemsβ36Nov 21, 2025Updated 6 months ago
- Semantically Search Emojis From the Command Line!β13Nov 26, 2023Updated 2 years ago
- πAutomatically Update CV Papers Daily using Github Actions (Update Every 12th hours)β12May 17, 2026Updated last month
- Starbucks: Improved Training for 2D Matryoshka Embeddingsβ23Jun 30, 2025Updated 11 months ago
- A Python module for retrieving script types of writing systems including alphabets, abjads, abugidas, syllabaries, logographs, featurals β¦β15Jul 19, 2024Updated last year
- [NeurIPS 2024] πΈ GlotCC Dataset and Piplineβ20Apr 6, 2025Updated last year
- A proposed standard `NOCK` for a Parquet format that supports efficient distributed serialization of multiple kinds of graph technologiesβ21Apr 27, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 𧬠Typescript Genetic Algorithm Framework built using denoβ12Jun 7, 2022Updated 4 years ago
- Bleeding edge low level Rust binding for GGMLβ17Jun 26, 2024Updated last year
- Investigation into whether Transformers and self-supervised learning could be used to trade currency marketsβ10Jun 21, 2023Updated 2 years ago
- β15Oct 31, 2023Updated 2 years ago
- CFinBench: A Comprehensive Chinese Financial Benchmark for Large Language Modelsβ16Oct 14, 2024Updated last year
- Developing, training, and assessing the performance of a Proximal Policy Optimization (PPO) Stock Trading Agent.β14Aug 20, 2025Updated 9 months ago
- [CoLM 24] Official Repository of MambaByte: Token-free Selective State Space Modelβ27Oct 12, 2024Updated last year
- Lazily and automatically execute emacs config codeβ10Jan 15, 2017Updated 9 years ago
- recreation of the classic drug trading game "dope wars"β10May 9, 2019Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A small spreadsheet demo in Rust, Yew, and WASMβ11Jun 16, 2023Updated 3 years ago
- [ICLR 2023] Eva: Practical Second-order Optimization with Kronecker-vectorized Approximationβ12Jul 31, 2023Updated 2 years ago
- β16Mar 16, 2026Updated 3 months ago
- Implementation of plug in and play Attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens"β720Jan 7, 2024Updated 2 years ago
- Keyphrase Extraction Prototypesβ15Nov 24, 2016Updated 9 years ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.β48Jul 25, 2023Updated 2 years ago
- Rhythm analysis toolkit in Pythonβ13Sep 29, 2023Updated 2 years ago
- Library for evaluating RAG using Nuclia's modelsβ18Jul 31, 2024Updated last year
- β13May 6, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- possibly useful materials for learning RWKV language model.β26Jun 8, 2023Updated 3 years ago
- π Modular retrievers for zero-shot multilingual IR.β30Mar 6, 2024Updated 2 years ago
- Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Rankingβ25Apr 4, 2025Updated last year
- A collection of reusable, high-performance, well-documented, thorough-tested layers and models in Jaxβ24Jun 8, 2025Updated last year
- INCOME: An Easy Repository for Training and Evaluation of Index Compression Methods in Dense Retrieval. Includes BPR and JPQ.β24Sep 24, 2023Updated 2 years ago
- β41May 28, 2026Updated 3 weeks ago
- β15Oct 28, 2020Updated 5 years ago