Implementation of the dilated self attention as described in "LongNet: Scaling Transformers to 1,000,000,000 Tokens"
ā13Jul 23, 2023Updated 2 years ago
Alternatives and similar repositories for dilated-self-attention
Users that are interested in dilated-self-attention are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- š Fine-tune OpenAI models for text classification, question answering, and moreā17May 1, 2023Updated 2 years ago
- A RAG that can scale š§š»āš»ā11May 28, 2024Updated last year
- Model implementation for the contextual embeddings projectā47Jun 2, 2025Updated 10 months ago
- ā10Oct 2, 2024Updated last year
- YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddingsā13May 22, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways ⢠AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- AVCLASS++: Yet Another Massive Malware Labeling Toolā13Dec 7, 2019Updated 6 years ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.ā32Sep 19, 2025Updated 7 months ago
- š¤ HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)ā17Mar 20, 2024Updated 2 years ago
- A missing piece of the Python multitask (both threads and processes) API: An extension that supports stateful worker pools & size-aware iā¦ā29Mar 8, 2026Updated last month
- Starbucks: Improved Training for 2D Matryoshka Embeddingsā23Jun 30, 2025Updated 9 months ago
- A Python module for retrieving script types of writing systems including alphabets, abjads, abugidas, syllabaries, logographs, featurals ā¦ā15Jul 19, 2024Updated last year
- [NeurIPS 2024] šø GlotCC Dataset and Piplineā20Apr 6, 2025Updated last year
- A proposed standard `NOCK` for a Parquet format that supports efficient distributed serialization of multiple kinds of graph technologiesā21Oct 24, 2022Updated 3 years ago
- 𧬠Typescript Genetic Algorithm Framework built using denoā12Jun 7, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean ⢠AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Bleeding edge low level Rust binding for GGMLā17Jun 26, 2024Updated last year
- ā15Oct 31, 2023Updated 2 years ago
- Almost SOTA LLM architecture, with O(n) time complexityā11Jan 19, 2025Updated last year
- A transient UI for Cargo, Rust's package managerā11Dec 17, 2025Updated 4 months ago
- ā13Aug 10, 2024Updated last year
- ā20Oct 2, 2024Updated last year
- recreation of the classic drug trading game "dope wars"ā10May 9, 2019Updated 6 years ago
- [ICLR 2023] Eva: Practical Second-order Optimization with Kronecker-vectorized Approximationā12Jul 31, 2023Updated 2 years ago
- ā15Mar 16, 2026Updated last month
- Managed Database hosting by DigitalOcean ⢠AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Implementation of plug in and play Attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens"ā716Jan 7, 2024Updated 2 years ago
- Keyphrase Extraction Prototypesā15Nov 24, 2016Updated 9 years ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.ā47Jul 25, 2023Updated 2 years ago
- Rhythm analysis toolkit in Pythonā13Sep 29, 2023Updated 2 years ago
- Library for evaluating RAG using Nuclia's modelsā18Jul 31, 2024Updated last year
- Smart commit messagesā18Oct 25, 2024Updated last year
- Babel plugin for Regex+ā14Dec 16, 2025Updated 4 months ago
- The source code for the official documentation of PyScript.ā16Mar 2, 2026Updated last month
- Log-structured merge-tree implementation in Rustā19Nov 6, 2018Updated 7 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits ⢠AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- SMS gateway for sending text messagesā13Apr 9, 2023Updated 3 years ago
- š¤ Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.ā17Jun 5, 2025Updated 10 months ago
- š Modular retrievers for zero-shot multilingual IR.ā30Mar 6, 2024Updated 2 years ago
- Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Rankingā25Apr 4, 2025Updated last year
- ā10Mar 29, 2022Updated 4 years ago
- A collection of reusable, high-performance, well-documented, thorough-tested layers and models in Jaxā23Jun 8, 2025Updated 10 months ago
- INCOME: An Easy Repository for Training and Evaluation of Index Compression Methods in Dense Retrieval. Includes BPR and JPQ.ā24Sep 24, 2023Updated 2 years ago