☆46May 24, 2025Updated last year
Alternatives and similar repositories for multi-latent-attention
Users that are interested in multi-latent-attention are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆45May 4, 2025Updated last year
- ☆14Apr 7, 2025Updated last year
- ☆32Mar 21, 2026Updated 2 months ago
- ☆21Apr 3, 2026Updated last month
- The open source implementation of the multi grouped query attention by the paper "GQA: Training Generalized Multi-Query Transformer Model…☆16Dec 11, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Notes on Direct Preference Optimization☆26Apr 14, 2024Updated 2 years ago
- ☆67Nov 27, 2023Updated 2 years ago
- video description generation vision-language model☆22Jan 21, 2025Updated last year
- Ingestion pipeline for blr.today☆13Updated this week
- My approach to Google foobar challenges.☆12Jun 27, 2020Updated 5 years ago
- Playground for Kotlin Flows and Channels☆30Oct 6, 2020Updated 5 years ago
- ☆251Jan 2, 2025Updated last year
- Official PyTorch implementation for "TensorLens: End-to-End Transformer Analysis via High-Order Attention Tensors" [ACL 2026]☆47Apr 14, 2026Updated last month
- Course Material for the Tutorial on Privacy Enhancing Technologies and PPML☆13Oct 29, 2021Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Code repository for the paper "MrT5: Dynamic Token Merging for Efficient Byte-level Language Models."☆57Sep 25, 2025Updated 8 months ago
- repo of paper implementations☆20Feb 25, 2025Updated last year
- Resources, by the people, of the people, for the people.☆14Apr 24, 2021Updated 5 years ago
- ☆17Jun 6, 2024Updated last year
- A Hybrid Self-Cross Attention Network For Remote Sensing Change Detection☆16May 13, 2025Updated last year
- This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mast…☆451Feb 22, 2025Updated last year
- 100 days of building GPU kernels!☆598Apr 27, 2025Updated last year
- [NeurIPS 2025 Spotlight] Implementation of "KLASS: KL-Guided Fast Inference in Masked Diffusion Models"☆32Jan 3, 2026Updated 4 months ago
- 2026 entry-level data science & ML jobs — analytics, AI, quant & machine learning US roles☆44Updated this week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆11Sep 27, 2024Updated last year
- Advanced Dropout: A Model-free Methodology for Bayesian Dropout Optimization (IEEE TPAMI 2021)☆17Jun 4, 2021Updated 4 years ago
- code for the paper, "flexible multitask computation in recurrent networks utilizes shared dynamical motifs"☆12Mar 2, 2025Updated last year
- Movie Hangman Game - Jetpack Compose☆11Jan 24, 2025Updated last year
- Apply LLM and TTS for a rss news reader☆16May 30, 2024Updated last year
- Temporal Neural Networks☆30Mar 2, 2026Updated 2 months ago
- Resk is a robust Python library designed to enhance security and manage context when interacting with LLMs. It provides a protective …☆19Apr 13, 2026Updated last month
- Learn CUDA with PyTorch☆303May 13, 2026Updated 2 weeks ago
- TypeScript AI "code mode" toolkit with permissions and search☆63May 1, 2026Updated 3 weeks ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Official code for PLoP☆20Mar 6, 2026Updated 2 months ago
- A straightforward method to reduce your LLM inference API costs and token usage.☆24May 18, 2025Updated last year
- TabLeak: Tabular Data Leakage in Federated Learning☆17Jul 4, 2024Updated last year
- Benchmark your GPU with ease☆30Dec 27, 2025Updated 5 months ago
- Fast and easy distributed model training examples.☆12Nov 26, 2024Updated last year
- Load and run Llama from safetensors files in C☆15Oct 24, 2024Updated last year
- ☆28Apr 7, 2025Updated last year