☆46May 24, 2025Updated last year
Alternatives and similar repositories for multi-latent-attention
Users that are interested in multi-latent-attention are comparing it to the libraries listed below.
- ☆45May 4, 2025Updated last year
- BERT explained from scratch☆18Oct 26, 2023Updated 2 years ago
- Martingale posterior neural networks for fast sequential decision making @ Neurips 2025☆25Nov 13, 2025Updated 7 months ago
- ☆32Mar 21, 2026Updated 2 months ago
- Code for data-aware compression of DeepSeek models☆75Dec 11, 2025Updated 6 months ago
- ☆67Nov 27, 2023Updated 2 years ago
- [CVPR 2024 CVinW] Multi-Agent VQA: Exploring Multi-Agent Foundation Models on Zero-Shot Visual Question Answering☆22Sep 21, 2024Updated last year
- Notes on Direct Preference Optimization☆28Apr 14, 2024Updated 2 years ago
- ☆10May 23, 2026Updated 3 weeks ago
- Official PyTorch implementation for "TensorLens: End-to-End Transformer Analysis via High-Order Attention Tensors" [ACL 2026]☆47Apr 14, 2026Updated 2 months ago
- Transformer + GAT for RNA chemical reactivity prediction | Stanford Ribonanza☆11Jan 28, 2026Updated 4 months ago
- A Hybrid Self-Cross Attention Network For Remote Sensing Change Detection☆16May 13, 2025Updated last year
- The docs about the crystaline coin are collected here!☆10May 12, 2025Updated last year
- 100 days of building GPU kernels!☆602Apr 27, 2025Updated last year
- [NeurIPS 2025 Spotlight] Implementation of "KLASS: KL-Guided Fast Inference in Masked Diffusion Models"☆32Jan 3, 2026Updated 5 months ago
- 2026 entry-level data science & ML jobs — analytics, AI, quant & machine learning US roles☆47Updated this week
- ☆11Sep 27, 2024Updated last year
- Movie Hangman Game - Jetpack Compose☆11Jan 24, 2025Updated last year
- Apply LLM and TTS to an RSS news reader☆16May 30, 2024Updated 2 years ago
- Code for the paper "Neural mechanisms of relational learning and fast knowledge reassembly in plastic neural networks" (Miconi & Kay, Nat…☆13May 28, 2025Updated last year
- Temporal Neural Networks☆30Mar 2, 2026Updated 3 months ago
- Resk is a robust Python library designed to enhance security and manage context when interacting with LLMs. It provides a protective …☆20Jun 6, 2026Updated last week
- Official code for PLoP☆20Mar 6, 2026Updated 3 months ago
- Minimal TPU implementation with 8x8 systolic array and PyTorch integration☆63Jan 26, 2026Updated 4 months ago
- ☆13May 23, 2024Updated 2 years ago
- A Qwen 0.5B reasoning model trained on OpenR1-Math-220k☆14Oct 11, 2025Updated 8 months ago
- ☆41Apr 9, 2025Updated last year
- Load and run Llama from safetensors files in C☆15Oct 24, 2024Updated last year
- ☆43Dec 15, 2025Updated 6 months ago
- Composition of Multimodal Language Models From Scratch☆15Aug 16, 2024Updated last year
- Minimalistic 4D-parallelism distributed training framework for educational purposes☆2,220Aug 26, 2025Updated 9 months ago
- ☆10Nov 27, 2023Updated 2 years ago
- Python version of "How to Build an Agent" by Thorsten Ball☆49Oct 30, 2025Updated 7 months ago
- Because it's there.☆16Sep 22, 2024Updated last year
- An Educational Framework Based on PyTorch for Deep Learning Education and Exploration☆11Dec 24, 2023Updated 2 years ago
- ☆15Mar 8, 2021Updated 5 years ago
- Deep Imputation for Skeleton data☆33May 26, 2026Updated 3 weeks ago
- Code Release for floq: Training Critics via Flow-Matching for Scaling Compute In Value-Based RL☆46Apr 7, 2026Updated 2 months ago
- A terminal dashboard for Pipecat☆54Mar 31, 2026Updated 2 months ago