☆46May 24, 2025Updated 11 months ago
Alternatives and similar repositories for multi-latent-attention
Users that are interested in multi-latent-attention are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- BERT explained from scratch☆17Oct 26, 2023Updated 2 years ago
- Contrastive Reinforcement Learning☆63Apr 4, 2026Updated last month
- The open source implementation of the multi grouped query attention by the paper "GQA: Training Generalized Multi-Query Transformer Model…☆16Dec 11, 2023Updated 2 years ago
- ☆31Apr 29, 2026Updated last week
- Notes on Direct Preference Optimization☆25Apr 14, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆67Nov 27, 2023Updated 2 years ago
- video description generation vision-language model☆21Jan 21, 2025Updated last year
- My approach to Google foobar challenges.☆12Jun 27, 2020Updated 5 years ago
- Official PyTorch implementation for "TensorLens: End-to-End Transformer Analysis via High-Order Attention Tensors" [ACL 2026]☆46Apr 14, 2026Updated 3 weeks ago
- ☆10Apr 21, 2025Updated last year
- ☆10Feb 27, 2026Updated 2 months ago
- ☆253Jan 2, 2025Updated last year
- Seldon Core Operator for Kubernetes☆13Nov 5, 2019Updated 6 years ago
- Course Material for the Tutorial on Privacy Enhancing Technologies and PPML☆13Oct 29, 2021Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- GO-JEK Challenge☆11Jul 9, 2018Updated 7 years ago
- ☆15May 11, 2025Updated 11 months ago
- Resources, by the people, of the people, for the people.☆14Apr 24, 2021Updated 5 years ago
- simple NMT With Attention For Arabic to English☆11Mar 5, 2022Updated 4 years ago
- ☆18Oct 22, 2024Updated last year
- This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mast…☆450Feb 22, 2025Updated last year
- 100 days of building GPU kernels!☆596Apr 27, 2025Updated last year
- [NeurIPS 2025 Spotlight] Implementation of "KLASS: KL-Guided Fast Inference in Masked Diffusion Models"☆31Jan 3, 2026Updated 4 months ago
- 2026 entry-level data science & ML jobs — analytics, AI, quant & machine learning US roles☆40May 1, 2026Updated last week
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Advanced Dropout: A Model-free Methodology for Bayesian Dropout Optimization (IEEE TPAMI 2021)☆17Jun 4, 2021Updated 4 years ago
- ☆11Sep 27, 2024Updated last year
- Implementation of Dat2Vec2.0 for vision☆18Feb 6, 2023Updated 3 years ago
- Temporal Neural Networks☆29Mar 2, 2026Updated 2 months ago
- Using Generative Adversarial Networks (GANs) to produce awesome looking fantasy maps☆14May 7, 2021Updated 5 years ago
- TypeScript AI "code mode" toolkit with permissions and search☆63May 1, 2026Updated last week
- Python version of "How to Build an Agent" by Thorsten Ball☆45Oct 30, 2025Updated 6 months ago
- A straightforward method to reduce your LLM inference API costs and token usage.☆24May 18, 2025Updated 11 months ago
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Oct 11, 2025Updated 6 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- TabLeak: Tabular Data Leakage in Federated Learning☆17Jul 4, 2024Updated last year
- A Beginner's Guide to Monetizing Your Python AI Chatbot☆16Apr 22, 2025Updated last year
- Fast and easy distributed model training examples.☆12Nov 26, 2024Updated last year
- Load and run Llama from safetensors files in C☆15Oct 24, 2024Updated last year
- ☆40Apr 9, 2025Updated last year
- ☆29Apr 7, 2025Updated last year
- Composition of Multimodal Language Models From Scratch☆15Aug 16, 2024Updated last year