Aurora optimizer release
☆139 · May 27, 2026 · Updated last week
Alternatives and similar repositories for aurora-release
Users that are interested in aurora-release are comparing it to the libraries listed below.
- CODA: Rewriting Transformer Blocks as GEMM-Epilogue Programs ☆193 · May 22, 2026 · Updated 2 weeks ago
- Engine for collecting, uploading, and downloading model activations ☆28 · Apr 2, 2025 · Updated last year
- Large DNNs training framework for consumer GPUs ☆86 · Jun 1, 2026 · Updated last week
- helper functions for processing and integrating visual language information with Qwen-VL Series Model ☆17 · Aug 30, 2024 · Updated last year
- ☆33 · Dec 31, 2025 · Updated 5 months ago
- Testing paligemma2 finetuning on reasoning dataset ☆18 · Dec 28, 2024 · Updated last year
- ☆11 · Jun 17, 2024 · Updated last year
- HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023] ☆14 · Jul 11, 2023 · Updated 2 years ago
- ☆47 · Dec 13, 2025 · Updated 5 months ago
- Repository for "Training Language Models To Explain Their Own Computations" ☆22 · Dec 22, 2025 · Updated 5 months ago
- Reinforcement Learning for Fault-Tolerant Quantum Circuit Discovery ☆17 · Jan 16, 2026 · Updated 4 months ago
- ☆11 · Jun 2, 2022 · Updated 4 years ago
- ☆10 · Dec 17, 2020 · Updated 5 years ago
- Official implementation of Déjà View: Looping Transformers for Multi-View 3D Reconstruction ☆160 · Jun 1, 2026 · Updated last week
- ☆10 · Oct 2, 2024 · Updated last year
- ☆13 · Jun 16, 2021 · Updated 4 years ago
- decontamination ☆33 · Mar 4, 2026 · Updated 3 months ago
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models ☆11 · Jan 19, 2024 · Updated 2 years ago
- ☆125 · May 20, 2026 · Updated 2 weeks ago
- Model-free policy gradient algorithm for LQR ☆10 · Apr 8, 2020 · Updated 6 years ago
- ☆248 · Nov 19, 2025 · Updated 6 months ago
- The official code of "PixelWorld: Towards Perceiving Everything as Pixels" [TMLR25] ☆16 · Sep 12, 2025 · Updated 8 months ago
- Linear Attention for Efficient Bidirectional Sequence Modeling ☆16 · May 13, 2025 · Updated last year
- Official Repo of Paper "Eliminating Position Bias of Language Models: A Mechanistic Approach" ☆23 · Jun 13, 2025 · Updated 11 months ago
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks ☆12 · Nov 9, 2021 · Updated 4 years ago
- Basic sample for starting with WebGPU development. ☆13 · Jul 17, 2023 · Updated 2 years ago
- Compute free-support Wasserstein barycenters exactly ☆10 · Aug 22, 2024 · Updated last year
- ☆52 · Mar 31, 2026 · Updated 2 months ago
- ☆10 · Jun 8, 2024 · Updated last year
- docker for HF wav2vec2-sprint ☆13 · Mar 26, 2021 · Updated 5 years ago
- [ACL 2025] 🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment ☆11 · Apr 6, 2025 · Updated last year
- ☆13 · Feb 7, 2023 · Updated 3 years ago
- [WWW 2026] 🕸 GlotWeb: Web Indexing for Minority Languages ☆17 · Apr 14, 2026 · Updated last month
- ☆21 · Dec 5, 2022 · Updated 3 years ago
- ☆13 · Jun 7, 2020 · Updated 6 years ago
- ☆16 · May 14, 2024 · Updated 2 years ago
- ☆10 · Sep 13, 2022 · Updated 3 years ago
- IsoBN: Fine-Tuning BERT with Isotropic Batch Normalization ☆12 · Nov 23, 2021 · Updated 4 years ago
- ☆17 · May 31, 2023 · Updated 3 years ago