☆20Nov 23, 2022Updated 3 years ago
Alternatives and similar repositories for torchdynamo-tests
Users that are interested in torchdynamo-tests are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Using short models to classify long texts☆21Mar 8, 2023Updated 3 years ago
- Convenient Text-to-Text Training for Transformers☆18Dec 10, 2021Updated 4 years ago
- Code for the paper "The Surprising Computational Power of Nondeterministic Stack RNNs" (DuSell and Chiang, 2023)☆20Mar 21, 2024Updated 2 years ago
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆39Apr 11, 2024Updated 2 years ago
- NLP Examples using the 🤗 libraries☆40Feb 21, 2021Updated 5 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Learn Rust on AWS or Learn AWS with Rust. Do whatever you would like.☆28Sep 2, 2021Updated 4 years ago
- Various transformers for FSDP research☆38Nov 11, 2022Updated 3 years ago
- ☆24Jun 18, 2024Updated last year
- Learning PyTorch through the D2L book. A series of notebooks for the same☆28Jun 30, 2022Updated 3 years ago
- 데이터와 모델로 채우는 모두를 위한 AI 허브 가든☆36Jul 4, 2025Updated 10 months ago
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*☆87Dec 14, 2023Updated 2 years ago
- ☆34Apr 23, 2023Updated 3 years ago
- ☆34Feb 1, 2026Updated 3 months ago
- 트랜스포머 블록을 활용한 상품명 자연어처리 기반 카테고리 분류 모델☆10Dec 5, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Example of using next.js, nextauth.js and typescript for both anonymous sessions and authenticated sessions☆10Feb 6, 2024Updated 2 years ago
- Inverse Scaling in Test-Time Compute☆25Dec 3, 2025Updated 5 months ago
- [NAACL'25 🏆 SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert…☆16Feb 4, 2025Updated last year
- A Demo server serving Bert through ONNX with GPU written in Rust with <3☆42Jul 30, 2021Updated 4 years ago
- Use Actions to acquire those precious lambda GPUs☆19Sep 7, 2023Updated 2 years ago
- Verify conda recipes and packages☆22Aug 26, 2025Updated 8 months ago
- [ICML‘25] Official code for paper "Occult: Optimizing Collaborative Communication across Experts for Accelerated Parallel MoE Training an…☆13Apr 17, 2025Updated last year
- [ICML 2024] Code for the paper "MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts"☆10Jul 1, 2024Updated last year
- ☆10Dec 15, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- C++17 implementation of einops for libtorch - clear and reliable tensor manipulations with einstein-like notation☆11Oct 16, 2023Updated 2 years ago
- Example of Apolle Codegen with TypeScript and React☆14Dec 18, 2018Updated 7 years ago
- A framework for benchmarking embedding models in hybrid search scenarios (BM25 + vector search) using Weaviate.☆40Updated this week
- ☆21Mar 3, 2025Updated last year
- Einsum-like high-level array sharding API for JAX☆34Jul 16, 2024Updated last year
- Evaluate state-of-the-art sparse embedding models on the LIMIT dataset (`limit-small` and `limit`) from google's paper `On the Theoretica…☆16Sep 4, 2025Updated 8 months ago
- ☆34May 14, 2025Updated last year
- Black for Python docstrings and reStructuredText (rst).☆18Apr 7, 2023Updated 3 years ago
- Prototype routines for GPU quantization written using PyTorch.☆21Apr 15, 2026Updated last month
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Source code and data of our paper "Missing Counter-Evidence Renders NLP Fact-Checking Unrealistic for Misinformation" (https://arxiv.org/…☆10Jun 21, 2023Updated 2 years ago
- Better Live Text for MacOS☆36Feb 8, 2026Updated 3 months ago
- Official Implementation of our ICML 2025 paper: "D-MoLE: Dynamic Mixture of Curriculum LoRA Experts for Continual Multimodal Instruction …☆27Jan 11, 2026Updated 4 months ago
- [Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation☆11May 27, 2022Updated 3 years ago
- An extention to the GaLore paper, to perform Natural Gradient Descent in low rank subspace☆19Oct 21, 2024Updated last year
- ☆14Jul 7, 2024Updated last year
- An open collection of implementation tips, tricks and resources for training large language models☆499Mar 8, 2023Updated 3 years ago