☆20Nov 23, 2022Updated 3 years ago
Alternatives and similar repositories for torchdynamo-tests
Users that are interested in torchdynamo-tests are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Jan 2, 2022Updated 4 years ago
- Using short models to classify long texts☆21Mar 8, 2023Updated 3 years ago
- Convenient Text-to-Text Training for Transformers☆19Dec 10, 2021Updated 4 years ago
- Code for the paper "The Surprising Computational Power of Nondeterministic Stack RNNs" (DuSell and Chiang, 2023)☆19Mar 21, 2024Updated 2 years ago
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆39Apr 11, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A tracing JIT compiler for PyTorch☆14Dec 11, 2021Updated 4 years ago
- Learn Rust on AWS or Learn AWS with Rust. Do whatever you would like.☆28Sep 2, 2021Updated 4 years ago
- Various transformers for FSDP research☆38Nov 11, 2022Updated 3 years ago
- ☆23Jun 18, 2024Updated last year
- ☆13Mar 27, 2020Updated 6 years ago
- Learning PyTorch through the D2L book. A series of notebooks for the same☆28Jun 30, 2022Updated 3 years ago
- Rust wrapper for Microsoft's ONNX Runtime with CUDA support (version 1.7)☆24Jul 3, 2022Updated 3 years ago
- ☆15Oct 24, 2023Updated 2 years ago
- 데이터와 모델로 채우는 모두를 위한 AI 허브 가든☆35Jul 4, 2025Updated 9 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*☆87Dec 14, 2023Updated 2 years ago
- ☆34Apr 23, 2023Updated 2 years ago
- Inverse Scaling in Test-Time Compute☆25Dec 3, 2025Updated 4 months ago
- Example of using next.js, nextauth.js and typescript for both anonymous sessions and authenticated sessions☆10Feb 6, 2024Updated 2 years ago
- [NAACL'25 🏆 SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert…☆16Feb 4, 2025Updated last year
- A Demo server serving Bert through ONNX with GPU written in Rust with <3☆42Jul 30, 2021Updated 4 years ago
- Use Actions to acquire those precious lambda GPUs☆19Sep 7, 2023Updated 2 years ago
- [ICML‘25] Official code for paper "Occult: Optimizing Collaborative Communication across Experts for Accelerated Parallel MoE Training an…☆13Apr 17, 2025Updated 11 months ago
- C++17 implementation of einops for libtorch - clear and reliable tensor manipulations with einstein-like notation☆11Oct 16, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆10Dec 15, 2022Updated 3 years ago
- A framework for benchmarking embedding models in hybrid search scenarios (BM25 + vector search) using Weaviate.☆38Updated this week
- ☆21Mar 3, 2025Updated last year
- Einsum-like high-level array sharding API for JAX☆34Jul 16, 2024Updated last year
- Evaluate state-of-the-art sparse embedding models on the LIMIT dataset (`limit-small` and `limit`) from google's paper `On the Theoretica…☆16Sep 4, 2025Updated 7 months ago
- Prototype routines for GPU quantization written using PyTorch.☆21Updated this week
- Source code and data of our paper "Missing Counter-Evidence Renders NLP Fact-Checking Unrealistic for Misinformation" (https://arxiv.org/…☆10Jun 21, 2023Updated 2 years ago
- [Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation☆11May 27, 2022Updated 3 years ago
- ☆16Aug 10, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆14Jul 7, 2024Updated last year
- An open collection of implementation tips, tricks and resources for training large language models☆496Mar 8, 2023Updated 3 years ago
- Leverage your LangChain trace data for fine tuning☆46Aug 2, 2024Updated last year
- Companion Repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborations☆14Aug 16, 2022Updated 3 years ago
- Helper scripts and notes that were used while porting various nlp models☆50Mar 22, 2022Updated 4 years ago
- Inference API server with echo and gRPC to triton server (golang)☆13Nov 16, 2022Updated 3 years ago
- Official implementation for "Mixture of In-Context Experts Enhance LLMs’ Awareness of Long Contexts" (Accepted by Neurips2024)☆14Jan 7, 2025Updated last year