☆20Nov 23, 2022Updated 3 years ago
Alternatives and similar repositories for torchdynamo-tests
Users that are interested in torchdynamo-tests are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Using short models to classify long texts☆21Mar 8, 2023Updated 3 years ago
- Code for the paper "The Surprising Computational Power of Nondeterministic Stack RNNs" (DuSell and Chiang, 2023)☆19Mar 21, 2024Updated 2 years ago
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆39Apr 11, 2024Updated last year
- A tracing JIT compiler for PyTorch☆13Dec 11, 2021Updated 4 years ago
- NLP Examples using the 🤗 libraries☆40Feb 21, 2021Updated 5 years ago
- ☆23Jun 18, 2024Updated last year
- Learning PyTorch through the D2L book. A series of notebooks for the same☆28Jun 30, 2022Updated 3 years ago
- Rust wrapper for Microsoft's ONNX Runtime with CUDA support (version 1.7)☆24Jul 3, 2022Updated 3 years ago
- ☆15Oct 24, 2023Updated 2 years ago
- 데이터와 모델로 채우는 모두를 위한 AI 허브 가든☆35Jul 4, 2025Updated 8 months ago
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*☆87Dec 14, 2023Updated 2 years ago
- ☆34Apr 23, 2023Updated 2 years ago
- 트랜스포머 블록을 활용한 상품명 자연어처리 기반 카테고리 분류 모델☆10Dec 5, 2022Updated 3 years ago
- Inverse Scaling in Test-Time Compute☆25Dec 3, 2025Updated 3 months ago
- [ICML‘25] Official code for paper "Occult: Optimizing Collaborative Communication across Experts for Accelerated Parallel MoE Training an…☆13Apr 17, 2025Updated 11 months ago
- [ICML 2024] Code for the paper "MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts"☆10Jul 1, 2024Updated last year
- A framework for benchmarking embedding models in hybrid search scenarios (BM25 + vector search) using Weaviate.☆38Mar 14, 2026Updated last week
- Example of Apolle Codegen with TypeScript and React☆14Dec 18, 2018Updated 7 years ago
- Einsum-like high-level array sharding API for JAX☆34Jul 16, 2024Updated last year
- ☆34May 14, 2025Updated 10 months ago
- Source code and data of our paper "Missing Counter-Evidence Renders NLP Fact-Checking Unrealistic for Misinformation" (https://arxiv.org/…☆10Jun 21, 2023Updated 2 years ago
- [Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation☆11May 27, 2022Updated 3 years ago
- Better Live Text for MacOS☆33Feb 8, 2026Updated last month
- ☆14Jul 7, 2024Updated last year
- Leverage your LangChain trace data for fine tuning☆46Aug 2, 2024Updated last year
- An open collection of implementation tips, tricks and resources for training large language models☆496Mar 8, 2023Updated 3 years ago
- experiments with inference on llama☆103Jun 6, 2024Updated last year
- Gatsby.js starter with TypeScript and Contentful☆14Jul 8, 2022Updated 3 years ago
- Helper scripts and notes that were used while porting various nlp models☆50Mar 22, 2022Updated 4 years ago
- Companion Repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborations☆14Aug 16, 2022Updated 3 years ago
- A composite GitHub Action to login to the HuggingFace Hub☆15Feb 4, 2023Updated 3 years ago
- Official implementation for "Mixture of In-Context Experts Enhance LLMs’ Awareness of Long Contexts" (Accepted by Neurips2024)☆13Jan 7, 2025Updated last year
- ☆15Updated this week
- Automatically derive Python dunder methods for your Rust code☆25Jan 28, 2026Updated last month
- Experiment with OpenAI Whisper on Indonesian Languages☆14Feb 24, 2023Updated 3 years ago
- Code of fine-tuning neural sparse models and training from scratch. #SIGIR2025☆24Mar 11, 2026Updated last week
- ☆16Dec 13, 2020Updated 5 years ago
- ☆28Jan 17, 2025Updated last year
- Collection of python scripts to demonstrate asynchronous programming in python☆11May 22, 2022Updated 3 years ago