FireFlyer Record file format, writer and reader for DL training samples.
☆241Dec 1, 2022Updated 3 years ago
Alternatives and similar repositories for ffrecord
Users that are interested in ffrecord are comparing it to the libraries listed below
Sorting:
- ☆12Feb 16, 2023Updated 3 years ago
- HFAI deep learning models☆162May 25, 2023Updated 2 years ago
- ☆42Jun 10, 2022Updated 3 years ago
- 一种任务级GPU算力分时调度的高性能深度学习训练平台☆739Oct 24, 2023Updated 2 years ago
- Official resporitory for "IPDPS' 24 QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices".☆20Feb 23, 2024Updated 2 years ago
- A high-performance distributed file system designed to address the challenges of AI training and inference workloads.☆9,730Feb 25, 2026Updated last week
- An external memory allocator example for PyTorch.☆16Aug 10, 2025Updated 6 months ago
- CVFusion is an open-source deep learning compiler to fuse the OpenCV operators.☆33Aug 31, 2022Updated 3 years ago
- Benchmarking Attention Mechanism in Vision Transformers.☆20Oct 10, 2022Updated 3 years ago
- Some microbenchmarks and design docs before commencement☆12Feb 1, 2021Updated 5 years ago
- Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.☆4,843Updated this week
- Dynamic Tensor Rematerialization prototype (modified PyTorch) and simulator. Paper: https://arxiv.org/abs/2006.09616☆133Jul 6, 2023Updated 2 years ago
- [ICLR 2020] Haotao Wang, Tianlong Chen, Zhangyang Wang, Kede Ma, "I Am Going MAD: Maximum Discrepancy Competition for Comparing Classifie…☆20Dec 30, 2021Updated 4 years ago
- This repo consist of some experimental results on bdd100k datasets using different object detection algorithms(Faster-RCNN, FCOS, ATSS)☆11Jun 27, 2020Updated 5 years ago
- A QA system based on k8s-specific knowledge build on ChatGLM2-6B, serving by Ray.☆10Sep 14, 2023Updated 2 years ago
- A lightweight data processing framework built on DuckDB and 3FS.☆4,931Mar 5, 2025Updated 11 months ago
- DeepEP: an efficient expert-parallel communication library☆9,005Feb 9, 2026Updated 3 weeks ago
- Cinrad WSR-98d level 1/2/3 data specification☆12Dec 5, 2025Updated 3 months ago
- GET3D with multi-nodes support☆11Oct 18, 2022Updated 3 years ago
- Kexplain is an interactive kubectl explain☆12Oct 23, 2023Updated 2 years ago
- Custom Scheduler to deploy ML models to TRTIS for GPU Sharing☆11Apr 1, 2020Updated 5 years ago
- Some commonly used functions and modules☆10Jan 15, 2024Updated 2 years ago
- Code for creating a 3D lidar map of Leslie St and Highway 7.☆10Jan 4, 2021Updated 5 years ago
- Depict GPU memory footprint during DNN training of PyTorch☆11Nov 17, 2022Updated 3 years ago
- my bachelor's thesis in SJTU about https://github.com/caicloud/cyclone☆12Jan 4, 2018Updated 8 years ago
- A distributed in-memory store for temporal knowledge graphs☆10Mar 20, 2024Updated last year
- A minimal TPU compatible Jax implementation of NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis.☆13Apr 21, 2022Updated 3 years ago
- A comprehensive overview of Data Distillation and Condensation (DDC). DDC is a data-centric task where a representative (i.e., small but …☆13Dec 1, 2022Updated 3 years ago
- Caffe++: assemble new features to enhance Caffe☕️☆11Dec 24, 2018Updated 7 years ago
- ☆12Jun 29, 2024Updated last year
- Simple PyTorch profiler that combines DeepSpeed Flops Profiler and TorchInfo☆11Feb 12, 2023Updated 3 years ago
- 2nd place solution of ECCV 2020 workshop VIPriors Image Classification Challenge, https://arxiv.org/abs/2008.00261☆13Aug 22, 2021Updated 4 years ago
- TVMScript kernel for deformable attention☆25Dec 15, 2021Updated 4 years ago
- Code for Self-and-Collaborative Attention Network from "SCAN: Self-and-Collaborative Attention Network for Video Person Re-identification…☆26Jun 1, 2019Updated 6 years ago
- Byted PyTorch Distributed for Hyperscale Training of LLMs and RLs☆938Nov 27, 2025Updated 3 months ago
- "SOLQ: Segmenting Objects by Learning Queries", SOLQ is an end-to-end instance segmentation framework with Transformer.☆200Apr 17, 2022Updated 3 years ago
- NVIDIA NVSHMEM is a parallel programming interface for NVIDIA GPUs based on OpenSHMEM. NVSHMEM can significantly reduce multi-process com…☆469Updated this week
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…☆164Jan 12, 2026Updated last month
- The official implementation of Instance As Identity: A Generic Online Paradigm for Video Instance Segmentation.☆17Sep 19, 2022Updated 3 years ago