A performance library for machine learning applications.
☆183Oct 12, 2023Updated 2 years ago
Alternatives and similar repositories for trident
Users that are interested in trident are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 언어모델을 학습하기 위한 공개 한국어 instruction dataset들을 모아두었습니다.☆19Jul 16, 2023Updated 2 years ago
- Beyond LM: How can language model go forward in the future?☆15Apr 30, 2023Updated 2 years ago
- Performant kernels for symmetric tensors☆16Aug 22, 2024Updated last year
- ☆56Nov 14, 2024Updated last year
- Official implementation of project Honeybee (CVPR 2024)☆467May 10, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆20Nov 6, 2024Updated last year
- ☆23Oct 30, 2023Updated 2 years ago
- Polyglot: Large Language Models of Well-balanced Competence in Multi-languages☆484Aug 22, 2023Updated 2 years ago
- Data processing system for polyglot☆93Sep 5, 2023Updated 2 years ago
- QUICK: Quantization-aware Interleaving and Conflict-free Kernel for efficient LLM inference☆120Mar 6, 2024Updated 2 years ago
- Source-to-Source Debuggable Derivatives in Pure Python☆15Jan 23, 2024Updated 2 years ago
- ☆90Mar 28, 2024Updated last year
- Study parallel programming - CUDA, OpenMP, MPI, Pthread☆64Jul 3, 2022Updated 3 years ago
- Standalone Nori (Korean Morphological Analyzer)☆42Sep 20, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"☆18Mar 15, 2024Updated 2 years ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32May 25, 2024Updated last year
- ☆19Sep 20, 2022Updated 3 years ago
- OSLO: Open Source for Large-scale Optimization☆175Sep 9, 2023Updated 2 years ago
- ☆55Nov 22, 2022Updated 3 years ago
- 삼각형의 실전! Triton☆16Feb 15, 2024Updated 2 years ago
- 한국어 LLM 리더보드 및 모델 성능/안전성 관리☆22Sep 26, 2023Updated 2 years ago
- OSLO: Open Source framework for Large-scale model Optimization☆309Aug 25, 2022Updated 3 years ago
- ☆26Dec 5, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Serving Example of CodeGen-350M-Mono-GPTJ on Triton Inference Server with Docker and Kubernetes☆20May 30, 2023Updated 2 years ago
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated☆34Aug 14, 2024Updated last year
- PyTorch implementation for PaLM: A Hybrid Parser and Language Model.☆10Jan 7, 2020Updated 6 years ago
- Large-scale language modeling tutorials with PyTorch☆293Nov 2, 2021Updated 4 years ago
- Efficient PScan implementation in PyTorch☆17Jan 2, 2024Updated 2 years ago
- Triton kernels for Flux☆22Jul 7, 2025Updated 8 months ago
- COYO-700M: Large-scale Image-Text Pair Dataset☆1,251Nov 30, 2022Updated 3 years ago
- #Paired Question☆24Jun 16, 2020Updated 5 years ago
- Introduction to Deep Learning☆82Nov 29, 2023Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- 금융 도메인에 특화된 한국어 임베딩 모델☆22Aug 8, 2024Updated last year
- ☆93Mar 3, 2022Updated 4 years ago
- KorNLI and KorSTS: New Benchmark Datasets for Korean Natural Language Understanding☆310Jul 9, 2023Updated 2 years ago
- [NeurIPS 2023] Sparse Modular Activation for Efficient Sequence Modeling☆40Dec 2, 2023Updated 2 years ago
- Curation note of NLP datasets☆98Dec 6, 2022Updated 3 years ago
- PyTorch implementation of a 1.3B text-to-image generation model trained on 14 million image-text pairs☆633Aug 9, 2022Updated 3 years ago
- Train 🤗transformers with DeepSpeed: ZeRO-2, ZeRO-3☆23May 20, 2021Updated 4 years ago