A performance library for machine learning applications.
☆183Oct 12, 2023Updated 2 years ago
Alternatives and similar repositories for trident
Users that are interested in trident are comparing it to the libraries listed below
Sorting:
- 언어모델을 학습하기 위한 공개 한국어 instruction dataset들을 모아두었습니다.☆19Jul 16, 2023Updated 2 years ago
- Beyond LM: How can language model go forward in the future?☆15Apr 30, 2023Updated 2 years ago
- Source-to-Source Debuggable Derivatives in Pure Python☆15Jan 23, 2024Updated 2 years ago
- Performant kernels for symmetric tensors☆16Aug 22, 2024Updated last year
- Official implementation of project Honeybee (CVPR 2024)☆465May 10, 2024Updated last year
- Serving Example of CodeGen-350M-Mono-GPTJ on Triton Inference Server with Docker and Kubernetes☆20May 30, 2023Updated 2 years ago
- Polyglot: Large Language Models of Well-balanced Competence in Multi-languages☆484Aug 22, 2023Updated 2 years ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32May 25, 2024Updated last year
- Efficient PScan implementation in PyTorch☆17Jan 2, 2024Updated 2 years ago
- ☆23Oct 30, 2023Updated 2 years ago
- PyTorch implementation for PaLM: A Hybrid Parser and Language Model.☆10Jan 7, 2020Updated 6 years ago
- Advanced Formal Language Theory (263-5352-00L; Frühjahr 2023)☆10Feb 21, 2023Updated 3 years ago
- ☆19Sep 20, 2022Updated 3 years ago
- [NeurIPS 2023] Sparse Modular Activation for Efficient Sequence Modeling☆40Dec 2, 2023Updated 2 years ago
- Data processing system for polyglot☆93Sep 5, 2023Updated 2 years ago
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"☆18Mar 15, 2024Updated last year
- Dependency Parsing as Sequence Labeling with Python3+ and PyTorch1+ and MTL☆10Nov 21, 2019Updated 6 years ago
- Various DirectX12 examples.☆23Nov 30, 2020Updated 5 years ago
- ☆13Feb 7, 2023Updated 3 years ago
- Experimental implementation of OpenCL over Metal☆12Jul 20, 2022Updated 3 years ago
- 한국어 중의성 해소 평가 데이터 세트☆53Dec 29, 2025Updated 2 months ago
- ☆89Mar 28, 2024Updated last year
- High-performance vector search engine with no loss of accuracy through GPU and dynamic placement☆31Jul 12, 2025Updated 7 months ago
- ☁️ 구름(KULLM): 고려대학교에서 개발한, 한국어에 특화된 LLM☆589May 1, 2024Updated last year
- Train 🤗transformers with DeepSpeed: ZeRO-2, ZeRO-3☆23May 20, 2021Updated 4 years ago
- 삼각형의 실전! Triton☆16Feb 15, 2024Updated 2 years ago
- Chatbot using Tensorflow (Model is transformer) ko☆30Dec 10, 2018Updated 7 years ago
- ☆106May 8, 2023Updated 2 years ago
- OSLO: Open Source for Large-scale Optimization☆175Sep 9, 2023Updated 2 years ago
- #Paired Question☆24Jun 16, 2020Updated 5 years ago
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated☆33Aug 14, 2024Updated last year
- 금융 도메인에 특화된 한국어 임베딩 모델☆23Aug 8, 2024Updated last year
- FlexAttention w/ FlashAttention3 Support☆27Oct 5, 2024Updated last year
- Large-scale language modeling tutorials with PyTorch☆292Nov 2, 2021Updated 4 years ago
- [Google Meet] MLLM Arxiv Casual Talk☆52Mar 16, 2023Updated 2 years ago
- Standalone Nori (Korean Morphological Analyzer)☆42Sep 20, 2023Updated 2 years ago
- Triton kernels for Flux☆22Jul 7, 2025Updated 7 months ago
- 삼각형의 실전! CMake☆16Jun 27, 2024Updated last year
- Parallel Associative Scan for Language Models☆18Jan 8, 2024Updated 2 years ago