Transformer related optimization, including BERT, GPT
☆39Feb 10, 2023Updated 3 years ago
Alternatives and similar repositories for FasterTransformer
Users that are interested in FasterTransformer are comparing it to the libraries listed below
Sorting:
- A more efficient GLM implementation!☆54Feb 18, 2023Updated 3 years ago
- ☆57Oct 17, 2023Updated 2 years ago
- ☆22Jul 11, 2023Updated 2 years ago
- Revision of official yolov7-pose to support custom dataset for keypoint detection☆11Nov 12, 2023Updated 2 years ago
- Transformer related optimization, including BERT, GPT☆59Sep 20, 2023Updated 2 years ago
- ☆31Apr 19, 2025Updated 10 months ago
- optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052☆478Mar 15, 2024Updated last year
- realize the reinforcement learning training for gpt2 llama bloom and so on llm model☆27Sep 19, 2023Updated 2 years ago
- ☆413Nov 11, 2023Updated 2 years ago
- HealthiVert-GAN, a novel deep-learning framework designed to generate pseudo-healthy vertebral images. These images simulate the pre-frac…☆11Nov 3, 2025Updated 3 months ago
- ☆12Sep 25, 2023Updated 2 years ago
- [ICML 2025] RocketKV: Accelerating Long-Context LLM Inference via Two-Stage KV Cache Compression☆33Aug 7, 2025Updated 6 months ago
- Streamlit apps on Cloud Run with Identity-Aware Proxy (IAP).☆10Mar 5, 2022Updated 3 years ago
- ANnotation-based ANalysis of Specific Interactions☆10Oct 10, 2025Updated 4 months ago
- Supporting material for Princeton ORF307☆12Jan 14, 2026Updated last month
- ☆10Nov 28, 2022Updated 3 years ago
- Code Llama GGUF Demo☆10Aug 28, 2023Updated 2 years ago
- Long Context Research☆26Jan 26, 2026Updated last month
- 基于大语言模型的自动综述生成\nAutomatic Review Generation Method based on Large Language Models☆17Jun 22, 2025Updated 8 months ago
- This code is for converting COCO json annotations to YOLO txt format (which both are common in object detection projects).☆10Feb 19, 2024Updated 2 years ago
- ☆28Jan 5, 2026Updated last month
- A pathway and collection of resources to learning Jax from beginning to advance.☆11Jan 2, 2021Updated 5 years ago
- Jupyter notebook containing code from text preprocessing blog post☆10Nov 29, 2016Updated 9 years ago
- ☆11Feb 25, 2024Updated 2 years ago
- Implementation of SoundtStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆13Jan 27, 2025Updated last year
- Course repository for the Spring 2023 COMP664 course "Deep Learning" at UNC☆14Apr 17, 2023Updated 2 years ago
- 基于 tornado, Jinja2, Momoke 的异步 web 框架☆21Aug 23, 2013Updated 12 years ago
- This repository contains the code used in a publication 'Active Learning for Decision-Making from Imbalanced Observational Data', Iiris S…☆11May 14, 2019Updated 6 years ago
- ☆11Jun 10, 2022Updated 3 years ago
- A simplified implementation inspired by Cline☆10Mar 11, 2025Updated 11 months ago
- A repo for demo at PyData NYC 2022☆12Nov 9, 2022Updated 3 years ago
- Introduction to mathematical programming with Pyomo (Python)☆12Oct 4, 2018Updated 7 years ago
- Tutorials for FLAVA model https://arxiv.org/abs/2112.04482☆12Jun 22, 2022Updated 3 years ago
- Italian C++ Conference 2023 Slides☆12Jun 23, 2023Updated 2 years ago
- ☆16Feb 18, 2025Updated last year
- open-source first release (OpenCV, Deepface, YOLOv8, Roboflow)☆13Jan 2, 2025Updated last year
- Kubernetes operator example in Python3☆13Mar 21, 2019Updated 6 years ago
- Counterfactual Evaluation and Learning for Interactive Systems: Foundations, Implementations, and Recent Advances☆12Aug 14, 2022Updated 3 years ago
- Functional, typesafe and well-tested JSON RPC client for Bitcoin, Ethereum and Omni full nodes☆12Jun 16, 2020Updated 5 years ago