☆22Jul 11, 2023Updated 2 years ago
Alternatives and similar repositories for fastertransformer_backend
Users who are interested in fastertransformer_backend are comparing it to the libraries listed below.
- ☆412Nov 11, 2023Updated 2 years ago
- ☆19Jun 4, 2021Updated 4 years ago
- ☆11Updated this week
- Memory footprint reduction for transformer models☆11Jan 24, 2023Updated 3 years ago
- Implementation of algorithms for memory optimized deep neural network training☆10Jul 23, 2020Updated 5 years ago
- New word discovery / new word mining / degree of freedom / cohesion / python3☆10May 28, 2019Updated 6 years ago
- ☆128Dec 24, 2024Updated last year
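- A highly efficient string matching tool that supports forward/backward maximum matching word segmentation and exact multi-pattern string matching☆16Jul 29, 2023Updated 2 years ago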
- llama inference for tencentpretrain☆99Jun 8, 2023Updated 2 years ago
- /j f t/ - YAML file tool☆14Apr 28, 2026Updated 3 weeks ago
- TLLM_QMM strips the implementation of quantized kernels of Nvidia's TensorRT-LLM, removing NVInfer dependency and exposes ease of use Pyt…☆16Jul 5, 2024Updated last year
- Implementation of vDNN++; an improvement over vDNN☆18Dec 7, 2018Updated 7 years ago
- Transformer related optimization, including BERT, GPT☆39Feb 10, 2023Updated 3 years ago
- Thinking is hard - automate it☆18Aug 24, 2022Updated 3 years ago
- PyTorch-based text classification for imbalanced data☆12Dec 26, 2021Updated 4 years ago
- AI-generated simple music: reliably generate and play it using ChatGPT | AI composition | AI songwriting☆17Mar 3, 2024Updated 2 years ago
- ☆14Aug 21, 2025Updated 9 months ago
- Recurrent Convolutional Neural Network implementation in TF2.0☆13Mar 25, 2023Updated 3 years ago
- The baseline method for CCIR 22 https://www.datafountain.cn/competitions/573☆13Aug 2, 2022Updated 3 years ago
- Python code for "Bayesian hybrid matrix factorisation for data integration", published at 20th International Conference on Artificial Int…☆13Jun 10, 2018Updated 7 years ago
- List of companies that allow remote work (work from home)☆16Mar 2, 2022Updated 4 years ago
- the completion of CNNs by myself☆14Oct 8, 2015Updated 10 years ago
- Crime amount extraction for the 法研杯 (CAIL) competition☆14Mar 5, 2022Updated 4 years ago
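- 同花顺 (Tonghuashun) algorithm challenge platform: [September-October bimonthly contest] text semantic matching with cross-domain transfer☆11Oct 28, 2021Updated 4 years ago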
- Large Language Model (LLM) Serving Paper and Resource List☆28Apr 16, 2026Updated last month
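- Intent recognition with large models☆11Aug 14, 2024Updated last year
- Implementation of a new word discovery algorithm based on degree of freedom (entropy) and cohesion☆12Oct 7, 2018Updated 7 years ago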
- Adapted iPerf3 iOS sample☆12Mar 15, 2017Updated 9 years ago
- Source code for "悟道" (WuDao)☆21Aug 24, 2021Updated 4 years ago
- QQQ is an innovative and hardware-optimized W4A8 quantization solution for LLMs.☆155Aug 21, 2025Updated 9 months ago
- ⚡ boost inference speed of GPT models in transformers by onnxruntime☆51Aug 20, 2023Updated 2 years ago
- ☆17May 1, 2023Updated 3 years ago
- Lock tailing on your rotating files☆12Dec 4, 2019Updated 6 years ago
- DataSciCamp — Data Science Challenge / Competition Deadlines☆15May 26, 2020Updated 5 years ago
- Structured parsing of electronic medical records☆13May 11, 2022Updated 4 years ago
- Fast and memory-efficient exact attention☆21Apr 10, 2026Updated last month
- Distributed ML Training Benchmarks☆27Mar 1, 2023Updated 3 years ago
- Awesome repositories for LLaMA1 and LLaMA2☆18Jul 25, 2024Updated last year
- Deploy Kubernetes Using OpenStack Ironic☆11Jul 27, 2017Updated 8 years ago