High-performance LLM inference based on our optimized version of FasterTransformer
☆122Dec 14, 2023Updated 2 years ago
Alternatives and similar repositories for FasterTransformer4CodeFuse
Users that are interested in FasterTransformer4CodeFuse are comparing it to the libraries listed below.
- High-accuracy, high-efficiency multi-task fine-tuning framework for Code LLMs. This work has been accepted by KDD 2024.☆714Dec 30, 2024Updated last year
- Industrial-level evaluation benchmarks for Coding LLMs across the full life cycle of AI-native software development. An enterprise-grade evaluation suite for code LLMs, with ongoing releases.☆111Apr 28, 2025Updated last year
- ☆41Oct 17, 2024Updated last year
- Code for ASE'21 paper "AID: Efficient Prediction of Aggregated Intensity of Dependency in Large-scale Cloud Systems"☆15Nov 2, 2021Updated 4 years ago
- Transformer related optimization, including BERT, GPT☆17Jul 29, 2023Updated 2 years ago
- some demos of Knowledge Distillation in NLP☆23Dec 31, 2020Updated 5 years ago
- The Runtimex package helps expose the Go runtime's internal representation safely.☆12Feb 19, 2025Updated last year
- Easily optimize generic performance metrics in differentiable learning.☆17Jun 6, 2020Updated 6 years ago
- A powerful Laravel storage driver that enables seamless synchronization of files across multiple disks, with an integrated cache disk for…☆15Nov 11, 2025Updated 7 months ago
- AI Native IDE based on CodeFuse and OpenSumi☆291May 28, 2026Updated last month
- ☆28Dec 11, 2025Updated 6 months ago
- A pure C++ cross-platform LLM acceleration library, callable from Python; supports chatglm-6B, llama, baichuan, and moss base models on x86 / ARM☆13Jun 10, 2026Updated 2 weeks ago
- A simple JavaScript framework. Provides support for DOM manipulation and event handling. Easy to use, optimized for performance…☆11Feb 15, 2017Updated 9 years ago
- Sparse Matrix Factorization (SMF) is a key component in many machine learning problems and there exist a variety of applications in real-w…☆12Jan 25, 2016Updated 10 years ago
- os lectures 2022 spring☆10Aug 25, 2025Updated 10 months ago
- Keyboard-first dotfiles for terminal-centric development with tmux, Neovim, and coding agents.☆28Updated this week
- A Golang implementation of Keras.☆12Nov 24, 2016Updated 9 years ago
- BERT-family models: search, pruning, distillation☆13Sep 10, 2020Updated 5 years ago
- run ChatGLM2-6B in BM1684X☆49Mar 1, 2024Updated 2 years ago
- Fast Polar Decomposition for Muon☆159Jun 24, 2026Updated last week
- ☆17Apr 29, 2024Updated 2 years ago
- An intelligent assistant serving the entire software development lifecycle, powered by a Multi-Agent Framework, working with DevOps Toolk…☆1,291Jul 1, 2024Updated 2 years ago
- Code for the paper "Interpreting Key Mechanisms of Factual Recall in Transformer-Based Language Models".☆13Apr 10, 2024Updated 2 years ago
- Google Research☆10Apr 20, 2022Updated 4 years ago
- optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052☆478Mar 15, 2024Updated 2 years ago
- Finetune baichuan pretrained model with QLora method☆15Jul 13, 2023Updated 2 years ago
- Backpack Attachments is a FiveM resource for attaching weapons and items to players' backs. It supports customizable attachment points, h…☆10Nov 14, 2024Updated last year
- Industrial-first evaluation benchmark for LLMs in the DevOps/AIOps domain.☆656Jul 10, 2024Updated last year
- The SEAL-CPU backend is a Reference backend engine for HEBench which is a shared library that implements the required functions specified…☆11Mar 3, 2023Updated 3 years ago
- Demo repository for article "Express server, Handlebars & Critical Path Performance Optimization"☆13Jan 12, 2017Updated 9 years ago
- [NAACL 2025] Representing Rule-based Chatbots with Transformers☆23Feb 9, 2025Updated last year
- Fastllm-based chatbot☆11May 19, 2023Updated 3 years ago
- Real-time WebSocket communication in Uniapp using GoEasy☆15Mar 26, 2020Updated 6 years ago
- Repository for running LLMs efficiently on Mac silicon (M1, M2, M3). Features Jupyter notebook for Meta-Llama-3 setup using MLX framework…☆11May 4, 2024Updated 2 years ago
- Stable Magisk modules for performance and efficient battery usage on rooted Android devices.☆32Jun 18, 2026Updated last week
- A Paddle reproduction of the paper "ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information" (ACL 2021)☆10Nov 15, 2021Updated 4 years ago
- A SystemVerilog-based simulation and design of a Last Level Cache (LLC) implementing the MESI protocol, featuring Pseudo-LRU replacement,…☆16Mar 8, 2026Updated 3 months ago
- Principles and Methodologies for Serial Performance Optimization (OSDI '25)☆30Jun 5, 2025Updated last year
- Sekai Viewer but built with Next, optimized for performance☆11Jan 20, 2023Updated 3 years ago