High-performance LLM inference based on our optimized version of FastTransfomer
☆122Dec 14, 2023Updated 2 years ago
Alternatives and similar repositories for FasterTransformer4CodeFuse
Users that are interested in FasterTransformer4CodeFuse are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- High Accuracy and efficiency multi-task fine-tuning framework for Code LLMs. This work has been accepted by KDD 2024.☆716Dec 30, 2024Updated last year
- Industrial-level evaluation benchmarks for Coding LLMs in the full life-cycle of AI native software developing.企业级代码大模型评测体系,持续开放中☆109Apr 28, 2025Updated last year
- ☆41Oct 17, 2024Updated last year
- Code for ASE'21 paper "AID: Efficient Prediction of Aggregated Intensity of Dependency in Large-scale Cloud Systems"☆15Nov 2, 2021Updated 4 years ago
- Transformer related optimization, including BERT, GPT☆17Jul 29, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Runtimex package help to expose Go Runtime internals representation safely.☆12Feb 19, 2025Updated last year
- CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023)☆177Aug 15, 2025Updated 8 months ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Oct 11, 2024Updated last year
- AI Native IDE based on CodeFuse and OpenSumi☆286Dec 3, 2025Updated 4 months ago
- Collection of usefull scripts for RunPod pods☆15Jan 26, 2024Updated 2 years ago
- ☆64Jan 16, 2025Updated last year
- ☆27Dec 11, 2025Updated 4 months ago
- ☆127Apr 22, 2023Updated 3 years ago
- 纯c++的全平台llm加速库,支持python调用,支持chatglm-6B, llama, baichuan, moss基座,x86 / ARM☆12Jan 30, 2026Updated 3 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code for co-training large language models (e.g. T0) with smaller ones (e.g. BERT) to boost few-shot performance☆17Sep 23, 2022Updated 3 years ago
- os lectures 2022 spring☆10Aug 25, 2025Updated 8 months ago
- run ChatGLM2-6B in BM1684X☆49Mar 1, 2024Updated 2 years ago
- ☆14May 19, 2023Updated 2 years ago
- a simple theme for Hexo☆30Dec 17, 2023Updated 2 years ago
- The project covers common metrics for super-resolution performance evaluation.☆12Dec 27, 2021Updated 4 years ago
- ☆21May 26, 2025Updated 11 months ago
- A competition on DataCastle which is about text keyword extraction ! Rank 6 / 622 !☆16Jan 27, 2019Updated 7 years ago
- An intelligent assistant serving the entire software development lifecycle, powered by a Multi-Agent Framework, working with DevOps Toolk…☆1,287Jul 1, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- CausIL is an approach to estimate the causal graph for a cloud microservice system, where the nodes are the service-specific metrics whil…☆13Jul 3, 2023Updated 2 years ago
- ☆10Mar 2, 2024Updated 2 years ago
- Finetune baichuan pretrained model with QLora method☆16Jul 13, 2023Updated 2 years ago
- Industrial-first evaluation benchmark for LLMs in the DevOps/AIOps domain.☆652Jul 10, 2024Updated last year
- pure go for rwkv☆18Dec 31, 2023Updated 2 years ago
- ☆18Apr 14, 2021Updated 5 years ago
- [NAACL 2025] Representing Rule-based Chatbots with Transformers☆23Feb 9, 2025Updated last year
- ☆34Jul 23, 2024Updated last year
- A minimalist Go implementation of Microsoft's GraphRAG☆22Aug 25, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Uniapp使用GoEasy实现websocket实时通讯☆15Mar 26, 2020Updated 6 years ago
- Go language bindings for ONNX runtime☆18Apr 15, 2020Updated 6 years ago
- 用Paddle复现论文ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information(ACL2021)☆10Nov 15, 2021Updated 4 years ago
- A curated paper list on LLM reasoning.☆90Mar 4, 2024Updated 2 years ago
- ☆330Jul 25, 2024Updated last year
- 一款基于Gin+Vue+ElementUI的前后端分离权限管理系统,以 Golang、Gin、Xorm、Vue、ElementUI、MySQL等技术栈开发平台框架,拥有完善的(RBAC)权限架构和基础核心管理模块,为了缩短研发周期,系统框架集成了代码生成器,内置平台自定义研…☆19May 19, 2022Updated 3 years ago
- SDXL LCM Multi-controlnet with loras☆15Dec 11, 2023Updated 2 years ago