High-performance LLM inference based on our optimized version of FasterTransformer
☆122Dec 14, 2023Updated 2 years ago
Alternatives and similar repositories for FasterTransformer4CodeFuse
Users that are interested in FasterTransformer4CodeFuse are comparing it to the libraries listed below.
- A high-accuracy, high-efficiency multi-task fine-tuning framework for Code LLMs. This work has been accepted by KDD 2024.☆715Dec 30, 2024Updated last year
- Industrial-level evaluation benchmarks for Coding LLMs across the full life cycle of AI-native software development. An enterprise-grade evaluation suite for code LLMs, with ongoing releases.☆110Apr 28, 2025Updated last year
- ☆41Oct 17, 2024Updated last year
- Transformer related optimization, including BERT, GPT☆17Jul 29, 2023Updated 2 years ago
- some demos of Knowledge Distillation in NLP☆23Dec 31, 2020Updated 5 years ago
- Easily optimize generic performance metrics in differentiable learning.☆18Jun 6, 2020Updated 6 years ago
- An unofficial Mathe System for NEU.☆29Jun 28, 2017Updated 8 years ago
- CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023)☆179Aug 15, 2025Updated 9 months ago
- ☆62Jun 17, 2024Updated last year
- A powerful Laravel storage driver that enables seamless synchronization of files across multiple disks, with an integrated cache disk for…☆15Nov 11, 2025Updated 6 months ago
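- Simulates logging in to the Northeastern University (China) academic affairs office website and retrieves all student information. May no longer work following updates to the website.☆11Mar 2, 2019Updated 7 years ago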
- ☆66Jan 16, 2025Updated last year
- ☆28Dec 11, 2025Updated 6 months ago
- ☆127Apr 22, 2023Updated 3 years ago
- This is Microsoft-Phi-3-NvidiaNIMWorkshop☆22Aug 16, 2024Updated last year
- A pure C++ cross-platform LLM acceleration library, callable from Python; supports chatglm-6B, llama, baichuan, and moss base models on x86 / ARM.☆12Updated this week
- Ling-Coder-Lite is a MoE LLM provided and open-sourced by CodeFuse and InclusionAI.☆15Apr 22, 2025Updated last year
- A simple JavaScript framework. Provides support for DOM manipulation and event handling. Easy to use, optimized for performance…☆11Feb 15, 2017Updated 9 years ago
- Code for co-training large language models (e.g. T0) with smaller ones (e.g. BERT) to boost few-shot performance☆16Sep 23, 2022Updated 3 years ago
- Sparse Matrix Factorization (SMF) is a key component in many machine learning problems, and there exists a variety of applications in real-w…☆12Jan 25, 2016Updated 10 years ago
- Adds a Doctrine Id generator which uses an ordered UUID in MySQL for extra performance. Uses methods described in Karhik Appigatla's arti…☆10Jun 8, 2015Updated 11 years ago
- Keyboard-first dotfiles for terminal-centric development with tmux, Neovim, and coding agents.☆28Updated this week
- A Golang implementation of Keras.☆12Nov 24, 2016Updated 9 years ago
- BERT-family models: search, pruning, distillation☆13Sep 10, 2020Updated 5 years ago
- 💾 Optimize Laravel caching with Cachetastic! Cache method results, force refresh, handle errors, and boost app performance effortlessly.☆13Jan 26, 2026Updated 4 months ago
- ☆14May 19, 2023Updated 3 years ago
- A DataCastle competition on text keyword extraction. Rank 6 / 622!☆16Jan 27, 2019Updated 7 years ago
- 🐆 A lightweight, high-performance string manipulation library optimized for speed-sensitive applications.☆16Mar 28, 2026Updated 2 months ago
- ☆17Apr 29, 2024Updated 2 years ago
- ☆12Sep 4, 2023Updated 2 years ago
- My personal dotfiles with automated macOS setup. Features smart installation scripts, Bats testing (bash), performance monitoring, and 2…☆12Apr 24, 2026Updated last month
- optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052☆479Mar 15, 2024Updated 2 years ago
- Fine-tune the baichuan pretrained model with the QLoRA method☆15Jul 13, 2023Updated 2 years ago
- Backpack Attachments is a FiveM resource for attaching weapons and items to players' backs. It supports customizable attachment points, h…☆10Nov 14, 2024Updated last year
- The SEAL-CPU backend is a Reference backend engine for HEBench which is a shared library that implements the required functions specified…☆11Mar 3, 2023Updated 3 years ago
- Demo repository for article "Express server, Handlebars & Critical Path Performance Optimization"☆13Jan 12, 2017Updated 9 years ago
- [NAACL 2025] Representing Rule-based Chatbots with Transformers☆23Feb 9, 2025Updated last year
- LeapRemote-Android Public OpenSource☆23Apr 12, 2025Updated last year
- A particle system implemented on Android, handling collisions, optimized for performance☆10Dec 18, 2023Updated 2 years ago