High-performance LLM inference based on our optimized version of FasterTransformer
☆122Dec 14, 2023Updated 2 years ago
Alternatives and similar repositories for FasterTransformer4CodeFuse
Users that are interested in FasterTransformer4CodeFuse are comparing it to the libraries listed below.
- High Accuracy and efficiency multi-task fine-tuning framework for Code LLMs. This work has been accepted by KDD 2024.☆714Dec 30, 2024Updated last year
- Industrial-level evaluation benchmarks for Coding LLMs in the full life-cycle of AI native software developing. An enterprise-grade evaluation suite for code LLMs, with releases ongoing.☆108Apr 28, 2025Updated 11 months ago
- ☆40Oct 17, 2024Updated last year
- Transformer related optimization, including BERT, GPT☆17Jul 29, 2023Updated 2 years ago
- The Runtimex package helps expose the Go runtime's internal representations safely.☆12Feb 19, 2025Updated last year
- Easily optimize generic performance metrics in differentiable learning.☆18Jun 6, 2020Updated 5 years ago
- CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023)☆177Aug 15, 2025Updated 7 months ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Oct 11, 2024Updated last year
- ☆62Jun 17, 2024Updated last year
- A powerful Laravel storage driver that enables seamless synchronization of files across multiple disks, with an integrated cache disk for…☆15Nov 11, 2025Updated 5 months ago
- Unofficial implementation of Towards Accurate Scene Text Recognition with Semantic Reasoning Networks☆28Sep 24, 2021Updated 4 years ago
- ☆127Apr 22, 2023Updated 2 years ago
- A pure C++ cross-platform LLM acceleration library, callable from Python; supports chatglm-6B, llama, baichuan, and moss base models on x86 / ARM☆12Jan 30, 2026Updated 2 months ago
- Welcome to My Repository. Here you'll find stable Magisk modules for performance and battery saving on all rooted Android devices. Stay…☆22Aug 10, 2025Updated 8 months ago
- os lectures 2022 spring☆10Aug 25, 2025Updated 7 months ago
- 💾 Optimize Laravel caching with Cachetastic! Cache method results, force refresh, handle errors, and boost app performance effortlessly.☆13Jan 26, 2026Updated 2 months ago
- run ChatGLM2-6B in BM1684X☆49Mar 1, 2024Updated 2 years ago
- a simple theme for Hexo☆30Dec 17, 2023Updated 2 years ago
- A DataCastle competition on text keyword extraction. Rank 6 / 622!☆16Jan 27, 2019Updated 7 years ago
- An intelligent assistant serving the entire software development lifecycle, powered by a Multi-Agent Framework, working with DevOps Toolk…☆1,286Jul 1, 2024Updated last year
- My personal dotfiles with automated macOS setup. Features smart installation scripts, Bats testing (bash), performance monitoring, and 2…☆11Feb 6, 2026Updated 2 months ago
- Efficient Hyper-parameter Tuning at Scale (VLDB'22)☆10Dec 1, 2021Updated 4 years ago
- optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052☆479Mar 15, 2024Updated 2 years ago
- Backpack Attachments is a FiveM resource for attaching weapons and items to players' backs. It supports customizable attachment points, h…☆10Nov 14, 2024Updated last year
- The SEAL-CPU backend is a Reference backend engine for HEBench which is a shared library that implements the required functions specified…☆11Mar 3, 2023Updated 3 years ago
- Industrial-first evaluation benchmark for LLMs in the DevOps/AIOps domain.☆652Jul 10, 2024Updated last year
- ☆18Apr 14, 2021Updated 4 years ago
- LeapRemote-Android Public OpenSource☆23Apr 12, 2025Updated 11 months ago
- [NAACL 2025] Representing Rule-based Chatbots with Transformers☆23Feb 9, 2025Updated last year
- ☆34Jul 23, 2024Updated last year
- This repository collects and maintains practical Tampermonkey userscripts, aiming to make everyday web browsing and page operations more efficient.☆24May 29, 2025Updated 10 months ago
- Discover Netflix's Open Connect Appliance (OCA) assigned to your connection. This tool fetches and displays detailed connectivity and hos…☆18Jul 22, 2025Updated 8 months ago
- Optimizing loading training data from cloud bucket storage for cloud-based distributed deep learning. Official repository for Quantifying…☆11Jan 1, 2022Updated 4 years ago
- ☆12Dec 30, 2023Updated 2 years ago
- A SystemVerilog-based simulation and design of a Last Level Cache (LLC) implementing the MESI protocol, featuring Pseudo-LRU replacement,…☆15Mar 8, 2026Updated last month
- Principles and Methodologies for Serial Performance Optimization (OSDI' 25)☆27Jun 5, 2025Updated 10 months ago
- Sekai Viewer but built with Next, optimized for performance☆11Jan 20, 2023Updated 3 years ago
- Autonomous Rust utility that load balances multiple https://ollama.com/ servers☆26Apr 9, 2025Updated last year
- A curated paper list on LLM reasoning.☆90Mar 4, 2024Updated 2 years ago