High-performance LLM inference based on our optimized version of FastTransfomer
☆122Dec 14, 2023Updated 2 years ago
Alternatives and similar repositories for FasterTransformer4CodeFuse
Users that are interested in FasterTransformer4CodeFuse are comparing it to the libraries listed below
Sorting:
- High Accuracy and efficiency multi-task fine-tuning framework for Code LLMs. This work has been accepted by KDD 2024.☆708Dec 30, 2024Updated last year
- Transformer related optimization, including BERT, GPT☆17Jul 29, 2023Updated 2 years ago
- ☆40Oct 17, 2024Updated last year
- ☆22Dec 11, 2025Updated 2 months ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Oct 11, 2024Updated last year
- Runtimex package help to expose Go Runtime internals representation safely.☆13Feb 19, 2025Updated last year
- Industrial-level evaluation benchmarks for Coding LLMs in the full life-cycle of AI native software developing.企业级代码大模型评测体系,持续开放中☆105Apr 28, 2025Updated 10 months ago
- ☆20Nov 20, 2024Updated last year
- CodeFuse is an interview taking platform that leverages the power of OpenAI's GPT-3 for conducting interviews and providing responses. Th…☆13Dec 1, 2023Updated 2 years ago
- Getting started guide to using GPUs for nCoV2019 research☆14Apr 24, 2020Updated 5 years ago
- [NAACL 2025] Representing Rule-based Chatbots with Transformers☆23Feb 9, 2025Updated last year
- Easily optimize generic performance metrics in differentiable learning.☆18Jun 6, 2020Updated 5 years ago
- Code and data of "Controllable Unsupervised Event-based Video Generation" (accepted as ICIP oral and invited by WACV workshop)☆19Nov 5, 2024Updated last year
- ☆25Aug 23, 2024Updated last year
- ☆32Jun 24, 2025Updated 8 months ago
- ☆22Oct 21, 2024Updated last year
- ROUGE for multilingual Summarization☆25Oct 11, 2021Updated 4 years ago
- CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023)☆174Aug 15, 2025Updated 6 months ago
- The official implementation of "Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks"☆56May 22, 2025Updated 9 months ago
- Autonomous Rust utility that load balances multiple https://ollama.com/ servers☆27Apr 9, 2025Updated 10 months ago
- Mindful is a mental wellness app designed to support users in managing stress and anxiety. Powered by advanced AI, it offers personalized…☆11Apr 11, 2025Updated 10 months ago
- LVCS@Tesla.com☆12Jan 16, 2026Updated last month
- X视频下载工具GUI☆13Dec 5, 2024Updated last year
- Large Language Model (LLM) powered evaluator for Retrieval Augmented Generation (RAG) pipelines.☆33Apr 29, 2024Updated last year
- Official repo for EMNLP 2023 paper "Explain-then-Translate: An Analysis on Improving Program Translation with Self-generated Explanations…☆29Dec 5, 2023Updated 2 years ago
- [DEPRECIATED] [PyTorch 2.0] [638M] [85.33% acc] Full-attention multi-instrumental music transformer for supervised music generation, opti…☆32Nov 23, 2023Updated 2 years ago
- Official PyTorch implementation of "LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging" (ICML 2024)☆31Aug 15, 2024Updated last year
- NaturalCodeBench (Findings of ACL 2024)☆68Oct 14, 2024Updated last year
- 🐆A lightweight, high-performance string manipulation library optimized for speed-sensitive applications.☆14Jan 6, 2026Updated last month
- Repository for a 4-wheel robot car based on Arduino for the "Elegoo Smart Robot Car Kit V3.0 Plus" and similar ones.☆11Dec 20, 2022Updated 3 years ago
- Query-Based Code Analysis Engine☆348Sep 21, 2025Updated 5 months ago
- Orion-14B is a family of models includes a 14B foundation LLM, and a series of models: a chat model, a long context model, a quantized mo…☆810Jun 3, 2024Updated last year
- [ICLR 2022] "Learning Pruning-Friendly Networks via Frank-Wolfe: One-Shot, Any-Sparsity, and No Retraining" by Lu Miao*, Xiaolong Luo*, T…☆33Jan 20, 2022Updated 4 years ago
- Classification of Single cells by Transfer Learning☆10Oct 11, 2025Updated 4 months ago
- tokviz is a Python library for visualizing tokenization patterns across different language models.☆12Apr 25, 2024Updated last year
- High performance async Mssql library for Python.☆17Feb 20, 2026Updated last week
- The SEAL-CPU backend is a Reference backend engine for HEBench which is a shared library that implements the required functions specified…☆11Mar 3, 2023Updated 2 years ago
- Vision-Language Models Toolbox: Your all-in-one solution for multimodal research and experimentation☆12Feb 16, 2025Updated last year
- A SystemVerilog-based simulation and design of a Last Level Cache (LLC) implementing the MESI protocol, featuring Pseudo-LRU replacement,…☆15Nov 24, 2025Updated 3 months ago