中文版 llm-numbers
☆131Dec 25, 2023Updated 2 years ago
Alternatives and similar repositories for llm-numbers-cn
Users that are interested in llm-numbers-cn are comparing it to the libraries listed below
Sorting:
- AskIt (for JavaScript/TypeScript): Unified programming interface for large language models (GPT-4, GPT-3.5)☆35Oct 1, 2023Updated 2 years ago
- 演示 vllm 对中文大语言模型的神奇效果☆31Nov 4, 2023Updated 2 years ago
- spark-sight: Spark performance at a glance☆10Apr 6, 2023Updated 2 years ago
- This the implementation of LeCo☆31Jan 20, 2025Updated last year
- Ensō is a high-performance streaming interface for NIC-application communication.☆78Sep 4, 2025Updated 6 months ago
- imagetokenizer is a python package, helps you encoder visuals and generate visuals token ids from codebook, supports both image and video…☆41Jun 22, 2024Updated last year
- 📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉☆5,040Feb 27, 2026Updated last week
- Pre-clinical drug discovery faces the low efficiency dilemma. One of the reasons is the lack of cross-drug efficacy evaluation infrastruc…☆14Dec 8, 2025Updated 2 months ago
- a toolkit focus on mutual convert UI code of arkts(harmony)/vue/react/react native/mini programe(wx,tt,baidu,ks)/hap; 一个工具集致力提供鸿蒙arkts/vu…☆12Dec 3, 2024Updated last year
- FPGA Low latency 10GBASE-R PCS☆12May 23, 2023Updated 2 years ago
- Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.☆4,843Updated this week
- Crawl & Visualize NeurIPS 2022 Data from OpenReview☆14Nov 8, 2022Updated 3 years ago
- Use strategy in stock transaction for high revenue.☆11Dec 24, 2015Updated 10 years ago
- Code for "Sample-efficient Deep Reinforcement Learning of Mobile Manipulation for 6-DOF Trajectory Following"☆13Mar 19, 2025Updated 11 months ago
- Canopy is a machine learning learning compiler stack with the capability of adopting high-end FPGAs. As a part of OpenAIOS project, Canop…☆12May 7, 2021Updated 4 years ago
- Link any file anywhere on your computer!☆11May 11, 2024Updated last year
- Implement some method of LLM KV Cache Sparsity☆40Jun 6, 2024Updated last year
- ☆10Aug 9, 2021Updated 4 years ago
- SuperCLUE-Agent: 基于中文原生任务的Agent智能体核心能力测评基准☆94Nov 9, 2023Updated 2 years ago
- Silk Road will be the dataset zoo for Luotuo(骆驼). Luotuo is an open sourced Chinese-LLM project founded by 陈启源 @ 华中师范大学 & 李鲁鲁 @ 商汤科技 & 冷子…☆40Nov 5, 2023Updated 2 years ago
- [ACL 2024] RelayAttention for Efficient Large Language Model Serving with Long System Prompts☆40Feb 29, 2024Updated 2 years ago
- Evaluation for AI apps and agent☆44Jan 18, 2024Updated 2 years ago
- Disaggregated serving system for Large Language Models (LLMs).☆778Apr 6, 2025Updated 11 months ago
- Since the emergence of chatGPT in 2022, the acceleration of Large Language Model has become increasingly important. Here is a list of pap…☆283Mar 6, 2025Updated last year
- A std::execution style runtime context and High Performance RPC Transport for using OpenUCX. Including CUDA/ROCM/... devices with RDMA.☆29Feb 22, 2026Updated last week
- Rust FFI example project for Java & Python☆10Jun 8, 2019Updated 6 years ago
- this is based on the paper Chain-of-Retrieval Augmented Generation☆14Mar 29, 2025Updated 11 months ago
- ☆11Jan 20, 2023Updated 3 years ago
- Web of Science Alert to RSS☆10Nov 21, 2023Updated 2 years ago
- ☆17Mar 19, 2022Updated 3 years ago
- Load driver to orchestrate automated load tests on Marathon/Mesos.☆10Mar 31, 2014Updated 11 years ago
- ☆10May 14, 2023Updated 2 years ago
- A simple daemon to control fan speed on t2 Macs with patchched kernel. Visit https://t2linux.org for more information on the kernels☆11Aug 17, 2022Updated 3 years ago
- Allows you to use a macro prefix_all to prefix every attribute in structs and enums on serialization☆12Mar 28, 2021Updated 4 years ago
- This is an example showing how to export and import functions between a Rust application and Rust WebAssembly.☆15Jul 1, 2020Updated 5 years ago
- This repository supports the blog site www.cloudauditcontrols.com.☆15Dec 3, 2025Updated 3 months ago
- The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"☆15May 18, 2025Updated 9 months ago
- 🎭 Official code and dataset for our CCGPK@COLING 2022 paper - "PersonaChatGen: Generating Personalized Dialogue using GPT-3"☆13Mar 26, 2024Updated last year
- ☆10Oct 22, 2024Updated last year