shiyemin / light-hf-proxyLinks
A light proxy solution for HuggingFace hub.
☆47Updated last year
Alternatives and similar repositories for light-hf-proxy
Users that are interested in light-hf-proxy are comparing it to the libraries listed below
Sorting:
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆69Updated 2 years ago
- 基于baichuan-7b的开源多模态大语言模型☆73Updated last year
- XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc.☆139Updated last year
- Gaokao Benchmark for AI☆108Updated 3 years ago
- Light local website for displaying performances from different chat models.☆87Updated last year
- ☆105Updated last year
- The official codes for "Aurora: Activating chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning"☆263Updated last year
- the newest version of llama3,source code explained line by line using Chinese☆22Updated last year
- Kanchil(鼷鹿)是世界上最小的偶蹄目动物,这个开源项目意在探索小模型(6B以下)是否也能具备和人类偏好对齐的能力。☆112Updated 2 years ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆40Updated last year
- ☆80Updated last year
- GTS Engine: A powerful NLU Training System。GTS引擎(GTS-Engine)是一款开箱即用且性能强大的自然语言理解引擎,聚焦于小样本任务,能够仅用小样本就能自动化生产NLP模型。☆91Updated 2 years ago
- A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.☆37Updated 10 months ago
- Another ChatGLM2 implementation for GPTQ quantization☆54Updated last year
- Summarize all open source Large Languages Models and low-cost replication methods for Chatgpt.☆136Updated 2 years ago
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆58Updated last year
- XVERSE-7B: A multilingual large language model developed by XVERSE Technology Inc.☆53Updated last year
- SearchGPT: Building a quick conversation-based search engine with LLMs.☆46Updated 6 months ago
- GLM Series Edge Models☆146Updated last month
- SUS-Chat: Instruction tuning done right☆49Updated last year
- Imitate OpenAI with Local Models☆87Updated 11 months ago
- SuperCLUE琅琊榜:中文通用大模型匿名对战评价基准☆145Updated last year
- ☆151Updated last year
- A large-scale language model for scientific domain, trained on redpajama arXiv split☆134Updated last year
- 骆驼QA,中文大语言阅读理解模型。☆74Updated 2 years ago
- a toolkit on knowledge distillation for large language models☆127Updated this week
- SkyScript-100M: 1,000,000,000 Pairs of Scripts and Shooting Scripts for Short Drama: https://arxiv.org/abs/2408.09333v2☆125Updated 8 months ago
- code for Scaling Laws of RoPE-based Extrapolation☆73Updated last year
- ChatGLM-6B-Slim:裁减掉20K图片Token的ChatGLM-6B,完全一样的性能,占用更小的显存。☆126Updated 2 years ago
- Mixture-of-Experts (MoE) Language Model☆189Updated 10 months ago