shiyemin / light-hf-proxyLinks
A light proxy solution for HuggingFace hub.
☆47Updated last year
Alternatives and similar repositories for light-hf-proxy
Users that are interested in light-hf-proxy are comparing it to the libraries listed below
Sorting:
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆68Updated 2 years ago
- 基于baichuan-7b的开源多模态大语言模型☆72Updated last year
- the newest version of llama3,source code explained line by line using Chinese☆22Updated last year
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆41Updated last year
- XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc.☆140Updated last year
- GTS Engine: A powerful NLU Training System。GTS引擎(GTS-Engine)是一款开箱即用且性能强大的自然语言理解引擎,聚焦于小样本任务,能够仅用小样本就能自动化生产NLP模型。☆92Updated 2 years ago
- The official codes for "Aurora: Activating chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning"☆266Updated last year
- Summarize all open source Large Languages Models and low-cost replication methods for Chatgpt.☆137Updated 2 years ago
- ☆79Updated last year
- Light local website for displaying performances from different chat models.☆87Updated last year
- A large-scale language model for scientific domain, trained on redpajama arXiv split☆136Updated last year
- ☆106Updated 2 years ago
- Another ChatGLM2 implementation for GPTQ quantization☆54Updated last year
- code for Scaling Laws of RoPE-based Extrapolation☆73Updated last year
- Gaokao Benchmark for AI☆108Updated 3 years ago
- XVERSE-MoE-A4.2B: A multilingual large language model developed by XVERSE Technology Inc.☆39Updated last year
- 百川Dynamic NTK-ALiBi的代码实现:无需微调即可推理更长文本☆49Updated 2 years ago
- Kanchil(鼷鹿)是世界上最小的偶蹄目动物,这个开源项目意在探索小模型(6B以下)是否也能具备和人类偏好对齐的能力。☆113Updated 2 years ago
- SkyScript-100M: 1,000,000,000 Pairs of Scripts and Shooting Scripts for Short Drama: https://arxiv.org/abs/2408.09333v2☆126Updated 10 months ago
- SuperCLUE琅琊榜:中文通用大模型匿名对战评价基准☆145Updated last year
- Our 2nd-gen LMM☆34Updated last year
- SUS-Chat: Instruction tuning done right☆49Updated last year
- zero零训练llm调参☆32Updated 2 years ago
- Its an open source LLM based on MOE Structure.☆58Updated last year
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆58Updated last year
- 百度QA100万数据集☆48Updated last year
- ☆36Updated last year
- A more efficient GLM implementation!☆54Updated 2 years ago
- XVERSE-7B: A multilingual large language model developed by XVERSE Technology Inc.☆53Updated last year
- Silk Road will be the dataset zoo for Luotuo(骆驼). Luotuo is an open sourced Chinese-LLM project founded by 陈启源 @ 华中师范大学 & 李鲁鲁 @ 商汤科技 & 冷子…☆40Updated last year