huxiaosheng123 / open-llama2
从预训练到强化学习的中文llama2
☆94Updated last year
Related projects ⓘ
Alternatives and complementary repositories for open-llama2
- Chinese large language model☆132Updated last year
- 最容易上手的0门槛 chatglm3 & agent & langchain 项目☆244Updated 9 months ago
- 使用deepspeed从头开始训练一个LLM,经过pretrain和sft阶段,验证llm学习知识、理解语言、回答问题的能力☆155Updated 4 months ago
- improve Llama-2's proficiency in comprehension, generation, and translation of Chinese.☆534Updated 7 months ago
- 保险行业回访外呼机器人☆74Updated last year
- [ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.☆184Updated last week
- RASA中文任务型机器人☆99Updated 3 weeks ago
- Code and data for crosstalk text generation tasks, exploring whether large models and pre-trained language models can understand humor. …☆167Updated 2 years ago
- [EMNLP 2023] FreeAL: Towards Human-Free Active Learning in the Era of Large Language Models☆86Updated 11 months ago
- Official codes for ACL 2023 paper "WebCPM: Interactive Web Search for Chinese Long-form Question Answering"☆983Updated 11 months ago
- This tool(enhance_long) aims to enhance the LlaMa2 long context extrapolation capability in the lowest-cost approach, preferably without …☆47Updated 11 months ago
- An LLM-based tool to chat with your documents and databases, including a management system | 面向企业内部环境的大模型(LLM)知识库问答系统,包含后台管理系统☆97Updated last year
- Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conv…☆381Updated last month
- The framework to prune LLMs to any size and any config.☆99Updated 8 months ago
- 本项目旨在结合以往研究人员的代表性工作,从多个维度评估sft数据,并自动化过滤sft数据。☆55Updated 8 months ago
- This includes the original implementation of CtrlA: Adaptive Retrieval-Augmented Generation via Inherent Control.☆66Updated last month
- HuatuoGPT2, One-stage Training for Medical Adaption of LLMs. (An Open Medical GPT)☆365Updated 2 months ago
- This is a repo for my NanoGPT Pytorch2.0 Implementation when torch2.0 released soon, faster and simpler, a good tutorial learning GPT.☆60Updated 9 months ago
- notes for Multi-hop Reading Comprehension and open-domain question answering☆87Updated 2 years ago
- 【grps接入trtllm】通过GPRS+TensorRT-LLM+Tokenizers.cpp实现纯C++版高性能OpenAI LLM服务,支持chat和function call模式,支持ai agent,支持分布式多卡推理,支持多模态,支持gradio聊天界面。☆92Updated 3 weeks ago
- bert、roberta ner命名实体识别☆102Updated 2 years ago
- “英特尔创新大师杯”深度学习挑战赛 赛道3:CCKS2021中文NLP地址相关性任务☆155Updated 2 years ago
- Multilingual Corpus of Web Fiction☆216Updated 4 months ago
- [NeurIPS 2024] BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models☆215Updated this week
- 欢迎来到 LLM-Dojo,这里是一个开源大模型学习场所,使用简洁且易阅读的代码构建模型训练框架(支持各种主流模型如Qwen、Llama、GLM等等)、RLHF框架(DPO/CPO/KTO/PPO)等各种功能。👩🎓👨🎓☆350Updated 2 weeks ago
- Support mixed-precsion inference with vllm☆97Updated 2 weeks ago
- 接地气的 大模型工程,争取成为一本大模型实战百科全书☆17Updated last year
- Controllable Text Generation for Large Language Models: A Survey☆142Updated 2 months ago