YueZhengMeng / MyLlama
手搓Llama,个人学习用
☆12Updated 10 months ago
Alternatives and similar repositories for MyLlama:
Users that are interested in MyLlama are comparing it to the libraries listed below
- ☆84Updated last year
- 基于DPO算法微调语言大模型,简单好上手。☆35Updated 8 months ago
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation☆77Updated 4 months ago
- ☆66Updated last year
- Code for a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models☆62Updated last month
- 怎么训练一个LLM分词器☆142Updated last year
- ☆9Updated last year
- 多轮共情对话模型PICA☆92Updated last year
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆40Updated last year
- 使用单个24G显卡,从0开始训练LLM☆50Updated 5 months ago
- [TALLIP] General and Domain Adaptive Chinese Spelling Check with Error Consistent Pretraining☆56Updated last year
- ☆137Updated 11 months ago
- ☆81Updated last year
- ☆29Updated last year
- NTK scaled version of ALiBi position encoding in Transformer.☆67Updated last year
- ☆106Updated 9 months ago
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models☆54Updated 8 months ago
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"☆124Updated 9 months ago
- SuperCLUE-Math6:新一代中文原生多轮多步数学推理数据集的探索之旅☆53Updated last year
- learn jiu wan shier l☆52Updated 3 years ago
- 擂台赛3-大规模预训练调优比赛的示例代码与baseline实现☆38Updated 2 years ago
- This is the code repo for the paper <UTC-IE: A Unified Token-pair Classification Architecture for Information Extraction>☆15Updated last year
- ☆15Updated last year
- Code for AAAI 2023 accepted paper titled "Knowledge-Bridged Causal Interaction Network for Causal Emotion Entailment"☆13Updated last year
- [ACL 23] CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors☆36Updated last month
- ☆26Updated 5 months ago
- Llama-3-SynE: A Significantly Enhanced Version of Llama-3 with Advanced Scientific Reasoning and Chinese Language Capabilities | 继续预训练提升 …☆32Updated 3 months ago
- A Massive Multi-Level Multi-Subject Knowledge Evaluation benchmark☆100Updated last year
- 基于T5模型的中文文本纠错☆30Updated 4 months ago
- ☆95Updated last year