yiyepiaoling0715 / codellm-data-preprocess-pipeline
代码大模型 预训练&微调&DPO 数据处理 业界处理pipeline sota
☆38Updated 9 months ago
Alternatives and similar repositories for codellm-data-preprocess-pipeline:
Users that are interested in codellm-data-preprocess-pipeline are comparing it to the libraries listed below
- Official github repo for AutoDetect, an automated weakness detection framework for LLMs.☆42Updated 10 months ago
- ☆143Updated 10 months ago
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆40Updated last year
- Llama-3-SynE: A Significantly Enhanced Version of Llama-3 with Advanced Scientific Reasoning and Chinese Language Capabilities | 继续预训练提升 …