longyuewangdcu / Chinese-Llama-2Links

improve Llama-2's proficiency in comprehension, generation, and translation of Chinese.

☆448

Alternatives and similar repositories for Chinese-Llama-2

Users that are interested in Chinese-Llama-2 are comparing it to the libraries listed below

Sorting:

thunlp / WebCPM
Official codes for ACL 2023 paper "WebCPM: Interactive Web Search for Chinese Long-form Question Answering"
☆917Updated last year
longyuewangdcu / GuoFeng-Webnovel
Multilingual Corpus of Web Fiction
☆195Updated last year
PKU-YuanGroup / Machine-Mindset
An MBTI Exploration of Large Language Models
☆489Updated last year
huxiaosheng123 / open-llama2
从预训练到强化学习的中文llama2
☆88Updated last year
Qihoo360 / 360-LLaMA-Factory
adds Sequence Parallelism into LLaMA-Factory
☆525Updated last week
pat-jj / DeepRetrieval
DeepRetrieval - 🔥 Training Search Agent with Retrieval Outcomes via Reinforcement Learning
☆580Updated 3 weeks ago
dandelionsllm / pandallm
Panda项目是于2023年5月启动的开源海外中文大语言模型项目，致力于大模型时代探索整个技术栈，旨在推动中文自然语言处理领域的创新和合作。
☆1,039Updated last year
enze5088 / Chatterbox
Chinese large language model
☆121Updated 2 years ago
xxw1995 / chatglm3-finetune
最容易上手的0门槛 chatglm3 & agent & langchain 项目
☆220Updated last year
FreedomIntelligence / crosstalk-generation
Code and data for crosstalk text generation tasks, exploring whether large models and pre-trained language models can understand humor. …
☆147Updated 2 years ago
IAAR-Shanghai / UHGEval
[ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.
☆169Updated last month
Ledzy / BAdam
[NeurIPS 2024] BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models
☆263Updated 4 months ago
OpenBMB / VisCPM
[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | 基于CPM基础模型的中英双语多模态大模型系列
☆1,063Updated last year
Alpha-Innovator / DocGenome
DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Models
☆137Updated 6 months ago
Simple-Efficient / RL-Factory
Train your Agent model via our easy and efficient framework
☆1,258Updated last week
wei-potato / Train-llm-from-scratch
使用deepspeed从头开始训练一个LLM,经过pretrain和sft阶段,验证llm学习知识、理解语言、回答问题的能力
☆154Updated last year
Ulov888 / chatpdflike
an approximate implementation similar to chatpdf
☆188Updated 8 months ago
codefuse-ai / codefuse-devops-eval
Industrial-first evaluation benchmark for LLMs in the DevOps/AIOps domain.
☆637Updated last year
cmriat / l0
A scalable, end-to-end training pipeline for general-purpose agents
☆258Updated last week
yizhen20133868 / Awesome-SLU-Survey
Tracking the progress in SLU (resources, code, and new frontiers etc.)
☆894Updated last year
jordddan / Pruning-LLMs
The framework to prune LLMs to any size and any config.
☆93Updated last year
codefuse-ai / CodeFuse-muAgent
An Innovative Agent Framework Driven by KG Engine
☆767Updated 6 months ago
minghao-wu / transagents
The official repository of the paper "(Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long…
☆571Updated 2 months ago
IceBearAI / LLM-And-More
LLM-And-More is a professional, plug-and-play, llm trainer and application builder that guides you through the complete LLM workflow from…
☆383Updated last year
HITsz-TMG / UMOE-Scaling-Unified-Multimodal-LLMs
The codes about "Uni-MoE: Scaling Unified Multimodal Models with Mixture of Experts"
☆741Updated 2 months ago
OpenBMB / CPM-Live
Live Training for Open-source Big Models
☆505Updated 2 years ago
KodCode-AI / kodcode
✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framwork
☆238Updated last month
Alpha-Innovator / SurveyForge
(ACL-2025 main conference) SurveyForge: On the Outline Heuristics, Memory-Driven Generation, and Multi-dimensional Evaluation for Automat…
☆268Updated 2 weeks ago
longyuewangdcu / Document-MT-LLM
☆102Updated 2 years ago
OPPO-PersonalAI / TaskCraft
A library for generating difficulty-scalable, multi-tool, and verifiable agentic tasks with execution trajectories.
☆110Updated last week