longyuewangdcu / Chinese-Llama-2
Improves Llama-2's proficiency in comprehension, generation, and translation of Chinese.
☆445Updated last year
Alternatives and similar repositories for Chinese-Llama-2
Users who are interested in Chinese-Llama-2 are comparing it to the repositories listed below
- Official codes for ACL 2023 paper "WebCPM: Interactive Web Search for Chinese Long-form Question Answering"☆914Updated 2 years ago
- Multilingual Corpus of Web Fiction☆198Updated last year
- Chinese llama2, from pre-training to reinforcement learning☆87Updated 2 years ago
- adds Sequence Parallelism into LLaMA-Factory☆600Updated 2 months ago
- An MBTI Exploration of Large Language Models☆520Updated last year
- Chinese large language model☆123Updated 2 years ago
- [COLM’25] DeepRetrieval — 🔥 Training Search Agent by RLVR with Retrieval Outcome☆687Updated 2 months ago
- Code and data for crosstalk text generation tasks, exploring whether large models and pre-trained language models can understand humor. …☆147Updated 3 years ago
- [ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.☆180Updated 7 months ago
- [NeurIPS 2024] BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models☆281Updated 9 months ago
- The easiest-to-learn, zero-barrier chatglm3 & agent & langchain project☆233Updated last year
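- Panda is an open-source overseas Chinese large language model project launched in May 2023. It is dedicated to exploring the entire technology stack in the era of large models and aims to promote innovation and collaboration in Chinese natural language processing.☆1,038Updated 2 years ago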
- Complex Reasoning Rag System, Agentic Rag System☆239Updated 3 weeks ago
- an approximate implementation similar to chatpdf☆188Updated last year
- [ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | A Chinese-English bilingual multimodal large model series based on the CPM foundation models☆1,072Updated last year
- DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Models☆151Updated 11 months ago
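- Train an LLM from scratch with deepspeed, through the pretrain and sft stages, to verify the LLM's ability to learn knowledge, understand language, and answer questions☆159Updated 2 months ago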
- The framework to prune LLMs to any size and any config.☆94Updated last year
- ☆330Updated 4 months ago
- A scalable, end-to-end training pipeline for general-purpose agents☆363Updated 6 months ago
- An Innovative Agent Framework Driven by KG Engine☆772Updated 11 months ago
- LLM-And-More is a professional, plug-and-play, llm trainer and application builder that guides you through the complete LLM workflow from…☆386Updated last year
- Industrial-first evaluation benchmark for LLMs in the DevOps/AIOps domain.☆646Updated last year
- (ACL-2025 main conference) SurveyForge: On the Outline Heuristics, Memory-Driven Generation, and Multi-dimensional Evaluation for Automat…☆312Updated 4 months ago
- Deep Research Agent CognitiveKernel-Pro from Tencent AI Lab. Paper: https://arxiv.org/pdf/2508.00414☆483Updated 2 months ago
- A Chinese-English bilingual foundation large model at the ten-billion-parameter scale☆2,415Updated 2 years ago
- Code for paper "GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators"☆198Updated last year
- Tracking the progress in SLU (resources, code, and new frontiers etc.)☆897Updated 2 years ago
- ☆102Updated 2 years ago
- ☆46Updated 9 months ago