A Chinese version of Llama3, obtained from Llama3 through further CPT, SFT, and ORPO
☆16Apr 24, 2024Updated last year
Alternatives and similar repositories for Llama3-Chinese-ORPO
Users that are interested in Llama3-Chinese-ORPO are comparing it to the libraries listed below.
- WisdoMentor - Series: An LLM for undergraduates | 博导智言 (a learning aid for university students)☆13May 9, 2024Updated last year
- An LLM paper note list.☆21Apr 6, 2024Updated 2 years ago
- ☆30Jan 11, 2026Updated 3 months ago
- The trainer for HF to record losses of different tasks and objectives.☆54Mar 12, 2025Updated last year
- Code and models for EMNLP 2024 paper "WPO: Enhancing RLHF with Weighted Preference Optimization"☆41Sep 24, 2024Updated last year
- Pre-training Llama3 on Chinese☆13May 1, 2024Updated last year
- MLLM @ Game☆16May 12, 2025Updated 11 months ago
- Pytorch🍊🍉 is delicious, just eat it! 😋😋☆10Feb 13, 2026Updated 2 months ago
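- A tutorial demonstrating Chinese instruction fine-tuning of Gemma☆45Feb 26, 2024Updated 2 years ago
- (Work in progress) This repository is tutorial-oriented: it uses "adapting a model to Chinese" as a typical model-training scenario to guide readers through hands-on LLM fine-tuning.☆36Aug 5, 2024Updated last year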
- 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.☆13Sep 1, 2025Updated 7 months ago
- Knowledge Graph Model Hub, a repository that integrates commonly used static and temporal knowledge graph methods.☆68Aug 13, 2025Updated 8 months ago
- A Cantonese-English translator based on prompt engineering☆12Sep 19, 2023Updated 2 years ago
- cracked prompt of famous coding agent and autodev☆24Mar 19, 2026Updated last month
- ☆11May 8, 2020Updated 5 years ago
- PITS: Chinese, Japanese, English, Korean☆12Mar 14, 2023Updated 3 years ago
- GFPGAN face reconstruction with ncnn on a bare Raspberry Pi☆14Jan 4, 2023Updated 3 years ago
- Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization☆103Jan 30, 2024Updated 2 years ago
- An AutoDL environment for native fine-tuning of Stable Diffusion.☆11Dec 7, 2022Updated 3 years ago
- Turn any Windows precision touchpad into a touchscreen.☆12Oct 21, 2018Updated 7 years ago
- [EMNLP 2023] Semi-automatic Data Enhancement for Document-Level Relation Extraction with Distant Supervision from Large Language Models☆17Oct 30, 2023Updated 2 years ago
- [NAACL 2024 Findings] Evaluation suite for the systematic evaluation of instruction selection methods.☆24Jul 26, 2023Updated 2 years ago
- Implements a minimalistic version of Stable Cascade training☆13Oct 24, 2024Updated last year
- Using Low-rank adaptation to quickly fine-tune diffusion models.☆11Mar 14, 2023Updated 3 years ago
- Translator made fully in Python Vanilla that is able to translate in: Simplified Mandarin Chinese, Traditional Mandarin Chinese, Chinese …☆15May 28, 2023Updated 2 years ago
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution☆78Dec 8, 2025Updated 4 months ago
- [ECCV 2022] "TALISMAN: Targeted Active Learning for Object Detection with Rare Classes and Slices using Submodular Mutual Information" by…☆10Sep 21, 2022Updated 3 years ago
- ☆27Feb 1, 2025Updated last year
- Basel morphable face model mesh and texture generator using GPU.☆14Sep 14, 2020Updated 5 years ago
- 蒼蟬, a minimalist practice program for the Cangjie input method☆16Aug 31, 2025Updated 7 months ago
- Enables MacBook trackpad haptic feedback for Windows toast notifications☆18Dec 20, 2019Updated 6 years ago
- fast_faceswap uses dlib and change_style_network (fast face swapping based on dlib and a style transfer network)☆11Jul 18, 2019Updated 6 years ago
- Janus NDI Plugin☆14Nov 2, 2025Updated 5 months ago
- Chinese version of hf-alignment-handbook: a complete set of SFT, DPO, ORPO, and CPT training tutorials for large models.☆15Aug 25, 2024Updated last year
- Run pytorch models on GPU Android with Vulkan backend☆10Aug 15, 2023Updated 2 years ago
- [Paper][COLING2022] Ruleformer: Context-aware Rule Mining over Knowledge Graph☆26Nov 30, 2022Updated 3 years ago
- This project is designed to capture frames from the Ingenic T20 camera and write them to a V4L2 device.☆13Feb 20, 2023Updated 3 years ago
- A High performance and tiny TVM graph executor library written in C which can compile to WebAssembly and use CUDA/WebGPU as the accelerat…☆12Aug 3, 2023Updated 2 years ago
- Online judge (OJ) for the Nanjing University machine learning course☆25Jun 14, 2019Updated 6 years ago