基于Llama3,通过进一步CPT,SFT,ORPO得到的中文版Llama3
☆16Apr 24, 2024Updated 2 years ago
Alternatives and similar repositories for Llama3-Chinese-ORPO
Users that are interested in Llama3-Chinese-ORPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆31Jan 11, 2026Updated 5 months ago
- The trainer for HF to record losses of different tasks and objectives.☆54Mar 12, 2025Updated last year
- Introduction about AWESOME_ENTROPY+LRM_PAPERS☆31Dec 16, 2025Updated 6 months ago
- Code and models for EMNLP 2024 paper "WPO: Enhancing RLHF with Weighted Preference Optimization"☆41Sep 24, 2024Updated last year
- pre-training llama3 using chinese☆13May 1, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering☆63Dec 5, 2024Updated last year
- MLLM @ Game☆17May 12, 2025Updated last year
- 演示Gemma中文指令微调的教程☆45Feb 26, 2024Updated 2 years ago
- (撰写ing..)本仓库偏教程性质,以「模型中文化」为一个典型的模型训练问题切入场景,指导读者上手学习LLM二次微调训练。☆36Aug 5, 2024Updated last year
- 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.☆14Sep 1, 2025Updated 9 months ago
- StreamlitとLangGraphで実装したHuman-in-the-loop広告コピー文生成アプリケーション☆11Feb 15, 2025Updated last year
- Latest open-source "Thinking with images" (O3/O4-mini) papers, covering training-free, SFT-based, and RL-enhanced methods for "fine-grain…☆113Aug 21, 2025Updated 9 months ago
- A mod that injects MGL and patches Minecraft to work with it.☆12Apr 10, 2024Updated 2 years ago
- cracked prompt of famous coding agent and autodev☆22Mar 19, 2026Updated 3 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- GFPGAN face reconstruction with ncnn on a bare Raspberry Pi☆14Jan 4, 2023Updated 3 years ago
- a autodl environment for native finetune stable diffusion.☆11Dec 7, 2022Updated 3 years ago
- [EMNLP 2023] Semi-automatic Data Enhancement for Document-Level Relation Extraction with Distant Supervision from Large Language Models☆17Oct 30, 2023Updated 2 years ago
- This repo is reproduction resources for linear alignment paper, still working☆17May 19, 2024Updated 2 years ago
- PyTorch implementation of "A Simple Baseline for Low-Budget Active Learning".☆14Dec 22, 2021Updated 4 years ago
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution☆81Dec 8, 2025Updated 6 months ago
- ☆13Oct 23, 2018Updated 7 years ago
- speaker-disentangled speech linguistic content quantizer☆25Mar 19, 2025Updated last year
- [ECCV 2022] "TALISMAN: Targeted Active Learning for Object Detection with Rare Classes and Slices using Submodular Mutual Information" by…☆10Sep 21, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A discord bot with multiple features like music, reverse image search and more!☆10Jun 11, 2026Updated last week
- Survey on Data-centric Large Language Models☆93Jul 8, 2024Updated last year
- Basel morphable face model mesh and texture generator using GPU.☆14Sep 14, 2020Updated 5 years ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆131Jun 11, 2025Updated last year
- rewrite python scipy.signal.lfilter in c code☆11Aug 13, 2019Updated 6 years ago
- Janus NDI Plugin☆14Nov 2, 2025Updated 7 months ago
- 中文版hf-alignment-handbook,大模型全套sft、dpo、orpo、cpt训练教程.☆15Aug 25, 2024Updated last year
- Run pytorch models on GPU Android with Vulkan backend☆10Aug 15, 2023Updated 2 years ago
- [Paper][COLING2022] Ruleformer: Context-aware Rule Mining over Knowledge Graph☆26Nov 30, 2022Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- LLM-driven browser automation library built on Playwright with 67 CLI/SDK tools, stable snapshot refs, and stealth mode.基于 Playwright 的 L…☆74May 19, 2026Updated last month
- A High performance and tiny TVM graph executor library written in C which can compile to WebAssembly and use CUDA/WebGPU as the accelerat…☆12Aug 3, 2023Updated 2 years ago
- This project is designed to capture frames from the Ingenic T20 camera and write them to a V4L2 device.☆13Feb 20, 2023Updated 3 years ago
- This repository shows a demo of real-time Digital Makeup for a face. It can transference the hair style, foundation make-up, eyelash, lip…☆13Jul 15, 2018Updated 7 years ago
- A Command line interface for Srun3k Client for HAUT☆12Jul 3, 2018Updated 7 years ago
- Unofficial SDL with added custom native Visual Studio project build tools. SDL: SDL Simple DirectMedia Layer is a cross-platform develop…☆12Jan 26, 2026Updated 4 months ago
- 一个在线代码编辑器(online ide),可用于代码练习和题目AC。☆10Aug 1, 2022Updated 3 years ago