pre-training llama3 using chinese
☆13May 1, 2024Updated last year
Alternatives and similar repositories for llama3-Chinese
Users that are interested in llama3-Chinese are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Copy the MLP of llama3 8 times as 8 experts , created a router with random initialization,add load balancing loss to construct an 8x8b Mo…☆27Jul 1, 2024Updated last year
- Source code of paper "Systematic Assessment of Factual Knowledge in Large Language Models" - EMNLP Findings 2023☆17Mar 17, 2026Updated last week
- ☆18Apr 18, 2025Updated 11 months ago
- An AI agent memory framework that converts an agent’s own interaction traces—both successes and failures—into reusable, high-level reason…☆53Feb 9, 2026Updated last month
- COLING 2025: MBA-RAG: a Bandit Approach for Adaptive Retrieval-Augmented Generation through Question Complexity☆26Dec 23, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- 基于Llama3,通过进一步CPT,SFT,ORPO得到的中文版Llama3☆16Apr 24, 2024Updated last year
- Go bindings for LLama.cpp☆14Apr 11, 2023Updated 2 years ago
- 提取出判决书中的金额项和金额数。☆11Apr 8, 2016Updated 9 years ago
- pubg_sdk☆11Jul 26, 2020Updated 5 years ago
- macrogpt大模型全量预训练(1b3,32层), 多卡deepspeed/单卡adafactor☆15Nov 30, 2023Updated 2 years ago
- 基于KNN、神经网络、随机森林的权重的足球比赛预测☆22Updated this week
- 股票相关数据爬取整理, 行情实时监控☆14Nov 7, 2024Updated last year
- Automatically exported from code.google.com/p/hf-2011☆15Feb 12, 2016Updated 10 years ago
- GFPGAN face reconstruction with ncnn on a bare Raspberry Pi☆14Jan 4, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆12Mar 23, 2024Updated 2 years ago
- MDClub 的 JavaScript 版 SDK☆12May 29, 2022Updated 3 years ago
- 哈工大计算机系统(csapp)学习资料汇总,包括slides、实验、大作业和期末试题,供学习参考☆30Nov 27, 2021Updated 4 years ago
- DeepSearch - Advanced Web Dir Scanner☆14Nov 13, 2018Updated 7 years ago
- In-context learning, Fine-Tuning, RLHF on Flan-T5☆13Aug 30, 2023Updated 2 years ago
- ☆17Apr 18, 2024Updated last year
- 简单的 AIGC 微服务,可通过 HTTP、gRPC 连接,支持流式回答。☆10Mar 23, 2023Updated 3 years ago
- smart chinese LLm☆19Jan 31, 2024Updated 2 years ago
- ☆20May 12, 2019Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Implementation of RLHF (Reinforcement Learning with Human Feedback) and GAN (Generative Adversarial Network) on top of the T5 architectur…☆17Jan 2, 2023Updated 3 years ago
- 今日头条搜索引擎以及新闻详情页爬虫(Selenium)☆15Mar 13, 2025Updated last year
- This project aims to make a quantitative analysis of the New York City Taxi and Limousine Service (TLC) Trip Record Data.☆16Oct 19, 2023Updated 2 years ago
- 基于市值、资金流、换手率、KDJ综合权重的神经网络股票预测☆21Jan 22, 2026Updated 2 months ago
- AGPL licensed Octotree fork☆13Dec 22, 2020Updated 5 years ago
- Basel morphable face model mesh and texture generator using GPU.☆14Sep 14, 2020Updated 5 years ago
- Webpage backgrounds created using the HTML5 Canvas API and jwagner's Simplex Noise library in React.js☆15Oct 26, 2023Updated 2 years ago
- rewrite python scipy.signal.lfilter in c code☆11Aug 13, 2019Updated 6 years ago
- 安卓手机部署DeepSeek-R1 蒸馏的1.5B模型☆24Feb 4, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ALPS: An Auto-Labeling and Pre-training Scheme for Remote Sensing Segmentation With Segment Anything Model☆21Aug 20, 2024Updated last year
- PyTorch implementation of Tacotron-2. Tacotron-2 的 PyTorch 实现。☆14May 17, 2021Updated 4 years ago
- fast_faceswap use dlib and change_style_network(基于dlib和风格迁移网络的快速换脸)☆11Jul 18, 2019Updated 6 years ago
- Head used in Poppy Torso and Poppy Humanoid☆15Jul 3, 2021Updated 4 years ago
- ☆21Jul 20, 2024Updated last year
- Janus NDI Plugin☆13Nov 2, 2025Updated 4 months ago
- Convert map[string]string into map[string]interface using a reference struct. Optionally respect json tags.☆13May 27, 2016Updated 9 years ago