A tutorial demonstrating Chinese instruction tuning of Gemma
☆45Feb 26, 2024Updated 2 years ago
Alternatives and similar repositories for Gemma-Chinese-instruction-tuning
Users that are interested in Gemma-Chinese-instruction-tuning are comparing it to the libraries listed below.
- An open source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO☆31Jun 22, 2026Updated last week
- A line-by-line annotated walkthrough of the Baichuan2 code, suitable for beginners☆211Sep 20, 2023Updated 2 years ago
- Deploys PP-YOLOE object detection with ONNXRuntime; supports the four variants PP-YOLOE-s, PP-YOLOE-m, PP-YOLOE-l, and PP-YOLOE-x, with both C++ and Python versions of the program☆22Jun 10, 2022Updated 4 years ago
- ☆12Jul 18, 2023Updated 2 years ago
- Code for Learned Thresholds Token Merging and Pruning for Vision Transformers (LTMP). A technique to reduce the size of Vision Transforme…☆17Nov 24, 2024Updated last year
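- This project uses the Firefly model training framework to fine-tune the LLAMA-2 model on the multiple-choice machine reading comprehension task (Multiple Choice MRC), achieving significant improvement.☆11Sep 16, 2023Updated 2 years ago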
- The code of CIKM 2023 (Oral Presentation): A Multi-Task Semantic Decomposition Framework with Task-specific Pre-training for Few-Shot NE…☆14Jul 19, 2024Updated last year
- CycleGAN with Spectral Normalization and Class Activation Mapping Attention implemented using MXNet.☆13Jul 25, 2021Updated 4 years ago
- LLM training parallelisms (DP, FSDP, TP, PP) in pure C☆30Jan 27, 2026Updated 5 months ago
- ☆10Nov 21, 2023Updated 2 years ago
- Watch Bilibili in Emacs☆10Jul 27, 2025Updated 11 months ago
- ☆13Apr 18, 2024Updated 2 years ago
- A synthetic training data generator for a text recognition CNN☆10Jul 8, 2019Updated 6 years ago
- This is a C++11/Lua wrapper for libevent.☆15Oct 20, 2015Updated 10 years ago
- Experiments for handwriting recognition☆14Aug 25, 2020Updated 5 years ago
- Easy fine-grained word segmentation with Jieba☆16Nov 21, 2019Updated 6 years ago
- Code for "SLIM: Explicit Slot-Intent Mapping with BERT for Joint Multi-Intent Detection and Slot Filling"☆19Nov 22, 2022Updated 3 years ago
- ☆17Jan 1, 2019Updated 7 years ago
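- A WeChat chatbot built on large language models. It supports WeChat, WeCom (enterprise WeChat), WeChat Official Accounts, and Feishu; lets you choose among GPT3.5/GPT4.0/Claude/ERNIE Bot/iFlytek Spark/Tongyi Qianwen/Gemini/LinkAI; handles text, voice, and images; can access the operating system and the internet; and supports customized enterprise customer service built on your own knowledge base.☆14Jan 15, 2024Updated 2 years ago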
- grep for context, not just text. Local-first CLI for searching documents, notes, memories, and project context.☆26Mar 8, 2026Updated 3 months ago
- [NAACL 2024 Findings] Evaluation suite for the systematic evaluation of instruction selection methods.☆24Jul 26, 2023Updated 2 years ago
- Solve Geometric & Graph Problems with Large Language Models☆32Mar 6, 2023Updated 3 years ago
- A Unity3D (u3d) approach that replaces skeletal animation with GPU-interpolated vertex animation☆12Sep 28, 2020Updated 5 years ago
- ☆17Aug 4, 2018Updated 7 years ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Mar 7, 2023Updated 3 years ago
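- Ziya-LLaMA-13B is a 13-billion-parameter large-scale pre-trained model from IDEA, based on LLaMa, capable of translation, programming, text classification, information extraction, summarization, copywriting, commonsense question answering, and mathematical calculation. The Ziya general-purpose large model has completed a three-stage training process: large-scale pre-training, multi-task supervised fine-tuning, and learning from human feedback. This document is mainly for Ziya-…☆46Jun 9, 2023Updated 3 years ago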
- EAST-inspired Tensorflow-based Text Detector☆11Feb 18, 2021Updated 5 years ago
- ☆30Jan 5, 2025Updated last year
- Common spatial analysis functions☆15Aug 27, 2021Updated 4 years ago
- NanoGPT (124M) in 5 minutes☆15Feb 14, 2025Updated last year
- This is a repository corresponding to the ACCV2022 accepted paper "Complex Handwriting Trajectory Recovery: Evaluation Metrics and Algorithm"…☆14Oct 3, 2022Updated 3 years ago
- This project is intended to build and deploy a scene detection application onto the Qualcomm Robotics Development Kit (RB5) that detects whether …☆10Jun 26, 2022Updated 4 years ago
- PersoSim - the open source eID simulator☆16May 21, 2026Updated last month
- a few utilities to analyze Caffe prototxt files☆16Sep 27, 2017Updated 8 years ago
- ☆13Dec 4, 2017Updated 8 years ago
- Baseline system for Language-based Audio Retrieval (Task 6B) in DCASE 2023 Challenge☆10Aug 8, 2023Updated 2 years ago
- Gemma-SFT: gemma-2b/gemma-7b fine-tuning (finetune, transformers), LORA (peft), and inference☆32May 17, 2024Updated 2 years ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Feb 28, 2026Updated 4 months ago
- Quick start for Chinese llama3 large models: a fine-tuning tutorial for general-purpose Chinese large language models, built on Meta-llama3.☆20Jun 19, 2024Updated 2 years ago