Gemma-SFT, gemma-2b/gemma-7b微调(finetune,transformers)/LORA(peft)/推理(inference)
☆32May 17, 2024Updated 2 years ago
Alternatives and similar repositories for gemma-sft
Users that are interested in gemma-sft are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Qwen1.5大模型微调、基于PEFT框架LoRA微调,在数据集HC3-Chinese上实现文本分类。☆12Jun 29, 2024Updated last year
- WikiQA,复现论文《Multihop Atention Networks for Qestion Answer Matching》☆11Mar 25, 2019Updated 7 years ago
- 大语言模型微调的项目,包含了使用QLora微调ChatGLM和LLama☆29Jun 26, 2023Updated 2 years ago
- This project is a basic text-based adventure game. The idea of the game was inspired by Colossal Cave Adventure.☆13Jul 21, 2021Updated 4 years ago
- Automatic and generic measures of verbal alignment in dyadic dialogue based on sequential pattern mining at the level of surface of text …☆13May 11, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- LlaMA3-SFT, Meta-Llama-3-8B/Meta-Llama-3-8B-Instruct微调(transformers)/LORA(peft)/推理, 支持中文(chinese, zh)☆34May 17, 2024Updated 2 years ago
- Resources for CMT122 students (2024-2025).☆14Jan 4, 2026Updated 4 months ago
- trying to reproduce suno v3☆34Jan 29, 2025Updated last year
- 爬取同花顺的股票(A股)信息☆10Nov 5, 2021Updated 4 years ago
- We systematically studied the influencing factors when LLM generates benchmarks,By using our code, you can generate high-quality QA datas…☆20May 20, 2025Updated last year
- Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理☆73May 17, 2024Updated 2 years ago
- This repo contains projects related to Vision, NLP and Reinforcement Learning☆16Apr 30, 2022Updated 4 years ago
- Keras CNN multi model (Custom + LeNet-5) ensemble with voting on MNIST dataset☆11Jan 29, 2019Updated 7 years ago
- InternLM-7B微调, SFT/LoRA, instruction finetune☆13May 17, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Show your WakaTime statistics in a pinned gist for your GitHub profile☆12May 18, 2026Updated last week
- 语音识别 论文 前沿☆53Jan 8, 2022Updated 4 years ago
- Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment☆16Aug 6, 2024Updated last year
- type vmess domain of v2ray, convert to ip address.☆10Nov 16, 2020Updated 5 years ago
- AUTOMATIC111/stable-difusion-webui的Golang API服务端☆13Jul 10, 2023Updated 2 years ago
- Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding.☆13Nov 19, 2024Updated last year
- 详细介绍知名大厂在搜索、推荐、广告等工业界的实践、前沿论文、技术干货分享☆21Mar 24, 2024Updated 2 years ago
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- 无名杀Windows/Linux电脑版专属扩展,把zip文件(离线包,扩展或素材压缩包)拖入到游戏内即可导入☆12Dec 19, 2025Updated 5 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- PULSE-EVAL☆24Jan 12, 2024Updated 2 years ago
- 这个是用c++获取机器mac地址,当前用户名,硬盘序列号,内存大小然后封装成dll给go调用的程序。☆12Nov 24, 2018Updated 7 years ago
- ☆24May 13, 2026Updated 2 weeks ago
- it's a train acoustics model code lib☆27May 20, 2020Updated 6 years ago
- This is a repository for paper titled, PlaSma: Making Small Language Models Better Procedural Knowledge Models for (Counterfactual) Plann…☆14Nov 3, 2023Updated 2 years ago
- it's ASR decoder and make graph project☆33May 26, 2022Updated 4 years ago
- LLM as World Models using Bayesian inference☆18May 27, 2025Updated last year
- Evals is a framework for evaluating OpenAI models and an open-source registry of benchmarks.☆18Mar 23, 2023Updated 3 years ago
- Human activity recognition using TensorFlow on smartphone sensors dataset and an LSTM RNN. Classifying the type of movement amongst six c…☆16Mar 29, 2018Updated 8 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆14May 25, 2023Updated 3 years ago
- 本项目主要对开源的MOSS SFT数据进行整理 ,转换成mnbvc多轮对话格式。MOSS-003涵盖用性、忠实性、无害性三个层面,共353w样本,MOSS-003 包含更细粒度的有用性类别标记、更广泛的无害性数据和更长对话轮数,共630w样本,☆13Dec 3, 2023Updated 2 years ago
- ENACT is a benchmark that evaluates embodied cognition through world modeling from egocentric interaction. It is designed to be simple an…☆50Nov 27, 2025Updated 6 months ago
- 字符相似度, 汉字字形/拼音/语义相似度(单字, 可用于数据增强, CSC错别字检测识别任务(构建混淆集)) Chinese character font/pinyin/semantic similarity (single character, can be used f…☆23Jul 5, 2025Updated 10 months ago
- ☆16May 31, 2024Updated last year
- Range-based algorithms in Go☆14Sep 10, 2023Updated 2 years ago
- ☆14Mar 5, 2024Updated 2 years ago