Gemma-SFT, gemma-2b/gemma-7b fine-tuning (finetune, transformers)/LoRA (peft)/inference
☆32 · May 17, 2024 · Updated last year
Alternatives and similar repositories for gemma-sft
Users that are interested in gemma-sft are comparing it to the libraries listed below.
- VITS speech synthesis, Chinese-only fine-tuning ☆12 · Mar 15, 2023 · Updated 3 years ago
- A data annotation tool (modeled on Baidu's online annotation platform) ☆13 · Jul 5, 2021 · Updated 4 years ago
- WikiQA, a reproduction of the paper "Multihop Attention Networks for Question Answer Matching" ☆11 · Mar 25, 2019 · Updated 7 years ago
- Recent papers on Graph Neural Networks-based Recommender System. ☆12 · Aug 21, 2023 · Updated 2 years ago
- A large language model fine-tuning project, including QLoRA fine-tuning of ChatGLM and LLaMA ☆29 · Jun 26, 2023 · Updated 2 years ago
- LlaMA3-SFT, Meta-Llama-3-8B/Meta-Llama-3-8B-Instruct fine-tuning (transformers)/LoRA (peft)/inference, with Chinese support (chinese, zh) ☆34 · May 17, 2024 · Updated last year
- trying to reproduce suno v3 ☆34 · Jan 29, 2025 · Updated last year
- Scrapes stock (A-share) information from Tonghuashun (同花顺) ☆10 · Nov 5, 2021 · Updated 4 years ago
- ☆10 · Dec 28, 2023 · Updated 2 years ago
- We systematically studied the influencing factors when LLMs generate benchmarks. By using our code, you can generate high-quality QA datas… ☆20 · May 20, 2025 · Updated 10 months ago
- Qwen1.5-SFT (Alibaba), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat fine-tuning (transformers)/LoRA (peft)/inference ☆73 · May 17, 2024 · Updated last year
- ☆19 · Apr 14, 2025 · Updated 11 months ago
- InternLM-7B fine-tuning, SFT/LoRA, instruction finetune ☆13 · May 17, 2024 · Updated last year
- Show your WakaTime statistics in a pinned gist for your GitHub profile ☆10 · Updated this week
- Speech recognition: cutting-edge papers ☆52 · Jan 8, 2022 · Updated 4 years ago
- Source code for CoNLL 2021 paper by Huebner et al. 2021 ☆20 · Jul 13, 2023 · Updated 2 years ago
- Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment ☆16 · Aug 6, 2024 · Updated last year
- An Android client for Stable Diffusion. ☆13 · Mar 29, 2024 · Updated 2 years ago
- ☆29 · Aug 8, 2024 · Updated last year
- Firmware for Xilinx Platform Cable 1 USB Jtag adapter ☆10 · Jul 24, 2016 · Updated 9 years ago
- PULSE-EVAL ☆24 · Jan 12, 2024 · Updated 2 years ago
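- This project aims to design and implement a deep-learning-based system for detecting encrypted malicious traffic. By converting network traffic data into image data and using an image classification model for detection, the system can effectively detect encrypted malicious traffic. For the dataset, it uses one containing both encrypted malicious and normal traffic, to better reflect the characteristics and behavior patterns of real network environments. For data preprocessing, by… ☆14 · Feb 28, 2024 · Updated 2 years ago
- A program that uses C++ to get the machine's MAC address, current username, hard disk serial number, and memory size, then packages it as a DLL for Go to call. ☆12 · Nov 24, 2018 · Updated 7 years ago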
- A code library for training acoustic models ☆27 · May 20, 2020 · Updated 5 years ago
- This is a repository for the paper titled PlaSma: Making Small Language Models Better Procedural Knowledge Models for (Counterfactual) Plann… ☆14 · Nov 3, 2023 · Updated 2 years ago
- Watch Bilibili in Emacs ☆11 · Jul 27, 2025 · Updated 8 months ago
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere… ☆22 · May 9, 2025 · Updated 10 months ago
- ☆13 · May 25, 2023 · Updated 2 years ago
- ☆30 · Nov 5, 2024 · Updated last year
- Character similarity: Chinese character glyph/pinyin/semantic similarity (single character; can be used for data augmentation and CSC typo detection and recognition tasks (building confusion sets)) ☆22 · Jul 5, 2025 · Updated 8 months ago
- ☆16 · May 31, 2024 · Updated last year
- Range-based algorithms in Go ☆14 · Sep 10, 2023 · Updated 2 years ago
- ☆14 · Mar 5, 2024 · Updated 2 years ago
- CAT-probing: A Metric-based Approach to Interpret How Pre-trained Models for Programming Language Attend Code Structure, EMNLP 2022 ☆13 · Dec 10, 2022 · Updated 3 years ago
- ☆13 · Nov 5, 2024 · Updated last year
- the datasets of our paper ☆11 · Feb 26, 2024 · Updated 2 years ago
- [ICML 2025] Closed-Loop Long-Horizon Robotic Planning via Equilibrium Sequence Modeling ☆12 · May 5, 2025 · Updated 10 months ago
- nmap detection scripts for CVE-2022-45477, CVE-2022-45479, CVE-2022-45482, CVE-2022-45481 ☆16 · Apr 19, 2024 · Updated last year
- Wenet speech to text for react native ☆10 · Nov 1, 2022 · Updated 3 years ago