Gemma-SFT, gemma-2b/gemma-7b微调(finetune,transformers)/LORA(peft)/推理(inference)
☆31May 17, 2024Updated last year
Alternatives and similar repositories for gemma-sft
Users that are interested in gemma-sft are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Llama2-SFT, Llama-2-7B微调(transformers)/LORA(peft)/推理☆27Jul 26, 2023Updated 2 years ago
- 语音合成VITS 纯中文微调☆12Mar 15, 2023Updated 3 years ago
- Qwen1.5大模型微调、基于PEFT框架LoRA微调,在数据集HC3-Chinese上实现文本分类。☆12Jun 29, 2024Updated last year
- 一款数据标注工具(仿照百度在线标注平台)☆13Jul 5, 2021Updated 4 years ago
- Recent papers on Graph Neural Networks-based Recommender System.☆12Aug 21, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 大语言模型微调的项目,包含了使用QLora微调ChatGLM和LLama☆29Jun 26, 2023Updated 2 years ago
- ☆11Feb 3, 2025Updated last year
- LlaMA3-SFT, Meta-Llama-3-8B/Meta-Llama-3-8B-Instruct微调(transformers)/LORA(peft)/推理, 支持中文(chinese, zh)☆34May 17, 2024Updated last year
- trying to reproduce suno v3☆34Jan 29, 2025Updated last year
- We systematically studied the influencing factors when LLM generates benchmarks,By using our code, you can generate high-quality QA datas…☆20May 20, 2025Updated 10 months ago
- Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理☆73May 17, 2024Updated last year
- Pytorch Implementation of "Rethinking Long-tailed Dataset Distillation: A Uni-Level Framework with Unbiased Recovery and Relabeling", AAA…☆25Nov 25, 2025Updated 4 months ago
- Show your WakaTime statistics in a pinned gist for your GitHub profile☆11Updated this week
- AUTOMATIC111/stable-difusion-webui的Golang API服务端☆13Jul 10, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆30Aug 8, 2024Updated last year
- [ICLR 2024] Unveiling the Pitfalls of Knowledge Editing for Large Language Models☆22Jun 13, 2024Updated last year
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- PULSE-EVAL☆24Jan 12, 2024Updated 2 years ago
- Firmware for Xilinx Platform Cable 1 USB Jtag adapter☆10Jul 24, 2016Updated 9 years ago
- 这个是用c++获取机器mac地址,当前用户名,硬盘序列号,内存大小然后封装成dll给go调用的程序。☆12Nov 24, 2018Updated 7 years ago
- it's a train acoustics model code lib☆27May 20, 2020Updated 5 years ago
- This is a repository for paper titled, PlaSma: Making Small Language Models Better Procedural Knowledge Models for (Counterfactual) Plann…☆14Nov 3, 2023Updated 2 years ago
- it's ASR decoder and make graph project☆33May 26, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆22May 9, 2025Updated 11 months ago
- A virtual machine implementation of "伟福" COP2000 development board (microinstruction level)☆17Dec 22, 2022Updated 3 years ago
- ☆30Nov 5, 2024Updated last year
- 本项目主要对开源的MOSS SFT数据进行整理 ,转换成mnbvc多轮对话格式。MOSS-003涵盖用性、忠实性、无害性三个层面,共353w样本,MOSS-003 包含更细粒度的有用性类别标记、更广泛的无害性数据和更长对话轮数,共630w样本,☆12Dec 3, 2023Updated 2 years ago
- 使用深度学习模型LSTM和ConvLSTM结合Attention,对金融衍生品的成交持仓比指标进行预测☆19Jan 7, 2022Updated 4 years ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆15Mar 11, 2024Updated 2 years ago
- Range-based algorithms in Go☆14Sep 10, 2023Updated 2 years ago
- 华东师范大学数据科学与工程学院 2018年本科生暑期项目进度汇总☆10Sep 17, 2018Updated 7 years ago
- the datasets of our paper☆11Feb 26, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [ICML 2025] Closed-Loop Long-Horizon Robotic Planning via Equilibrium Sequence Modeling☆13May 5, 2025Updated 11 months ago
- Source code for EMNLP2022 long paper: Parameter-Efficient Tuning Makes a Good Classification Head☆14Nov 7, 2022Updated 3 years ago
- Siphon mock SSDB slave server, sync data between ssdb master and redis (or pika) server.☆13May 22, 2020Updated 5 years ago
- Implementation of the paper: VG4D: Vision-Language Model Goes 4D Video Recognition(ICRA 2024)☆15Apr 23, 2024Updated last year
- Covert Keras models to Pytorch☆12Dec 22, 2018Updated 7 years ago
- a benchmark to evaluate the situated inductive reasoning☆15Jan 7, 2025Updated last year
- 可以成功Lora微调的Qwen-VL模型☆16Oct 27, 2023Updated 2 years ago