Gemma-SFT, gemma-2b/gemma-7b微调(finetune,transformers)/LORA(peft)/推理(inference)
☆32May 17, 2024Updated last year
Alternatives and similar repositories for gemma-sft
Users that are interested in gemma-sft are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Llama2-SFT, Llama-2-7B微调(transformers)/LORA(peft)/推理☆27Jul 26, 2023Updated 2 years ago
- 语音合成VITS 纯中文微调☆12Mar 15, 2023Updated 3 years ago
- Qwen1.5大模型微调、基于PEFT框架LoRA微调,在数据集HC3-Chinese上实现文本分类。☆12Jun 29, 2024Updated last year
- WikiQA,复现论文《Multihop Atention Networks for Qestion Answer Matching》☆11Mar 25, 2019Updated 7 years ago
- 大语言模型微调的项目,包含了使用QLora微调ChatGLM和LLama☆29Jun 26, 2023Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Demonstrate using MCP with Pydantic AI framework☆14Mar 14, 2025Updated last year
- ☆11Feb 3, 2025Updated last year
- LlaMA3-SFT, Meta-Llama-3-8B/Meta-Llama-3-8B-Instruct微调(transformers)/LORA(peft)/推理, 支持中文(chinese, zh)☆34May 17, 2024Updated last year
- trying to reproduce suno v3☆34Jan 29, 2025Updated last year
- Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理☆73May 17, 2024Updated last year
- ☆45Oct 29, 2025Updated 6 months ago
- Free chrome extension to summarize articles on the web using ChatGPT AI☆18Jan 7, 2023Updated 3 years ago
- WolvCtf-2023-Challenges-Public☆12Apr 13, 2023Updated 3 years ago
- Demo of using WASM to sandbox Plotly execution☆20Mar 30, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Show your WakaTime statistics in a pinned gist for your GitHub profile☆12Updated this week
- Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment☆16Aug 6, 2024Updated last year
- A Android client of Stable Diffusion.☆13Mar 29, 2024Updated 2 years ago
- AUTOMATIC111/stable-difusion-webui的Golang API服务端☆13Jul 10, 2023Updated 2 years ago
- ☆30Aug 8, 2024Updated last year
- ☆19Sep 24, 2022Updated 3 years ago
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- 无名杀Windows/Linux电脑版专属扩展,把zip文件(离线包,扩展或素材压缩包)拖入到游戏内即可导入☆12Dec 19, 2025Updated 4 months ago
- PULSE-EVAL☆24Jan 12, 2024Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- it's a train acoustics model code lib☆27May 20, 2020Updated 5 years ago
- it's ASR decoder and make graph project☆33May 26, 2022Updated 3 years ago
- LLM as World Models using Bayesian inference☆17May 27, 2025Updated 11 months ago
- Emacs 中看 B 站☆11Jul 27, 2025Updated 9 months ago
- Evals is a framework for evaluating OpenAI models and an open-source registry of benchmarks.☆18Mar 23, 2023Updated 3 years ago
- ☆13May 25, 2023Updated 2 years ago
- Scrape Any Website with DeepSeek and Ollama Locally for Free☆17Feb 7, 2025Updated last year
- 字符相似度, 汉字字形/拼音/语义相似度(单字, 可用于数据增强, CSC错别字检测识别任务(构建混淆集)) Chinese character font/pinyin/semantic similarity (single character, can be used f…☆22Jul 5, 2025Updated 10 months ago
- Code for "The Whole Truth and Nothing But the Truth: Faithful and Controllable Dialogue Response Generation with Dataflow Transduction an…☆10Apr 30, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆15Mar 11, 2024Updated 2 years ago
- Range-based algorithms in Go☆14Sep 10, 2023Updated 2 years ago
- ☆14Mar 5, 2024Updated 2 years ago
- [ICML 2025] Closed-Loop Long-Horizon Robotic Planning via Equilibrium Sequence Modeling☆13May 5, 2025Updated last year
- Scrape Airbnb, Booking, Hotels.com from a single JavaScript module. ❗No longer maintained.☆18Apr 18, 2023Updated 3 years ago
- nmap detection scripts for CVE-2022-45477, CVE-2022-45479, CVE-2022-45482, CVE-2022-45481☆16Apr 19, 2024Updated 2 years ago
- ☆16May 28, 2017Updated 8 years ago