(1)弹性区间标准化的旋转位置词嵌入编码器+peft LORA量化训练,提高万级tokens性能支持。(2)证据理论解释学习,提升模型的复杂逻辑推理能力(3)兼容alpaca数据格式。
☆45Jul 19, 2023Updated 2 years ago
Alternatives and similar repositories for BaiYang-chatGLM2-6B
Users that are interested in BaiYang-chatGLM2-6B are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities.☆39Dec 15, 2024Updated last year
- ☆17Jul 10, 2023Updated 2 years ago
- using lear to do ner extraction☆29Mar 13, 2022Updated 4 years ago
- 使用peft库,对chatGLM-6B/chatGLM2-6B实现4bit的QLoRA高效微调,并做lora model和base model的merge及4bit的量化(quantize)。☆360Aug 22, 2023Updated 2 years ago
- 基于winform编写了一个美观的ChatGLM客户端,支持流式输出,兼容官方api☆30May 9, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A prompt set of ChatGLM-6B☆15Jul 21, 2023Updated 2 years ago
- A training and inference framework for open ner and re models! 信息抽取(实体抽取、关系抽取、事件抽取)模型的统一训练和推理框架,包含丰富的开源SOTA模型☆16Dec 31, 2024Updated last year
- ChatGLM2-6B微调, SFT/LoRA, instruction finetune☆110Jul 19, 2023Updated 2 years ago
- realize the reinforcement learning training for gpt2 llama bloom and so on llm model☆27Sep 19, 2023Updated 2 years ago
- This repository releases the code and data for utterance rewriting in open-domain dialogues.☆18Feb 24, 2023Updated 3 years ago
- Another ChatGLM2 implementation for GPTQ quantization☆55Oct 15, 2023Updated 2 years ago
- moss chat finetuning☆51Apr 23, 2024Updated last year
- ChatGLM2-6B 全参数微调,支持多轮对话的高效微调。☆402Aug 17, 2023Updated 2 years ago
- A repo for update and debug Mixtral-7x8B、MOE、ChatGLM3、LLaMa2、 BaChuan、Qwen an other LLM models include new models mixtral, mixtral 8x7b, …☆47Oct 8, 2025Updated 6 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Stochastic Weight Averaging Tutorials using pytorch.☆33Oct 23, 2020Updated 5 years ago
- 基于langchain设计的智能体任务,包含规划会话场景资源,构建子任务,任务执行器包含(MCTS)☆33Nov 10, 2025Updated 5 months ago
- 人工精调的中文对话数据集和一段chatglm的微调代码☆1,193May 3, 2025Updated 11 months ago
- 一个漂亮的拼图人机验证,后端PHP,前端HTML☆11Feb 11, 2018Updated 8 years ago
- Math24o: 高中奥林匹克数学竞赛测评集 High School Olympiad Mathematics Chinese Benchmark☆11Mar 27, 2025Updated last year
- 第一名克莱登大学二队方案分享☆18Mar 5, 2021Updated 5 years ago
- 命名实体识别☆12Dec 21, 2020Updated 5 years ago
- 思维误区: 用理想模型来思考复杂现实问题☆40Oct 21, 2020Updated 5 years ago
- Agentica: Lightweight async-first Python framework for AI agents. 轻量级异步优先的AI Agent框架,支持工具调用、RAG、多智能体和MCP。☆277Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Zhuang Chen, Tieyun Qian: Enhancing Aspect Term Extraction with Soft Prototypes. EMNLP 2020: 2107-2117☆12Dec 13, 2022Updated 3 years ago
- FinCUGE Instruction dataset☆15Apr 29, 2023Updated 2 years ago
- LLM+RAG for QA☆23Jan 15, 2024Updated 2 years ago
- Xcbwin - a simple C++ class for graphical outputs using XCB☆12May 12, 2015Updated 10 years ago
- 基于ChatGLM-6B、ChatGLM2-6B、ChatGLM3-6B模型,进行下游具体任务微调,涉及Freeze、Lora、P-tuning、全参微调等☆2,781Dec 12, 2023Updated 2 years ago
- 😄😐😠 情感分析(使用 emoji 可视化)☆10Sep 5, 2021Updated 4 years ago
- aigc evals☆10Dec 2, 2023Updated 2 years ago
- [NeurIPS 2022]MorphTE: Injecting Morphology in Tensorized Embeddings☆17Oct 29, 2022Updated 3 years ago
- Source code for Paper "Legal Feature Enhanced Semantic Matching Network for Similar Case Matching".☆15Feb 17, 2020Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A white box algorithm that generate adversarial examples according to the gradient☆11May 9, 2020Updated 5 years ago
- ☆13Jan 14, 2021Updated 5 years ago
- 微信小程序全局状态管理方案,提供响应式的app.globalData。☆11Jan 19, 2020Updated 6 years ago
- WebGLM: An Efficient Web-enhanced Question Answering System (KDD 2023)☆1,605Mar 25, 2025Updated last year
- Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调☆3,723Oct 12, 2023Updated 2 years ago
- ☆15Nov 10, 2023Updated 2 years ago
- 弱鸡偶尔刷几道题☆10Apr 22, 2021Updated 4 years ago