(1)弹性区间标准化的旋转位置词嵌入编码器+peft LORA量化训练,提高万级tokens性能支持。(2)证据理论解释学习,提升模型的复杂逻辑推理能力(3)兼容alpaca数据格式。
☆45Jul 19, 2023Updated 2 years ago
Alternatives and similar repositories for BaiYang-chatGLM2-6B
Users that are interested in BaiYang-chatGLM2-6B are comparing it to the libraries listed below
Sorting:
- A training and inference framework for open ner and re models! 信息抽取(实体抽取、关系抽取、事件抽取)模型的统一训练和推理框架,包含丰富的开源SOTA模型☆14Dec 31, 2024Updated last year
- realize the reinforcement learning training for gpt2 llama bloom and so on llm model☆27Sep 19, 2023Updated 2 years ago
- ☆17Jul 10, 2023Updated 2 years ago
- A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities.☆39Dec 15, 2024Updated last year
- ChatGLM2-6B微调, SFT/LoRA, instruction finetune☆110Jul 19, 2023Updated 2 years ago
- 使用peft库,对chatGLM-6B/chatGLM2-6B实现4bit的QLoRA高效微调,并做lora model和base model的merge及4bit的量化(quantize)。☆359Aug 22, 2023Updated 2 years ago
- DeepSparkInference has selected 216 inference models of both small and large sizes. The small models cover fields such as computer vision…☆27Updated this week
- Another ChatGLM2 implementation for GPTQ quantization☆55Oct 15, 2023Updated 2 years ago
- using lear to do ner extraction☆29Mar 13, 2022Updated 3 years ago
- Automated DAOPHOT-based PSF photometry pipeline☆11Sep 5, 2025Updated 6 months ago
- ☆18Sep 23, 2025Updated 5 months ago
- 基于langchain设计的智能体任务,包含规划会话场景资源,构建子任务,任务执行器包含(MCTS)☆33Nov 10, 2025Updated 3 months ago
- Informative Conversational Query Rewriting☆37Jan 29, 2024Updated 2 years ago
- An unofficial Implementation of DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents☆37Sep 9, 2023Updated 2 years ago
- LLMs as Collaboratively Edited Knowledge Bases☆46Feb 8, 2026Updated last month
- Humanable Chat Generative-model Fine-tuning | LLM微调☆207Sep 22, 2023Updated 2 years ago
- ☆12Sep 25, 2023Updated 2 years ago
- Clustering algorithms processing methods on astronomical spectra.☆10Oct 24, 2023Updated 2 years ago
- 基于ChatGLM-6B、ChatGLM2-6B、ChatGLM3-6B模型,进行下游具体任务微调,涉及Freeze、Lora、P-tuning、全参微调等☆2,780Dec 12, 2023Updated 2 years ago
- A plug-and-play tool for visualizing attention-score heatmap in generative LLMs. Easy to customize for your own need.☆51May 16, 2024Updated last year
- 思维误区: 用理想模型来思考复杂现实问题☆40Oct 21, 2020Updated 5 years ago
- deep learning☆150May 6, 2025Updated 10 months ago
- WebGLM: An Efficient Web-enhanced Question Answering System (KDD 2023)☆1,606Mar 25, 2025Updated 11 months ago
- Evolution of the CSharpFits (1.1) library from http://vo.iucaa.ernet.in/~voi/CSharpFITS.html☆10Sep 28, 2025Updated 5 months ago
- Protocol buffers and other common resources.☆13Mar 2, 2026Updated last week
- A SVR implementation using MATLAB quadprog☆11Aug 29, 2017Updated 8 years ago
- This project is based on the [LTX-Video](https://github.com/Lightricks/LTX-Video) algorithm of the diffusers and optimized and accelerate…☆13Dec 31, 2024Updated last year
- Including several social-media-computing tools.☆11Jan 4, 2019Updated 7 years ago
- 一个漂亮的拼图人机验证,后端PHP,前端HTML☆11Feb 11, 2018Updated 8 years ago
- ☆10Apr 27, 2021Updated 4 years ago
- vibe cli 模版☆17Jul 2, 2025Updated 8 months ago
- Matlab code for "Joint Projection Learning and Tensor Decomposition Based Incomplete Multi-view Clustering".☆10Jun 5, 2023Updated 2 years ago
- 深度矩阵分解模型 与 带注意力的深度矩阵分解模型☆10May 17, 2018Updated 7 years ago
- ☆102Dec 23, 2024Updated last year
- 我们是第一个完全可商用的角色大模型。☆40Aug 11, 2024Updated last year
- 一个基于 Paddle Inference 封装的用于快速部署的高层 API☆33Nov 13, 2021Updated 4 years ago
- ☆28Jan 5, 2026Updated 2 months ago
- ML framework to estimate Bayesian posteriors of galaxy morphological parameters☆11Jul 10, 2025Updated 7 months ago
- 常用开源软件(Jaeger,grafana,consul,prometheus,nginx-ingress-controller)及常用资源(deployment,svc,ingress...) K8s部署Yaml合集☆12Jun 27, 2020Updated 5 years ago