Fine-tuning ChatGLM with LoRA.
☆49 Jun 26, 2023 Updated 2 years ago
Alternatives and similar repositories for ChatGLM-LoRA-Tuning
Users that are interested in ChatGLM-LoRA-Tuning are comparing it to the libraries listed below.
- Fine-tuning large models with instruction tuning. ☆11 Jun 28, 2023 Updated 2 years ago
- ☆19 Mar 11, 2026 Updated 3 months ago
- RT from How far is Language Model from 100 medical NER ☆11 Dec 17, 2024 Updated last year
- Instruction tuning of the BLOOM model ☆24 Jun 15, 2023 Updated 3 years ago
- Semi-automatic generation of financial analysis reports ☆33 Updated this week
- Fine-tuning for specific downstream tasks based on the ChatGLM-6B, ChatGLM2-6B, and ChatGLM3-6B models, covering Freeze, LoRA, P-tuning, full-parameter fine-tuning, and more ☆2,778 Dec 12, 2023 Updated 2 years ago
- Named entity recognition model based on BERT-CRF ☆13 Mar 14, 2022 Updated 4 years ago
- An Evaluation of ChatGPT on Information Extraction task, including Named Entity Recognition (NER), Relation Extraction (RE), Event Extrac… ☆134 Jan 17, 2024 Updated 2 years ago
- Fast instruction tuning with Llama2 ☆11 Apr 8, 2024 Updated 2 years ago
- This is the repository for the paper ‘A Survey of Inductive Reasoning for Large Language Models’ (ACL2026) ☆46 Apr 8, 2026 Updated 2 months ago
- Chinese Financial Assistant Benchmark for Large Language Model ☆55 Jul 30, 2025 Updated 10 months ago
- A large language model fine-tuning project, including QLoRA fine-tuning of ChatGLM and LLaMA ☆29 Jun 26, 2023 Updated 2 years ago
- Short Text Similarity as described in https://dl.acm.org/citation.cfm?id=2806475 ☆17 Feb 7, 2019 Updated 7 years ago
- This is the repository for the paper 'DiaHalu: A Dialogue-level Hallucination Evaluation Benchmark for Large Language Models' (EMNLP2024 … ☆18 Apr 5, 2025 Updated last year
- LLM for NER ☆83 Jul 29, 2024 Updated last year
- Dataset and codes for SEntFiN ☆10 May 31, 2023 Updated 3 years ago
- SDRAM controller written in Verilog ☆13 Jun 22, 2019 Updated 6 years ago
- JDDC baseline Seq2Seq model ☆43 May 8, 2018 Updated 8 years ago
- Dataset of Chinese classical literature ☆22 Jun 29, 2023 Updated 2 years ago
- Named entity recognition for Classical Chinese, based on BiLSTM+CRF; recognized entity types include person, location, organization, time, and others. ☆10 Jan 19, 2021 Updated 5 years ago
- CNRec Data Associated with Content based News Recommendation via Shortest Entity Distance over Knowledge Graph ☆10 Feb 26, 2019 Updated 7 years ago
- [ACL 2024] Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive Decoding ☆17 Nov 10, 2025 Updated 7 months ago
- A dataset used for NLP tasks. ☆10 Apr 17, 2021 Updated 5 years ago
- Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning ☆24 Sep 9, 2024 Updated last year
- Code for ICLR 2025 Paper: Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs ☆25 May 7, 2025 Updated last year
- On-the-fly Definition Augmentation of LLMs for Biomedical NER ☆14 Apr 14, 2025 Updated last year
- FinCUGE Instruction dataset ☆16 Apr 29, 2023 Updated 3 years ago
- [ACL 2025 Findings] Official pytorch implementation of "Don't Miss the Forest for the Trees: Attentional Vision Calibration for Large Vis… ☆25 Jul 21, 2024 Updated last year
- ☆21 Oct 6, 2021 Updated 4 years ago
- ☆13 Sep 28, 2020 Updated 5 years ago
- GUI useful to manually annotate text for Named Entity Recognition purposes ☆14 Jun 22, 2023 Updated 2 years ago
- ☆23 Mar 9, 2023 Updated 3 years ago
- Source code of the "Graph-Bert: Only Attention is Needed for Learning Graph Representations" paper ☆15 Jan 22, 2020 Updated 6 years ago
- The code of Team Rhinobird for Mining the Web of HTML-embedded Product Data Task One at ISWC2020 ☆14 Aug 26, 2020 Updated 5 years ago
- ☆11 May 2, 2023 Updated 3 years ago
- [NeurIPS 2023] TFLEX: Temporal Feature-Logic Embedding Framework for Complex Reasoning over Temporal Knowledge Graph ☆43 Oct 17, 2025 Updated 8 months ago
- A dataset of news headlines for detecting causalities ☆14 May 9, 2022 Updated 4 years ago
- Rank 10 in the 2019 Sohu 3rd Content Recognition Challenge ☆11 Oct 17, 2019 Updated 6 years ago
- Source code for the paper "Attention Is (not) All You Need for Commonsense Reasoning" published at ACL 2019. ☆14 Aug 2, 2024 Updated last year