LORA微调BLOOMZ,参考BELLE
☆25Mar 24, 2023Updated 3 years ago
Alternatives and similar repositories for BELLE-LORA
Users that are interested in BELLE-LORA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆19Jul 20, 2023Updated 2 years ago
- 用多层BLSTM模型同时进行中文分词和标点符号预测☆18Nov 8, 2024Updated last year
- 2019达观杯实体识别☆19Sep 12, 2019Updated 6 years ago
- 探索中文instruct数据在ChatGLM, LLaMA上的微调表现☆389Apr 4, 2023Updated 3 years ago
- chatglm-6b微调/LORA/PPO/推理, 样本为自动生成的整数/小数加减乘除运算, 可gpu/cpu☆165Aug 24, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Experiment with JNI access to some Kaldi functions.☆12Dec 31, 2018Updated 7 years ago
- Question and answer system based on sentence similarity☆25Apr 29, 2019Updated 7 years ago
- ☆17Oct 22, 2020Updated 5 years ago
- Finetune Bloom big language model with Lora method☆32Jun 9, 2023Updated 2 years ago
- This repository is for the paper "Confusionset-guided Pointer Networks for Chinese Spelling Check"☆59Oct 25, 2019Updated 6 years ago
- Hierarchy-aware Label Semantics Matching Network for Hierarchical Text Classification☆45Feb 21, 2023Updated 3 years ago
- Use the famous language model, xlnet, to do sequence tagging/ sequence labelling/ named entity recognition(NER) / noun extraction;☆18Sep 30, 2019Updated 6 years ago
- https://github.com/ARM-software/ML-KWS-for-MCU☆14Jul 8, 2018Updated 7 years ago
- Resources for our IJCAI 2020 paper, TopicKA: Generating Commonsense Knowledge-Aware Dialogue Responses Towards the Recommended Topic Fact☆12Nov 30, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- MCP server for creating UI flowcharts☆11Jan 5, 2025Updated last year
- Hugging Face RoBERTa with Flash Attention 2☆24Sep 14, 2025Updated 8 months ago
- ☆13Sep 25, 2024Updated last year
- [ICLR 2023] Official repository of the paper "Rethinking the Effect of Data Augmentation in Adversarial Contrastive Learning"☆19Feb 19, 2023Updated 3 years ago
- This repo provides the implemetation of the paper How to train your agent to read and write?☆10Dec 29, 2020Updated 5 years ago
- Unofficial implementation of the Ask-LLM paper 'How to Train Data-Efficient LLMs', arXiv:2402.09668.☆12Jun 19, 2024Updated last year
- 汉语框架语义解析☆18Apr 25, 2023Updated 3 years ago
- A Slot-filling based Dialog Manager for Task-oriented Bot☆12Dec 29, 2016Updated 9 years ago
- OPD: Chinese Open-Domain Pre-trained Dialogue Model☆73Jun 5, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 基于pytorch的GlobalPointer进行中文命名实体识别。☆39Jul 7, 2023Updated 2 years ago
- EasyTTS是一个便捷的工具,旨在方便地使用第三方API服务来调用OpenAI的文本转语音(TTS)功能。 EasyTTS允许用户输入文本,并选择不同的模型、音色、格式来生成音频文件 。☆10Nov 26, 2023Updated 2 years ago
- ☆18Jul 25, 2025Updated 10 months ago
- Multimodal extreme classification☆21May 1, 2024Updated 2 years ago
- Convert Huggingface Pytorch checkpoint to Tensorflow checkpoint☆17Sep 4, 2023Updated 2 years ago
- Chinese Word Segmentation task based on BERT and implemented in Pytorch☆14Aug 14, 2020Updated 5 years ago
- 医学预训练语言模型☆18Dec 17, 2020Updated 5 years ago
- Neural network sequence labeling model - some sloppy modifications to the original toolkit to enable punctuation restoration in unsegment…☆10Jan 8, 2017Updated 9 years ago
- RNN model to punctuate degraded text with no punctuation, and an application that combines it with Watson TTS for automated transcription…☆10Apr 9, 2017Updated 9 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- About Hierarchical Muti-Label Text Classification based on hybrid method (local & global).☆16Jan 8, 2019Updated 7 years ago
- ☆37Jun 28, 2021Updated 4 years ago
- ☆14Aug 26, 2024Updated last year
- ☆12Nov 23, 2020Updated 5 years ago
- A tensorflow implementation of VHRED(A Hierarchical Latent Variable Encoder-Decoder Model for Generating Dialogues)☆17Mar 24, 2019Updated 7 years ago
- llama,chatglm 等模型的微调☆91Jul 18, 2024Updated last year
- 英文文献的《中国图书馆分类法》自动标注小程序☆12Oct 29, 2024Updated last year