finetune llama2 with traditional chinese dataset
☆39Aug 8, 2023Updated 2 years ago
Alternatives and similar repositories for traditional_chinese_llama2
Users that are interested in traditional_chinese_llama2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Traditional-Chinese instruction-following model with datasets based on Alpaca.☆136Mar 28, 2023Updated 3 years ago
- ☆12Oct 28, 2025Updated 5 months ago
- Chang Gung University Computer Science / Artificial Intelligence learning material☆27Sep 5, 2024Updated last year
- Code for SRMRL☆19Sep 5, 2021Updated 4 years ago
- ☆18Oct 14, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Finetune LLaMA-7B with Chinese instruction datasets☆137May 8, 2023Updated 2 years ago
- ☆19Nov 1, 2021Updated 4 years ago
- 整理 DevOps Taiwan FB 社團上的貼文☆19Sep 15, 2016Updated 9 years ago
- ☆24Oct 19, 2021Updated 4 years ago
- Collection of papers using LLaMA as backbone model☆47Apr 6, 2025Updated 11 months ago
- Traditional Mandarin LLMs for Taiwan☆1,399Apr 20, 2025Updated 11 months ago
- ☆14Aug 16, 2023Updated 2 years ago
- Transitioning from Open-Domain Chit-Chat to Task-Oriented Dialogues☆43Apr 25, 2022Updated 3 years ago
- Chunk-based neural machine translation☆17Apr 24, 2022Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Setup an MCP server in 60 seconds.☆13Dec 12, 2024Updated last year
- Repository contains demo code for MTAnchor, an interactive, multilingual topic modeling system. The code accompanies the paper Multiling…☆12Jan 25, 2019Updated 7 years ago
- This package supports implementation of anchor-based topic modeling and variants of the anchoring algorithm in Python 3.☆15Sep 17, 2018Updated 7 years ago
- ☆12May 23, 2022Updated 3 years ago
- Are foundation LMs multilingual knowledge bases? (EMNLP 2023)☆19Dec 8, 2023Updated 2 years ago
- ☆25Nov 17, 2020Updated 5 years ago
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi…☆23Oct 1, 2025Updated 5 months ago
- ☆77Jun 28, 2025Updated 9 months ago
- マウスクリックで指定した座標を矩形に射影変換するプログラム。☆10Jul 9, 2020Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Full finetuning of large language models without large memory requirements☆94Sep 22, 2025Updated 6 months ago
- 批踢踢動態密碼☆18May 31, 2020Updated 5 years ago
- DatasetResearch: Benchmarking Agent Systems for Demand-Driven Dataset Discovery☆20Sep 24, 2025Updated 6 months ago
- A study group related to GNN☆26May 15, 2020Updated 5 years ago
- Textless (ASR-transcript free) Spoken Question Answering. The official release of NMSQA dataset and the implementation of "DUAL: Textless…☆35Aug 10, 2023Updated 2 years ago
- GPT-Critic: Offline Reinforcement Learning for End-to-End Task-Oriented Dialogue Systems☆10Jul 7, 2022Updated 3 years ago
- Awesome list for High Performance Computing / Parallel Computing resources.☆12Sep 20, 2017Updated 8 years ago
- Source code of our EMNLP 2024 paper "FactAlign: Long-form Factuality Alignment of Large Language Models"☆19Oct 3, 2024Updated last year
- Original code for our work on Sentiment Look-ahead.☆18Apr 27, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆17Feb 9, 2026Updated last month
- Joint magnitude estimation and phase recovery using Cycle-in-Cycle GAN for non-parallel speech enhancement☆10Jan 24, 2022Updated 4 years ago
- ☆20Sep 11, 2025Updated 6 months ago
- Japanese / English Bilingual LLM☆28Dec 23, 2025Updated 3 months ago
- 聯發創新基地(MediaTek Research) 致力於研究基礎模型。我們將研究體現在適合繁體中文使用者的模型上,並在使用權許可的情況下,提供模型給學術界研究或產業界使用。☆268Sep 8, 2025Updated 6 months ago
- ☆12Mar 4, 2025Updated last year
- A minimal re-implementation of orthogonal fine-tuning (OFT), a diffusion method, for LLMs. Based on nanoGPT and minLoRA.☆14Nov 17, 2023Updated 2 years ago