A lightweight adjustment tool for smoothing token probabilities in the Qwen models to encourage balanced multilingual generation.
☆104Jul 9, 2025Updated 8 months ago
Alternatives and similar repositories for smoothie-qwen
Users that are interested in smoothie-qwen are comparing it to the libraries listed below.
- ☆64Jul 21, 2025Updated 8 months ago
- ☆12Apr 17, 2024Updated last year
- [ACL 2025] DICE-BENCH: Evaluating the Tool-Use Capabilities of Large Language Models in Multi-Round, Multi-Party Dialogues☆26Jul 10, 2025Updated 8 months ago
- Official code and dataset repository of KoBBQ (TACL 2024)☆19May 13, 2024Updated last year
- This repository aims to develop CoT Steering based on CoT without Prompting. It focuses on enhancing the model’s latent reasoning capabil…☆115Jun 25, 2025Updated 9 months ago
- ☆12Mar 25, 2022Updated 4 years ago
- ☆11Sep 19, 2025Updated 6 months ago
- Ko-Arena-Hard-Auto: An automatic LLM benchmark for Korean☆22Apr 23, 2025Updated 11 months ago
- Translation of the StrategyQA dataset☆23Apr 12, 2024Updated last year
- Korean embedding model specialized for the financial domain☆22Aug 8, 2024Updated last year
- Official repository for KoMT-Bench built by LG AI Research☆71Aug 8, 2024Updated last year
- Sakura-SOLAR-DPO: Merge, SFT, and DPO☆116Dec 30, 2023Updated 2 years ago
- 42dot LLM consists of a pre-trained language model, 42dot LLM-PLM, and a fine-tuned model, 42dot LLM-SFT, which is trained to respond to …☆132Mar 7, 2024Updated 2 years ago
- ☆116Feb 25, 2026Updated last month
- ☆10Oct 21, 2022Updated 3 years ago
- The Universe of Evaluation. All about the evaluation for LLMs.☆233Jul 9, 2024Updated last year
- Evaluating language model responses using a reward model☆29Feb 23, 2024Updated 2 years ago
- ☆39Mar 11, 2025Updated last year
- ☆19Sep 20, 2022Updated 3 years ago
- Apple's Cut Cross Entropy☆30Jan 19, 2025Updated last year
- ☆103Apr 11, 2025Updated 11 months ago
- Ollama MCP Agent allows you to use LLM models locally on your PC for free along with MCP additional features☆62May 6, 2025Updated 10 months ago
- BPE-based Korean T5 model for a unified text-to-text framework☆63Apr 17, 2024Updated last year
- A multi-domain reasoning benchmark for Korean language models☆201Oct 17, 2024Updated last year
- Weak Labeling (NER) using ChatGPT☆37Mar 28, 2023Updated 3 years ago
- The unofficial CLI of Amazon S3 Vectors (Preview) in Rust☆15Jul 19, 2025Updated 8 months ago
- hllama is a library which aims to provide a set of utility tools for large language models.☆10Apr 16, 2024Updated last year
- KURE: an embedding model specialized for Korean retrieval, developed at Korea University☆209Feb 26, 2026Updated last month
- KoTAN: Korean Translation and Augmentation with fine-tuned NLLB☆23Jan 4, 2024Updated 2 years ago
- ☆36Oct 4, 2023Updated 2 years ago
- ☆10Sep 13, 2024Updated last year
- 🤗 Sample code for training an LM with minimal setup