A lightweight adjustment tool for smoothing token probabilities in the Qwen models to encourage balanced multilingual generation.
☆104Jul 9, 2025Updated 9 months ago
Alternatives and similar repositories for smoothie-qwen
Users that are interested in smoothie-qwen are comparing it to the libraries listed below.
- ☆64Jul 21, 2025Updated 8 months ago
- Code implementation that prevents LLM models from generating foreign-language tokens☆85Aug 7, 2025Updated 8 months ago
- ☆13Apr 17, 2024Updated 2 years ago
- [ACL 2025] DICE-BENCH: Evaluating the Tool-Use Capabilities of Large Language Models in Multi-Round, Multi-Party Dialogues☆26Jul 10, 2025Updated 9 months ago
- Official code and dataset repository of KoBBQ (TACL 2024)☆19May 13, 2024Updated last year
- This repository aims to develop CoT Steering based on CoT without Prompting. It focuses on enhancing the model’s latent reasoning capabil…☆115Jun 25, 2025Updated 9 months ago
- ☆12Mar 25, 2022Updated 4 years ago
- Ko-Arena-Hard-Auto: An automatic LLM benchmark for Korean☆22Apr 23, 2025Updated 11 months ago
- Translation of the StrategyQA dataset☆22Apr 12, 2024Updated 2 years ago
- Korean embedding model specialized for the financial domain☆22Aug 8, 2024Updated last year
- Official repository for KoMT-Bench built by LG AI Research☆71Aug 8, 2024Updated last year
- Sakura-SOLAR-DPO: Merge, SFT, and DPO☆116Dec 30, 2023Updated 2 years ago
- 42dot LLM consists of a pre-trained language model, 42dot LLM-PLM, and a fine-tuned model, 42dot LLM-SFT, which is trained to respond to …☆132Mar 7, 2024Updated 2 years ago
- ☆116Feb 25, 2026Updated last month
- ☆10Oct 21, 2022Updated 3 years ago
- The Universe of Evaluation. All about the evaluation for LLMs.☆235Jul 9, 2024Updated last year
- Evaluating language model responses using a reward model☆29Feb 23, 2024Updated 2 years ago
- ☆39Mar 11, 2025Updated last year
- [KO-Platy🥮] KO-platypus model, fine-tuned from llama-2-ko using Korean-Open-platypus☆73Aug 24, 2025Updated 7 months ago
- ☆19Sep 20, 2022Updated 3 years ago
- Apple's Cut Cross Entropy☆30Jan 19, 2025Updated last year
- ☆103Apr 11, 2025Updated last year
- bpe based korean t5 model for text-to-text unified framework☆63Apr 17, 2024Updated 2 years ago
- Weak Labeling (NER) using ChatGPT☆37Mar 28, 2023Updated 3 years ago
- Ollama MCP Agent allows you to use LLM models locally on your PC for free along with MCP additional features☆63May 6, 2025Updated 11 months ago
- Multi-domain reasoning benchmark for Korean language models☆207Oct 17, 2024Updated last year
- hllama is a library which aims to provide a set of utility tools for large language models.☆10Apr 16, 2024Updated 2 years ago
- KURE: an embedding model specialized for Korean retrieval, developed at Korea University☆210Apr 4, 2026Updated 2 weeks ago
- KoTAN: Korean Translation and Augmentation with fine-tuned NLLB☆23Jan 4, 2024Updated 2 years ago
- ☆36Oct 4, 2023Updated 2 years ago
- ☆10Sep 13, 2024Updated last year
- 🤗 Sample code for training an LM with minimal setup☆59May 23, 2023Updated 2 years ago
- 1-Click is all you need.☆63Apr 29, 2024Updated last year
- Easy Language Model Pretraining leveraging Huggingface's Transformers and Datasets☆130Nov 12, 2022Updated 3 years ago
- Efficient fine-tuning for ko-llm models☆183Mar 18, 2024Updated 2 years ago
- Liner LLM Meetup archive☆70Mar 27, 2024Updated 2 years ago
- Welcome to the Storm Cookbook! This is your go-to guide for building with the STORM solution.☆39Aug 14, 2025Updated 8 months ago
- ☆15May 20, 2023Updated 2 years ago
- KoRean based ELECTRA pre-trained models (KR-ELECTRA) for Tensorflow and PyTorch☆15Feb 13, 2022Updated 4 years ago