shisa-ai / shisa-v2
Japanese / English Bilingual LLM
☆28 Updated this week
Alternatives and similar repositories for shisa-v2
Users who are interested in shisa-v2 are comparing it to the repositories listed below.
- ☆16 Updated last year
- LLM evaluation project for Japanese tasks ☆90 Updated 2 weeks ago
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆71 Updated last year
- Unofficial entropix impl for Gemma2 and Llama and Qwen2 and Mistral ☆17 Updated 11 months ago
- ☆58 Updated last year
- ☆41 Updated last year
- ☆43 Updated last year
- Lightblue LLM Eval Framework: tengu, elyza100, ja-mtbench, rakuda ☆18 Updated 3 weeks ago
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping. ☆27 Updated 2 years ago
- 🚢 Data Toolkit for Sailor Language Models ☆94 Updated 9 months ago
- entropix style sampling + GUI ☆27 Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆59 Updated 2 months ago
- Support Continual pre-training & Instruction Tuning forked from llama-recipes ☆34 Updated last year
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens ☆150 Updated 10 months ago
- A massively multilingual modern encoder language model ☆116 Updated 2 months ago
- ☆50 Updated last year
- ☆53 Updated 10 months ago
- Do Multilingual Language Models Think Better in English? ☆43 Updated 2 years ago
- Paper notes, added occasionally ☆62 Updated this week
- Scrape and export data from the Open LLM Leaderboard. ☆48 Updated last year
- Just a bunch of benchmark logs for different LLMs ☆119 Updated last year
- Lightweight tools for quick and easy LLM demos ☆28 Updated last year
- ☆24 Updated 10 months ago
- RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best … ☆10 Updated 2 years ago
- ☆62 Updated last year
- you.com's framework for evaluating deep research systems. ☆58 Updated 7 months ago
- ☆56 Updated 5 months ago
- LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT ☆27 Updated last year
- Hosting the JSON for the GPT4 Tokenizer ☆64 Updated 2 years ago
- Evaluating LLMs with CommonGen-Lite ☆93 Updated last year