shisa-ai / shisa-v2
Japanese / English Bilingual LLM
☆27Updated this week
Alternatives and similar repositories for shisa-v2
Users who are interested in shisa-v2 compare it to the repositories listed below.
- ☆14Updated last year
- ☆42Updated last year
- LLM evaluation project for Japanese tasks☆90Updated 2 weeks ago
- ☆41Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization☆69Updated last year
- Unofficial entropix implementation for Gemma2, Llama, Qwen2, and Mistral☆17Updated 9 months ago
- ☆58Updated last year
- Just a bunch of benchmark logs for different LLMs☆118Updated last year
- ☆50Updated last year
- Simple examples using Argilla tools to build AI☆56Updated 11 months ago
- Ongoing research training Mixture of Expert models.☆21Updated last year
- Hosting the JSON for the GPT4 Tokenizer☆64Updated 2 years ago
- entropix style sampling + GUI☆27Updated last year
- Let Me Speak Freely? A Study on the Impact of Format Restrictions on Performance of Large Language Models☆25Updated 5 months ago
- ☆50Updated 9 months ago
- Supports continual pre-training and instruction tuning; forked from llama-recipes☆33Updated last year
- ☆46Updated 2 years ago
- 🚢 Data Toolkit for Sailor Language Models☆94Updated 8 months ago
- ☆62Updated last year
- Lightblue LLM Eval Framework: tengu, elyza100, ja-mtbench, rakuda☆18Updated last week
- Codebase accompanying the Summary of a Haystack paper.☆79Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆58Updated 3 weeks ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆145Updated 8 months ago
- Japanese Massive Multitask Language Understanding Benchmark☆36Updated last month
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆91Updated 2 months ago
- Small and Efficient Mathematical Reasoning LLMs☆72Updated last year
- Evaluating LLMs with CommonGen-Lite☆91Updated last year
- CLI tool to quantize GGUF, GPTQ, AWQ, HQQ, and EXL2 models☆76Updated 10 months ago
- Multi-Domain Expert Learning☆66Updated last year
- Track the progress of LLM context utilisation☆54Updated 6 months ago