shisa-ai / shisa-v2Links

Japanese / English Bilingual LLM

☆24

Alternatives and similar repositories for shisa-v2

Users that are interested in shisa-v2 are comparing it to the libraries listed below

Sorting:

Aratako / Task-Vector-Merge-Optimzier
☆14Updated last year
mungg / FABLES
☆57Updated 10 months ago
adityasoni9998 / OpenHands-Versa
Code for the paper "Coding Agents with Multimodal Browsing are Generalist Problem Solvers"
☆69Updated last week
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆55Updated 6 months ago
neodyland / entropix
Unofficial entropix impl for Gemma2 and Llama and Qwen2 and Mistral
☆17Updated 6 months ago
davanstrien / haiku-dpo
Using open source LLMs to build synthetic datasets for direct preference optimization
☆65Updated last year
Xalp / ECHO
Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)
☆91Updated 6 months ago
akjindal53244 / Arithmo
Small and Efficient Mathematical Reasoning LLMs
☆71Updated last year
argilla-io / argilla-cookbook
Simple examples using Argilla tools to build AI
☆53Updated 8 months ago
llm-jp / llm-jp-corpus
☆42Updated last year
sail-sg / sailcraft
🚢 Data Toolkit for Sailor Language Models
☆94Updated 5 months ago
teknium1 / LLM-Benchmark-Logs
Just a bunch of benchmark logs for different LLMs
☆119Updated last year
salesforce / summary-of-a-haystack
Codebase accompanying the Summary of a Haystack paper.
☆79Updated 10 months ago
allenai / CommonGen-Eval
Evaluating LLMs with CommonGen-Lite
☆90Updated last year
sdan / selfextend
an implementation of Self-Extend, to expand the context window via grouped attention
☆119Updated last year
EduardTalianu / EntropixLab
entropix style sampling + GUI
☆26Updated 9 months ago
Danau5tin / calculator_agent_rl
Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.
☆45Updated 3 months ago
casper-hansen / OpenCoconut
OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.
☆173Updated 6 months ago
DunZhang / Stella
☆62Updated last year
allenai / infinigram-api
☆73Updated 3 weeks ago
stunningpixels / lou-eval
Track the progress of LLM context utilisation
☆55Updated 3 months ago
argilla-io / distilabel-spin-dibt
Repository containing the SPIN experiments on the DIBT 10k ranked prompts
☆24Updated last year
C-J-Cundy / gpt4-tokenizer
Hosting the JSON for the GPT4 Tokenizer
☆64Updated 2 years ago
huggingface / huggingface-inference-toolkit
Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.
☆83Updated this week
argilla-io / notus
Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…
☆168Updated last year
Hannibal046 / nanoColBERT
Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).
☆80Updated last year
allenai / adapt-demos
Lightweight tools for quick and easy LLM demo's
☆28Updated 10 months ago
wandb / llm-leaderboard
Project of llm evaluation to Japanese tasks
☆86Updated this week
Mihaiii / backtrack_sampler
An easy-to-understand framework for LLM samplers that rewind and revise generated tokens
☆140Updated 5 months ago
Muhtasham / summarization-eval
📝 Reference-Free automatic summarization evaluation with potential hallucination detection
☆101Updated last year