nahidalam / mayaLinks

Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya

☆117

Alternatives and similar repositories for maya

Users that are interested in maya are comparing it to the libraries listed below

Sorting:

allenai / infinigram-api
☆69Updated last month
facebookresearch / collaborative-reasoner
Source code for the collaborative reasoner research project at Meta FAIR.
☆94Updated 2 months ago
bespokelabsai / verifiers
Verifiers for LLM Reinforcement Learning
☆64Updated 2 months ago
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆54Updated 5 months ago
casper-hansen / OpenCoconut
OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.
☆173Updated 5 months ago
OpenPipe / deductive-reasoning
Train your own SOTA deductive reasoning model
☆96Updated 4 months ago
arcee-ai / DAM
☆52Updated 8 months ago
facebookresearch / ExploreToM
Code for ExploreTom
☆84Updated 2 weeks ago
ContextualAI / CLAIR_and_APO
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆60Updated 10 months ago
allenai / IFBench
☆52Updated last week
writer / writing-in-the-margins
☆118Updated 10 months ago
orionw / promptriever
The first dense retrieval model that can be prompted like an LM
☆80Updated 2 months ago
OpenEvaByte / evabyte
EvaByte: Efficient Byte-level Language Models at Scale
☆103Updated 2 months ago
tiiuae / onebitllms
Lightweight toolkit package to train and fine-tune 1.58bit Language models
☆81Updated last month
foundation-model-stack / bamba
Train, tune, and infer Bamba model
☆130Updated last month
goncalorafaria / qalign
QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.
☆23Updated 3 months ago
facebookresearch / matrix
Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…
☆73Updated last week
zjunlp / DynamicKnowledgeCircuits
How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training
☆36Updated 2 months ago
AnswerDotAI / ModernBERT-Instruct-mini-cookbook
☆48Updated 5 months ago
oriyor / assistantbench
Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"
☆58Updated 7 months ago
google-deepmind / latent-multi-hop-reasoning
[ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?
☆71Updated 3 months ago
ServiceNow / Fast-LLM
Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research
☆209Updated this week
YutongWang1216 / DocMTAgent
Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory
☆45Updated 5 months ago
AK391 / dailypapersHN
☆86Updated 9 months ago
minosvasilias / simple_grpo
Simple GRPO scripts and configurations.
☆59Updated 5 months ago
reka-ai / reka-vibe-eval
Multimodal language model benchmark, featuring challenging examples
☆170Updated 6 months ago
huggingface / huggingface-inference-toolkit
Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.
☆80Updated last month
sanyalsunny111 / LLM-Inheritune
This is the official repository for Inheritune.
☆111Updated 5 months ago
Zyphra / Zamba2
PyTorch implementation of models from the Zamba2 series.
☆183Updated 5 months ago
fangyuan-ksgk / Tiny-GRPO
minimal GRPO implementation from scratch
☆92Updated 3 months ago