nahidalam / maya
Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya
☆125 Updated 6 months ago
Alternatives and similar repositories for maya
Users who are interested in maya are comparing it to the libraries listed below.
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆59 Updated 3 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding. ☆175 Updated last year
- Train your own SOTA deductive reasoning model ☆107 Updated 11 months ago
- ☆53 Updated last year
- ☆120 Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆61 Updated last year
- ☆91 Updated last month
- ☆56 Updated last year
- Source code for the collaborative reasoner research project at Meta FAIR. ☆112 Updated 9 months ago
- Lightweight toolkit package to train and fine-tune 1.58bit Language models ☆112 Updated 8 months ago
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models. ☆90 Updated last month
- EvaByte: Efficient Byte-level Language Models at Scale ☆115 Updated 9 months ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?" ☆69 Updated last year
- Code for ExploreTom ☆90 Updated 7 months ago
- Multimodal language model benchmark, featuring challenging examples ☆183 Updated last year
- The first dense retrieval model that can be prompted like an LM ☆90 Updated 9 months ago
- Streamline on-policy/off-policy distillation workflows in a few lines of code ☆95 Updated this week
- ScreenSuite - The most comprehensive benchmarking suite for GUI Agents! ☆135 Updated 4 months ago
- Verifiers for LLM Reinforcement Learning ☆80 Updated 9 months ago
- Data recipes and robust infrastructure for training AI agents ☆94 Updated this week
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch ☆103 Updated last year
- [TMLR 2026] When Attention Collapses: How Degenerate Layers in LLMs Enable Smaller, Stronger Models ☆122 Updated last year
- Easy to use, High Performant Knowledge Distillation for LLMs ☆97 Updated 9 months ago
- Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory ☆59 Updated last year
- PyTorch implementation of models from the Zamba2 series. ☆186 Updated last year
- Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps" ☆142 Updated 3 months ago
- ☆48 Updated last year
- Train, tune, and infer Bamba model ☆137 Updated 8 months ago
- ☆141 Updated 5 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera… ☆261 Updated last week