☆75Dec 6, 2024Updated last year
Alternatives and similar repositories for LLaMAX
Users that are interested in LLaMAX are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Repo for "Why Settle for One? Text-to-ImageSet Generation and Evaluation"☆21Oct 1, 2025Updated 6 months ago
- ☆40Jan 23, 2024Updated 2 years ago
- ☆10Mar 22, 2024Updated 2 years ago
- Code, benchmark and environment for "OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic Workflows"☆40Nov 10, 2025Updated 5 months ago
- ☆48Oct 2, 2025Updated 6 months ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Code for Research Project TLDR☆25Jul 28, 2025Updated 8 months ago
- Code for "From Ideal to Real: Unified and Data-Efficient Dense Prediction for Real-World Scenarios"☆28Jul 7, 2025Updated 9 months ago
- The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LA…☆30Nov 24, 2024Updated last year
- notebooks to finetune `bert-small-amharic`, `bert-mini-amharic`, and `xlm-roberta-base` models using an Amharic text classification datas…☆11May 10, 2024Updated last year
- ☆18Aug 28, 2025Updated 7 months ago
- [ICLR 2026] JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence☆79Feb 9, 2026Updated 2 months ago
- Easy control for Key-Value Constrained Generative LLM Inference(https://arxiv.org/abs/2402.06262)☆62Feb 13, 2024Updated 2 years ago
- COMET for African languages☆11Jan 24, 2025Updated last year
- An Arena-style Automated Evaluation Benchmark for Detailed Captioning☆59Jun 1, 2025Updated 10 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [ACL 2024] The project of Symbol-LLM☆59Jul 10, 2024Updated last year
- ☆12Aug 31, 2021Updated 4 years ago
- codes and pre-trained models of paper "Segatron: Segment-aware Transformer for Language Modeling and Understanding"☆18Oct 25, 2022Updated 3 years ago
- ☆27Jul 25, 2023Updated 2 years ago
- ☆17Jan 12, 2023Updated 3 years ago
- The model, data and code for the visual GUI Agent SeeClick☆478Jul 13, 2025Updated 9 months ago
- The official implementation of "Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding"☆22Jun 26, 2025Updated 9 months ago
- [EMNLP'24] LongHeads: Multi-Head Attention is Secretly a Long Context Processor☆31Apr 8, 2024Updated 2 years ago
- ↔️ T5 Machine Translation from English to Korean☆18Aug 11, 2022Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆35Feb 10, 2025Updated last year
- Source code for ASRU 2019 paper "Adapting Pretrained Transformer to Lattices for Spoken Language Understanding"☆10Jul 8, 2020Updated 5 years ago
- Sparse Multilabel Categorical Crossentropy☆11Sep 10, 2023Updated 2 years ago
- [COLM'24] Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model Collaboration☆30Oct 18, 2024Updated last year
- ☆13May 9, 2023Updated 2 years ago
- Official Implementation for *PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling*☆37Dec 13, 2025Updated 4 months ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆71Jun 1, 2025Updated 10 months ago
- ☆11Mar 4, 2026Updated last month
- ☆12Aug 8, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆10Mar 11, 2024Updated 2 years ago
- Viscacha:通用信息抽取数据集收集☆27Feb 21, 2024Updated 2 years ago
- AI model designed to test the effectiveness in handling external ethical attacks.☆11Feb 9, 2026Updated 2 months ago
- [ACL 2023] Are Pre-trained Language Models Useful for Model Ensemble in Chinese Grammatical Error Correction?☆10Dec 15, 2025Updated 4 months ago
- A basic pure pytorch implementation of flash attention☆16Oct 28, 2024Updated last year
- Repo for Anonymous purpose, pls don't distribute☆10Oct 2, 2024Updated last year
- MasakhaNEWS: News Topic Classification for African Languages☆26May 12, 2024Updated last year