eric-prog / GPU-GrantsLinks
GPUGrants - a list of GPU grants that I can think of
☆23Updated 2 months ago
Alternatives and similar repositories for GPU-Grants
Users that are interested in GPU-Grants are comparing it to the libraries listed below
Sorting:
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆57Updated 9 months ago
- Collection of autoregressive model implementation☆85Updated last month
- Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models. TMLR 2025.☆61Updated 3 weeks ago
- ☆46Updated last month
- This is the repo for the paper "PANGEA: A FULLY OPEN MULTILINGUAL MULTIMODAL LLM FOR 39 LANGUAGES"☆105Updated 6 months ago
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"☆26Updated 7 months ago
- Code repository for the paper "MrT5: Dynamic Token Merging for Efficient Byte-level Language Models."☆41Updated last month
- Tiny re-implementation of MDM in style of LLaDA and nano-gpt speedrun☆52Updated 2 months ago
- A basic pure pytorch implementation of flash attention☆16Updated 7 months ago
- [ACL 2024 Findings & ICLR 2024 WS] An Evaluator VLM that is open-source, offers reproducible evaluation, and inexpensive to use. Specific…☆71Updated 8 months ago
- Code for Zero-Shot Tokenizer Transfer☆128Updated 4 months ago
- Code for ExploreTom☆83Updated 5 months ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆127Updated last year
- Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understanding☆51Updated 5 months ago
- ☆18Updated 2 weeks ago
- (WACV 2025 - Oral) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, H…☆84Updated 3 months ago
- ☆40Updated last year
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"☆44Updated last year
- ☆51Updated last year
- ☆22Updated 5 months ago
- ☆65Updated 2 months ago
- ☆120Updated 8 months ago
- ☆19Updated this week
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆139Updated 2 weeks ago
- ☆49Updated 7 months ago
- YesBut - Multimodal Satire Comprehension Dataset☆17Updated 7 months ago
- We study toy models of skill learning.☆28Updated 4 months ago
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year
- ☆37Updated last year