sebastian-nehrdich / byt5-sanskrit-analyzers
☆22 Updated last year
Alternatives and similar repositories for byt5-sanskrit-analyzers
Users who are interested in byt5-sanskrit-analyzers compare it to the libraries listed below
- Align various Sanskrit texts and audio ☆16 Updated 4 months ago
- ☆16 Updated 2 years ago
- ☆106 Updated 7 months ago
- ☆173 Updated 4 months ago
- Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models. TMLR 2025. ☆122 Updated 2 months ago
- Code repository for the paper "MrT5: Dynamic Token Merging for Efficient Byte-level Language Models." ☆51 Updated last month
- The Learnable Typewriter: A Generative Approach to Text Line Analysis ☆34 Updated last year
- Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model" ☆61 Updated 11 months ago
- Official code release for the paper Trapped in texture bias? A large scale comparison of deep instance segmentation, accepted at ECCV 202… ☆16 Updated last year
- Python Interface to Cologne Digital Sanskrit Lexicon (CDSL) ☆16 Updated 3 years ago
- [ICCV 2025] Official code for Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation ☆47 Updated 2 months ago
- CVPR 2025 Workshop on CVEU. ☆42 Updated 5 months ago
- This repository includes the code to download the curated HuggingFace papers into a single markdown formatted file ☆15 Updated last year
- PyTorch Implementation of Object Recognition as Next Token Prediction [CVPR'24 Highlight] ☆180 Updated 6 months ago
- ☆54 Updated last week
- ☆185 Updated 5 months ago
- NEO Series: Native Vision-Language Models from First Principles ☆223 Updated 3 weeks ago
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data ☆35 Updated last year
- AAPL: Adding Attributes to Prompt Learning for Vision-Language Models (CVPRw 2024) ☆34 Updated last year
- ☆10 Updated 8 months ago
- Does patch ordering affect context-limited vision transformers? ☆15 Updated last month
- (WACV 2025 - Oral) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, H… ☆83 Updated 3 months ago
- ☆53 Updated 6 months ago
- Code for the paper: "No Zero-Shot Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance" [NeurI… ☆92 Updated last year
- Official PyTorch Implementation for Dual-Process Image Generation, ICCV 2025 ☆109 Updated 2 months ago
- Official PyTorch implementation of the paper "Chapter-Llama: Efficient Chaptering in Hour-Long Videos with LLMs" ☆80 Updated 5 months ago
- Vaiyyākaraṇaḥ is a telegram bot that offers various tools for a Sanskrit learner including stem (प्रातिपदिकम्) finder, root (धातुः) finde… ☆15 Updated last year
- Code and data for the paper: Learning Action and Reasoning-Centric Image Editing from Videos and Simulation ☆30 Updated 4 months ago
- [NeurIPS 2024 D&B] VideoGUI: A Benchmark for GUI Automation from Instructional Videos ☆48 Updated 5 months ago
- High order Moment Models ☆42 Updated last week