bjoernpl / KOSMOS_reimplementation
A reimplementation of KOSMOS-1 from "Language Is Not All You Need: Aligning Perception with Language Models"
☆27Updated last year
Related projects: ⓘ
- SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)☆73Updated 11 months ago
- Code for ACL2023 paper: Pre-Training to Learn in Context☆106Updated last month
- MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingua…☆33Updated last week
- An Experiment on Dynamic NTK Scaling RoPE☆59Updated 9 months ago
- Transformers at any scale☆39Updated 8 months ago
- Unofficial implementation of AlpaGasus☆83Updated 11 months ago
- Official Repository of MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations☆47Updated 2 months ago
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…☆48Updated last week
- ☆17Updated 11 months ago
- Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023.☆58Updated 2 months ago
- PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion