bjoernpl / KOSMOS_reimplementation
A reimplementation of KOSMOS-1 from "Language Is Not All You Need: Aligning Perception with Language Models"
☆26Updated last year
Alternatives and similar repositories for KOSMOS_reimplementation:
Users that are interested in KOSMOS_reimplementation are comparing it to the libraries listed below
- Code for ACL2023 paper: Pre-Training to Learn in Context☆108Updated 7 months ago
- An Experiment on Dynamic NTK Scaling RoPE☆62Updated last year
- Source code for the paper "Prefix Language Models are Unified Modal Learners"☆43Updated last year
- Unofficial implementation of AlpaGasus☆90Updated last year
- ☆96Updated last year
- SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)☆87Updated last year
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year
- ☆27Updated this week
- Tools for content datamining and NLP at scale☆42Updated 8 months ago
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"☆42Updated 3 months ago
- ☆49Updated last year
- Transformers at any scale☆41Updated last year
- Large Scale Distributed Model Training strategy with Colossal AI and Lightning AI☆57Updated last year
- Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023.☆62Updated 3 months ago
- Source code for EMNLP2022 long paper: Parameter-Efficient Tuning Makes a Good Classification Head☆14Updated 2 years ago
- Repository for the paper: 500xCompressor: Generalized Prompt Compression for Large Language Models☆27Updated 6 months ago
- Code for ACL paper "Zero-Shot Text Classification via Self-Supervised Tuning"☆27Updated last year
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆33Updated last year
- [ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models☆76Updated 11 months ago
- [AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following☆79Updated 5 months ago
- [ACL 24 Findings] Implementation of Resonance RoPE and the PosGen synthetic dataset.☆21Updated 11 months ago
- M4 experiment logbook☆57Updated last year
- ☆37Updated last year
- ☆15Updated 7 months ago
- ZYN: Zero-Shot Reward Models with Yes-No Questions☆33Updated last year
- MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingua…☆53Updated 2 months ago
- Automatic prompt optimization framework for multi-step agent tasks.☆28Updated 3 months ago
- Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models☆43Updated 8 months ago
- ☆17Updated 9 months ago
- Touchstone: Evaluating Vision-Language Models by Language Models☆82Updated last year