bjoernpl / KOSMOS_reimplementation
A reimplementation of KOSMOS-1 from "Language Is Not All You Need: Aligning Perception with Language Models"
☆27Updated 2 years ago
Alternatives and similar repositories for KOSMOS_reimplementation:
Users that are interested in KOSMOS_reimplementation are comparing it to the libraries listed below
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"☆42Updated 5 months ago
- Code for ACL2023 paper: Pre-Training to Learn in Context☆108Updated 9 months ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year
- Large Scale Distributed Model Training strategy with Colossal AI and Lightning AI☆57Updated last year
- An Experiment on Dynamic NTK Scaling RoPE☆64Updated last year
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆34Updated last year
- ☆27Updated last month
- An official codebase for paper " CHAMPAGNE: Learning Real-world Conversation from Large-Scale Web Videos (ICCV 23)"☆52Updated last year
- Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023.☆63Updated 5 months ago
- Transformers at any scale☆41Updated last year
- SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)☆88Updated last month
- Source code for EMNLP2022 long paper: Parameter-Efficient Tuning Makes a Good Classification Head☆14Updated 2 years ago
- MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingua…☆56Updated last month
- ☆16Updated 9 months ago
- List of papers on Self-Correction of LLMs.☆72Updated 4 months ago
- Touchstone: Evaluating Vision-Language Models by Language Models☆82Updated last year
- ☆51Updated last year
- Tools for content datamining and NLP at scale☆43Updated 10 months ago
- [ACL 24 Findings] Implementation of Resonance RoPE and the PosGen synthetic dataset.☆22Updated last year
- ☆17Updated last year
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆47Updated 4 months ago
- ☆48Updated last year
- ☆64Updated last year
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages☆45Updated 5 months ago
- M4 experiment logbook☆57Updated last year
- Code for ACL paper "Zero-Shot Text Classification via Self-Supervised Tuning"☆27Updated last year
- ☆34Updated 9 months ago
- Code and data for "Dynosaur: A Dynamic Growth Paradigm for Instruction-Tuning Data Curation" (EMNLP 2023)☆63Updated last year
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆30Updated 2 weeks ago
- A curated list of the papers, repositories, tutorials, and anythings related to the large language models for tools☆67Updated last year