bjoernpl / KOSMOS_reimplementation
A reimplementation of KOSMOS-1 from "Language Is Not All You Need: Aligning Perception with Language Models"
☆27Updated 2 years ago
Alternatives and similar repositories for KOSMOS_reimplementation
Users who are interested in KOSMOS_reimplementation are comparing it to the libraries listed below.
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"☆41Updated 7 months ago
- Large Scale Distributed Model Training strategy with Colossal AI and Lightning AI☆57Updated last year
- Code for ACL2023 paper: Pre-Training to Learn in Context☆108Updated 11 months ago
- Plug-and-play implementation of "Textbooks Are All You Need", ready for training, inference, and dataset generation☆76Updated last year
- An Experiment on Dynamic NTK Scaling RoPE☆64Updated last year
- SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)☆90Updated 2 months ago
- Unofficial implementation of AlpaGasus☆91Updated last year
- [ACL 24 Findings] Implementation of Resonance RoPE and the PosGen synthetic dataset.☆22Updated last year
- Transformers at any scale☆41Updated last year
- Code for preprint "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)"☆39Updated last month
- Inference script for Meta's LLaMA models using Hugging Face wrapper☆110Updated 2 years ago
- ☆96Updated 2 years ago
- Tools for content datamining and NLP at scale☆43Updated last year
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…☆55Updated this week
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆78Updated last year
- The official code repo and data hub of top_nsigma sampling strategy for LLMs.☆26Updated 4 months ago
- M4 experiment logbook☆58Updated last year
- Contextual Position Encoding but with some custom CUDA kernels https://arxiv.org/abs/2405.18719 ☆22Updated last year
- ☆35Updated last year
- List of papers on Self-Correction of LLMs.☆73Updated 6 months ago
- A curated list of papers, repositories, tutorials, and anything else related to large language models for tools☆67Updated last year
- ☆37Updated 2 years ago
- This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception o…☆23Updated 2 months ago
- ☆17Updated last year
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning☆91Updated last year
- Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023.☆63Updated 7 months ago
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages☆48Updated 6 months ago
- My explorations into editing the knowledge and memories of an attention network☆35Updated 2 years ago
- [TMLR'23] Contrastive Search Is What You Need For Neural Text Generation☆119Updated 2 years ago