bjoernpl / KOSMOS_reimplementationLinks

A reimplementation of KOSMOS-1 from "Language Is Not All You Need: Aligning Perception with Language Models"

☆27

Alternatives and similar repositories for KOSMOS_reimplementation

Users that are interested in KOSMOS_reimplementation are comparing it to the libraries listed below

Sorting:

kyegomez / LM-Infinite
Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"
☆40Updated 8 months ago
wade3han / champagne
An official codebase for paper " CHAMPAGNE: Learning Real-world Conversation from Large-Scale Web Videos (ICCV 23)"
☆52Updated last year
Dahoas / reward-modeling
☆96Updated 2 years ago
thu-coai / PICL
Code for ACL2023 paper: Pre-Training to Learn in Context
☆107Updated last year
scottlogic-alex / prm800k-denorm
Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format
☆27Updated 2 years ago
zsc / llama_infer
Inference script for Meta's LLaMA models using Hugging Face wrapper
☆110Updated 2 years ago
NormXU / Consistent-DynamicNTKRoPE
An Experiment on Dynamic NTK Scaling RoPE
☆64Updated last year
kyegomez / phi-1
Plug in and play implementation of " Textbooks Are All You Need", ready for training, inference, and dataset generation
☆76Updated last year
deep-spin / infinite-former
☆65Updated 11 months ago
GeneZC / MiniMA
Code for paper titled "Towards the Law of Capacity Gap in Distilling Language Models"
☆101Updated last year
IBM / SALMON
Self-Alignment with Principle-Following Reward Models
☆162Updated 2 months ago
austrian-code-wizard / c3po
☆29Updated this week
gpt4life / alpagasus
Unofficial implementation of AlpaGasus
☆92Updated last year
sunyt32 / torchscale
Transformers at any scale
☆41Updated last year
huggingface / m4-logs
M4 experiment logbook
☆58Updated last year
LAION-AI / riverbed
Tools for content datamining and NLP at scale
☆43Updated last year
Moocember / Optimization-by-PROmpting
☆78Updated last year
seonghyeonye / TAPP
[AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following
☆79Updated 10 months ago
vicgalle / zero-shot-reward-models
ZYN: Zero-Shot Reward Models with Yes-No Questions
☆34Updated last year
DAMO-NLP-SG / CLEX
[ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models
☆78Updated last year
nttmdlab-nlp / SlideVQA
SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)
☆92Updated 4 months ago
locuslab / scaling_laws_data_filtering
☆65Updated last year
sheryc / resonance_rope
[ACL 24 Findings] Implementation of Resonance RoPE and the PosGen synthetic dataset.
☆22Updated last year
feyzaakyurek / rl4f
Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023.
☆64Updated 8 months ago
Lightning-Universe / lightning-ColossalAI
Large Scale Distributed Model Training strategy with Colossal AI and Lightning AI
☆57Updated last year
kyleliang919 / Long-context-transformers
Exploring finetuning public checkpoints on filter 8K sequences on Pile
☆116Updated 2 years ago
dhansmair / flamingo-mini
Implementation of the deepmind Flamingo vision-language model, based on Hugging Face language models and ready for training
☆167Updated 2 years ago
GAIR-NLP / Entropy-ABF
Official implementation for 'Extending LLMs’ Context Window with 100 Samples'
☆79Updated last year
allenai / easy-to-hard-generalization
Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"
☆48Updated last year
OpenLMLab / scaling-rope
code for Scaling Laws of RoPE-based Extrapolation
☆73Updated last year