mj-storytelling / DiversityTuningLinks
Modifying Large Language Models Post-training for Diverse Creative Writing
☆50Updated 4 months ago
Alternatives and similar repositories for DiversityTuning
Users that are interested in DiversityTuning are comparing it to the libraries listed below
Sorting:
- ☆50Updated 3 months ago
- [ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents☆46Updated 6 months ago
- Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understanding☆53Updated 9 months ago
- Codebase for LLM story generation; updated version of https//github.com/yangkevin2/doc-story-generation☆85Updated last year
- The official github repo for MixEval-X, the first any-to-any, real-world benchmark.☆15Updated 7 months ago
- The official code repo and data hub of top_nsigma sampling strategy for LLMs.☆26Updated 7 months ago
- This is the official repository for Inheritune.☆113Updated 7 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated last year
- Official implementation of ECCV24 paper: POA☆24Updated last year
- [ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr…☆77Updated 9 months ago
- The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25]☆31Updated 2 weeks ago
- ☆86Updated 8 months ago
- A repository for research on medium sized language models.☆77Updated last year
- Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs☆50Updated last year
- Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models☆39Updated 11 months ago
- [ACL 2025] An inference-time decoding strategy with adaptive foresight sampling☆104Updated 4 months ago
- [ACM MM25] LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models☆19Updated 5 months ago
- ☆71Updated 9 months ago
- Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs☆90Updated 10 months ago
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)☆35Updated 6 months ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆34Updated 3 weeks ago
- Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion mod…☆92Updated 3 weeks ago
- This repo contains the code for "MEGA-Bench Scaling Multimodal Evaluation to over 500 Real-World Tasks" [ICLR2025]☆77Updated 2 months ago
- ☆24Updated 7 months ago
- Code for "From Ideal to Real: Unified and Data-Efficient Dense Prediction for Real-World Scenarios"☆26Updated 2 months ago
- Official repo for Learning to Reason for Long-Form Story Generation☆71Updated 5 months ago
- Multimodal language model benchmark, featuring challenging examples☆176Updated 9 months ago
- The open-source code of MetaStone-S1.☆108Updated last month
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623☆86Updated 11 months ago
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"☆45Updated last year