Train toy models using multi-token prediction objective
☆14Apr 18, 2026Updated 2 weeks ago
Alternatives and similar repositories for multi-token-pred
Users that are interested in multi-token-pred are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆30Dec 10, 2024Updated last year
- Scratchpad/Chain-of-Thought Prompts☆12Jun 6, 2022Updated 3 years ago
- Implementation of the Influence Maximization Benchmarker (IMB)☆14Aug 10, 2023Updated 2 years ago
- ☆16Mar 22, 2025Updated last year
- Influence Maximization in Near-Linear Time: A Martingale Approach Scala implementation☆14Sep 3, 2018Updated 7 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- The official Languini Kitchen repository☆14May 6, 2024Updated last year
- ☆11May 27, 2020Updated 5 years ago
- [CIKM 2024] Do We Really Need Graph Convolution During Training? Light Post-Training Graph-ODE for Efficient Recommendation☆14Aug 11, 2024Updated last year
- Pytorch implementation for NeurIPS-23:"GNNEvaluator: Evaluating GNN Performance On Unseen Graphs Without Labels"☆19Mar 21, 2024Updated 2 years ago
- Multi-scale Information Diffusion Prediction with Sequential Hypergraphs☆13Apr 13, 2024Updated 2 years ago
- a WIP architecture designed to allow transformers to think in a manner without tokens☆20Apr 12, 2024Updated 2 years ago
- ☆14Mar 15, 2024Updated 2 years ago
- visualization program for vlp-16 based on a viz class☆11Feb 8, 2017Updated 9 years ago
- AAAI '25. Retrieval-Augmented Multimodal Social Media Popularity Prediction☆22Apr 27, 2026Updated last week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- BPfold: Deep generalizable prediction of RNA secondary structure via base pair motif energy.☆34Apr 15, 2026Updated 2 weeks ago
- CasSeqGCN: Combining Network Structure and Temporal Sequence to Predict Information Cascades☆16Nov 21, 2021Updated 4 years ago
- ☆42Jul 4, 2025Updated 10 months ago
- P4Control: Line-Rate Cross-Host Attack Prevention via In-Network Information Flow Control Enabled by Programmable Switches and eBPF☆11May 20, 2024Updated last year
- ☆39Mar 29, 2024Updated 2 years ago
- CIFAR10 ResNets implemented in JAX+Flax☆12Apr 6, 2022Updated 4 years ago
- ☆23Nov 8, 2023Updated 2 years ago
- Topological Recurrent Neural Network for Diffusion Prediction☆19Nov 29, 2017Updated 8 years ago
- Official implementation of Our NeurIPS 2024 Paper "Boundary Matters: A Bi-Level Active Finetuning Method"☆14Feb 11, 2025Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆10May 23, 2022Updated 3 years ago
- Algorithms for online influence maximization☆24Feb 20, 2017Updated 9 years ago
- [ICLR 2025 Spotlight] Code release for "Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late In Training"☆18Feb 20, 2025Updated last year
- Community Implementation of the paper: "Multi-Head Mixture-of-Experts" In PyTorch☆30Apr 13, 2026Updated 3 weeks ago
- Code for ACL2021 long paper: Knowledgeable or Educated Guess? Revisiting Language Models as Knowledge Bases☆29Dec 8, 2021Updated 4 years ago
- VQ-VAE implementation pytorch☆11Mar 15, 2023Updated 3 years ago
- Code and data for the ACL 2024 Findings paper "Do LVLMs Understand Charts? Analyzing and Correcting Factual Errors in Chart Captioning"☆27Jun 5, 2024Updated last year
- ChineseHarm-Bench: A Chinese Harmful Content Detection Benchmark☆53Sep 2, 2025Updated 8 months ago
- [ICML 2024] Code release for "On the Emergence of Cross-Task Linearity in Pretraining-Finetuning Paradigm"☆11Feb 20, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Awesome-Parallel-Reasoning: Unlocking the reasoning potential of LLMs. Papers, Code, Resources & Survey.☆52Mar 8, 2026Updated last month
- 2D Gaussian splatting for image compression☆18Nov 29, 2024Updated last year
- ☆11Dec 16, 2023Updated 2 years ago
- Chinese notes of SplaTam(3DGS-based SLAM)☆15Feb 23, 2025Updated last year
- ☆11Jan 12, 2023Updated 3 years ago
- Evaluate the Quality of Critique☆37Jun 1, 2024Updated last year
- Paper Automatic Classification Based on Graph Neural Network☆19Jan 10, 2025Updated last year