Train toy models using a multi-token prediction objective
☆14Apr 18, 2026Updated 2 months ago
Alternatives and similar repositories for multi-token-pred
Users who are interested in multi-token-pred are comparing it to the libraries listed below.
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆30Dec 10, 2024Updated last year
- Implementation of the Influence Maximization Benchmarker (IMB)☆14Aug 10, 2023Updated 2 years ago
- ☆16Mar 22, 2025Updated last year
- Scala implementation of "Influence Maximization in Near-Linear Time: A Martingale Approach"☆14Sep 3, 2018Updated 7 years ago
- Code for "Hierarchical Diffusion Attention Network" (IJCAI 2019)☆14Apr 23, 2020Updated 6 years ago
- ☆10Nov 3, 2023Updated 2 years ago
- ☆11May 27, 2020Updated 6 years ago
- [CIKM 2024] Do We Really Need Graph Convolution During Training? Light Post-Training Graph-ODE for Efficient Recommendation☆15Aug 11, 2024Updated last year
- PyTorch implementation for NeurIPS-23: "GNNEvaluator: Evaluating GNN Performance On Unseen Graphs Without Labels"☆19Mar 21, 2024Updated 2 years ago
- Source code for VLDB 2020 paper "Hypergraph Motifs: Concepts, Algorithms, and Discoveries."☆17Jun 4, 2024Updated 2 years ago
- ☆19Sep 21, 2018Updated 7 years ago
- KDD 2025: "Predicting the Dynamics of Complex Systems via Multiscale Diffusion Autoencoder"☆21Jun 2, 2025Updated last year
- ☆15Mar 15, 2024Updated 2 years ago
- AAAI '25. Retrieval-Augmented Multimodal Social Media Popularity Prediction☆24Updated this week
- A flexible & scalable MLLM-based AIGC detection pipeline☆39Jun 16, 2026Updated 2 weeks ago
- Code for A Sequential Neural Information Diffusion Model with Structure Attention (CIKM 2018)☆18Jan 4, 2019Updated 7 years ago
- [AAAI 2024] Official PyTorch Implementation of "Unknown-Aware Graph Regularization for Robust Semi-supervised Learning from Uncurated Dat…☆15May 29, 2025Updated last year
- CasSeqGCN: Combining Network Structure and Temporal Sequence to Predict Information Cascades☆16Nov 21, 2021Updated 4 years ago
- P4Control: Line-Rate Cross-Host Attack Prevention via In-Network Information Flow Control Enabled by Programmable Switches and eBPF☆11May 20, 2024Updated 2 years ago
- ☆39Mar 29, 2024Updated 2 years ago
- ☆23Nov 8, 2023Updated 2 years ago
- ☆49Jul 4, 2025Updated last year
- Topological Recurrent Neural Network for Diffusion Prediction☆19Nov 29, 2017Updated 8 years ago
- Algorithms for online influence maximization☆24Feb 20, 2017Updated 9 years ago
- [ICLR 2025 Spotlight] Code release for "Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late In Training"☆19Feb 20, 2025Updated last year
- Community implementation of the paper "Multi-Head Mixture-of-Experts" in PyTorch☆31Jun 22, 2026Updated last week
- Code for ACL2021 long paper: Knowledgeable or Educated Guess? Revisiting Language Models as Knowledge Bases☆29Dec 8, 2021Updated 4 years ago
- Semantic-decoupled Spatial Partition Guided Point-supervised Oriented Object Detection☆13Updated this week
- The Four Great Classical Novels of Chinese literature (四大名著)☆23Jan 2, 2023Updated 3 years ago
- [ICML 2024] Code release for "On the Emergence of Cross-Task Linearity in Pretraining-Finetuning Paradigm"☆11Feb 20, 2025Updated last year
- (IJCAI 2023) Sph2Pob: Boosting Object Detection on Spherical Images with Planar Oriented Boxes Methods☆14Aug 23, 2023Updated 2 years ago
- ☆18Jan 25, 2025Updated last year
- ☆11Dec 16, 2023Updated 2 years ago
- ☆11Jan 12, 2023Updated 3 years ago
- A reinforcement learning object detector leveraging saliency ranking, offering a self-explainable system with a fully observable action l…☆14May 28, 2025Updated last year
- Evaluate the Quality of Critique☆37Jun 1, 2024Updated 2 years ago
- Official repository for paper "DeepCritic: Deliberate Critique with Large Language Models"☆41Jun 24, 2025Updated last year
- Deep universal probabilistic programming with Python and PyTorch☆13Apr 1, 2020Updated 6 years ago
- Paper Automatic Classification Based on Graph Neural Network☆19Jan 10, 2025Updated last year