Train toy models using multi-token prediction objective
☆14 · May 8, 2024 · Updated last year
Alternatives and similar repositories for multi-token-pred
Users that are interested in multi-token-pred are comparing it to the libraries listed below.
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for… ☆29 · Dec 10, 2024 · Updated last year
- lncRNA-Py is a development package for applying machine learning and deep learning to the problem of lncRNA classification, i.e. predicti… ☆12 · Jan 24, 2025 · Updated last year
- Scratchpad/Chain-of-Thought Prompts ☆12 · Jun 6, 2022 · Updated 3 years ago
- ☆10 · Oct 29, 2020 · Updated 5 years ago
- Code used in Tiukhova et al. (2022). Influencer Detection with Dynamic Graph Neural Networks. TGL@NeurIPS 2022. ☆11 · Feb 9, 2023 · Updated 3 years ago
- ☆16 · Mar 22, 2025 · Updated last year
- Influence Maximization in Near-Linear Time: A Martingale Approach Scala implementation ☆14 · Sep 3, 2018 · Updated 7 years ago
- The official Languini Kitchen repository ☆14 · May 6, 2024 · Updated last year
- [ACL 2025 Findings] Implicit Reasoning in Transformers is Reasoning through Shortcuts ☆17 · Mar 11, 2025 · Updated last year
- Code for "Hierarchical Diffusion Attention Network" (IJCAI 2019) ☆14 · Apr 23, 2020 · Updated 5 years ago
- ☆11 · May 27, 2020 · Updated 5 years ago
- [CIKM 2024] Do We Really Need Graph Convolution During Training? Light Post-Training Graph-ODE for Efficient Recommendation ☆14 · Aug 11, 2024 · Updated last year
- Provide RNA and DNA Foundation Model Benchmarks and Applications ☆27 · Nov 26, 2025 · Updated 4 months ago
- A flexible & scalable MLLM-based AIGC detection pipeline ☆31 · Oct 27, 2025 · Updated 5 months ago
- Source code for VLDB 2020 paper "Hypergraph Motifs: Concepts, Algorithms, and Discoveries." ☆17 · Jun 4, 2024 · Updated last year
- ☆19 · Sep 21, 2018 · Updated 7 years ago
- for all, home ☆16 · Updated this week
- Multi-scale Information Diffusion Prediction with Sequential Hypergraphs ☆13 · Apr 13, 2024 · Updated 2 years ago
- KDD 2025: "Predicting the Dynamics of Complex Systems via Multiscale Diffusion Autoencoder" ☆19 · Jun 2, 2025 · Updated 10 months ago
- DISCO: Influence Maximization Meets Graph Embedding and Deep Learning (a.k.a. PIANO) ☆16 · Apr 1, 2022 · Updated 4 years ago
- BPfold: Deep generalizable prediction of RNA secondary structure via base pair motif energy. ☆32 · Feb 24, 2026 · Updated last month
- Code for A Sequential Neural Information Diffusion Model with Structure Attention (CIKM 2018) ☆18 · Jan 4, 2019 · Updated 7 years ago
- We release our code and data for SEAS in this repository. ☆21 · Dec 23, 2024 · Updated last year
- CasSeqGCN: Combining Network Structure and Temporal Sequence to Predict Information Cascades ☆16 · Nov 21, 2021 · Updated 4 years ago
- ☆40 · Jul 4, 2025 · Updated 9 months ago
- P4Control: Line-Rate Cross-Host Attack Prevention via In-Network Information Flow Control Enabled by Programmable Switches and eBPF ☆11 · May 20, 2024 · Updated last year
- ☆23 · Nov 8, 2023 · Updated 2 years ago
- Topological Recurrent Neural Network for Diffusion Prediction ☆19 · Nov 29, 2017 · Updated 8 years ago
- ☆10 · May 23, 2022 · Updated 3 years ago
- Algorithms for online influence maximization ☆24 · Feb 20, 2017 · Updated 9 years ago
- Official implementation of UI-Ins: Enhancing GUI Grounding with Multi-Perspective Instruction-as-Reasoning ☆72 · Dec 30, 2025 · Updated 3 months ago
- [ICLR 2025 Spotlight] Code release for "Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late In Training" ☆18 · Feb 20, 2025 · Updated last year
- [TGRS 2023] Official code for "EARL: An Elliptical Distribution aided Adaptive Label Assignment for Oriented Object Detection in Remote S… ☆14 · Oct 11, 2023 · Updated 2 years ago
- VQ-VAE implementation pytorch ☆11 · Mar 15, 2023 · Updated 3 years ago
- Code and data for the ACL 2024 Findings paper "Do LVLMs Understand Charts? Analyzing and Correcting Factual Errors in Chart Captioning" ☆27 · Jun 5, 2024 · Updated last year
- [ICML 2024] Code release for "On the Emergence of Cross-Task Linearity in Pretraining-Finetuning Paradigm" ☆11 · Feb 20, 2025 · Updated last year
- 2D Gaussian splatting for image compression ☆18 · Nov 29, 2024 · Updated last year
- ☆11 · Jan 12, 2023 · Updated 3 years ago
- Evaluate the Quality of Critique ☆37 · Jun 1, 2024 · Updated last year