Train toy models using multi-token prediction objective
☆14 · Apr 18, 2026 · Updated last month
Alternatives and similar repositories for multi-token-pred
Users who are interested in multi-token-pred are comparing it to the libraries listed below.
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for… ☆30 · Dec 10, 2024 · Updated last year
- lncRNA-Py is a development package for applying machine learning and deep learning to the problem of lncRNA classification, i.e. predicti… ☆12 · Jan 24, 2025 · Updated last year
- Scratchpad/Chain-of-Thought Prompts ☆12 · Jun 6, 2022 · Updated 3 years ago
- This directory contains the MATLAB code for the paper Reconstructing higher-order interactions in coupled dynamical systems by Federico M… ☆11 · May 2, 2024 · Updated 2 years ago
- Influence Maximization in Near-Linear Time: A Martingale Approach Scala implementation ☆14 · Sep 3, 2018 · Updated 7 years ago
- ☆10 · Nov 3, 2023 · Updated 2 years ago
- [CIKM 2024] Do We Really Need Graph Convolution During Training? Light Post-Training Graph-ODE for Efficient Recommendation ☆14 · Aug 11, 2024 · Updated last year
- [ICML 2024] VQDNA: Unleashing the Power of Vector Quantization for Multi-Species Genomic Sequence Modeling ☆10 · Sep 22, 2024 · Updated last year
- Provide RNA and DNA Foundation Model Benchmarks and Applications ☆29 · Nov 26, 2025 · Updated 5 months ago
- Source code for VLDB 2020 paper "Hypergraph Motifs: Concepts, Algorithms, and Discoveries." ☆17 · Jun 4, 2024 · Updated last year
- Train a text classification model with PyTorch, based on a pretrained BERT model ☆19 · Dec 26, 2023 · Updated 2 years ago
- KDD 2025: "Predicting the Dynamics of Complex Systems via Multiscale Diffusion Autoencoder" ☆21 · Jun 2, 2025 · Updated 11 months ago
- ☆15 · Mar 15, 2024 · Updated 2 years ago
- AAAI '25. Retrieval-Augmented Multimodal Social Media Popularity Prediction ☆23 · Apr 27, 2026 · Updated 3 weeks ago
- BPfold: Deep generalizable prediction of RNA secondary structure via base pair motif energy. ☆34 · May 10, 2026 · Updated 2 weeks ago
- [AAAI 2024] Official PyTorch Implementation of "Unknown-Aware Graph Regularization for Robust Semi-supervised Learning from Uncurated Dat… ☆15 · May 29, 2025 · Updated 11 months ago
- CasSeqGCN: Combining Network Structure and Temporal Sequence to Predict Information Cascades ☆16 · Nov 21, 2021 · Updated 4 years ago
- P4Control: Line-Rate Cross-Host Attack Prevention via In-Network Information Flow Control Enabled by Programmable Switches and eBPF ☆11 · May 20, 2024 · Updated 2 years ago
- ☆39 · Mar 29, 2024 · Updated 2 years ago
- CIFAR10 ResNets implemented in JAX+Flax ☆12 · Apr 6, 2022 · Updated 4 years ago
- ☆23 · Nov 8, 2023 · Updated 2 years ago
- Official implementation of Our NeurIPS 2024 Paper "Boundary Matters: A Bi-Level Active Finetuning Method" ☆14 · Feb 11, 2025 · Updated last year
- ☆10 · May 23, 2022 · Updated 4 years ago
- Algorithms for online influence maximization ☆24 · Feb 20, 2017 · Updated 9 years ago
- Code for ACL2021 long paper: Knowledgeable or Educated Guess? Revisiting Language Models as Knowledge Bases ☆29 · Dec 8, 2021 · Updated 4 years ago
- The Four Great Classical Novels of Chinese literature ☆21 · Jan 2, 2023 · Updated 3 years ago
- Semantic-decoupled Spatial Partition Guided Point-supervised Oriented Object Detection ☆13 · Jun 17, 2025 · Updated 11 months ago
- VQ-VAE implementation pytorch ☆11 · Mar 15, 2023 · Updated 3 years ago
- Code and data for the ACL 2024 Findings paper "Do LVLMs Understand Charts? Analyzing and Correcting Factual Errors in Chart Captioning" ☆27 · Jun 5, 2024 · Updated last year
- ChineseHarm-Bench: A Chinese Harmful Content Detection Benchmark ☆55 · Sep 2, 2025 · Updated 8 months ago
- (IJCAI 2023) Sph2Pob: Boosting Object Detection on Spherical Images with Planar Oriented Boxes Methods ☆14 · Aug 23, 2023 · Updated 2 years ago
- Chinese notes of SplaTam (3DGS-based SLAM) ☆15 · Feb 23, 2025 · Updated last year
- ☆11 · Jan 12, 2023 · Updated 3 years ago
- Evaluate the Quality of Critique ☆37 · Jun 1, 2024 · Updated last year
- Official repository for paper "DeepCritic: Deliberate Critique with Large Language Models" ☆41 · Jun 24, 2025 · Updated 11 months ago
- 2D Gaussian splatting for image compression ☆19 · Nov 29, 2024 · Updated last year
- Evaluation on Logical Reasoning and Abstract Reasoning Challenges ☆30 · Apr 21, 2025 · Updated last year
- website repo for agent-based social movement simulation ☆27 · Jun 17, 2024 · Updated last year
- ☆26 · Oct 17, 2020 · Updated 5 years ago