Train toy models using multi-token prediction objective
☆14May 8, 2024Updated last year
Alternatives and similar repositories for multi-token-pred
Users that are interested in multi-token-pred are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆29Dec 10, 2024Updated last year
- lncRNA-Py is a development package for applying machine learning and deep learning to the problem of lncRNA classification, i.e. predicti…☆12Jan 24, 2025Updated last year
- Track 5: Cross-Platform 3D Object Detection☆21Aug 16, 2025Updated 7 months ago
- Scratchpad/Chain-of-Thought Prompts☆12Jun 6, 2022Updated 3 years ago
- ☆10Oct 29, 2020Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Implementation of the Influence Maximization Benchmarker (IMB)☆14Aug 10, 2023Updated 2 years ago
- Code used in Tiukhova et al. (2022). Influencer Detection with Dynamic Graph Neural Networks. TGL@Neurips 2022.☆11Feb 9, 2023Updated 3 years ago
- This directory contains the MATLAB code for the paper Reconstructing higher-order interactions in coupled dynamical systems by Federico M…☆11May 2, 2024Updated last year
- ☆16Mar 22, 2025Updated last year
- ☆15Jan 23, 2025Updated last year
- Influence Maximization in Near-Linear Time: A Martingale Approach Scala implementation☆14Sep 3, 2018Updated 7 years ago
- Manually construct IP, TCP, UDP, and ICMP packets based on DPDK, commonly used for packet simulation, network security attack testing, fi…☆10Nov 29, 2024Updated last year
- The official Languini Kitchen repository☆14May 6, 2024Updated last year
- A proofreading tool using Google's N-gram corpus.☆12Sep 2, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- MCP tools to connect LLMs and ABACUS jobs☆19Mar 13, 2026Updated last week
- [ACL 2025 Findings] Implicit Reasoning in Transformers is Reasoning through Shortcuts☆17Mar 11, 2025Updated last year
- The official repository for the paper "Real-world Reinforcement Learning from Suboptimal Interventions”.☆39Updated this week
- ☆10Nov 3, 2023Updated 2 years ago
- Code for "Hierarchical Diffusion Attention Network" (IJCAI 2019)☆14Apr 23, 2020Updated 5 years ago
- ☆11May 27, 2020Updated 5 years ago
- [CIKM 2024] Do We Really Need Graph Convolution During Training? Light Post-Training Graph-ODE for Efficient Recommendation☆14Aug 11, 2024Updated last year
- [ICML 2024] VQDNA: Unleashing the Power of Vector Quantization for Multi-Species Genomic Sequence Modeling☆10Sep 22, 2024Updated last year
- 🎉 TrustJudge is accepted to ICLR 2026!☆38Sep 27, 2025Updated 5 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Provide RNA and DNA Foundation Model Benchmarks and Applications☆26Nov 26, 2025Updated 3 months ago
- A flexible & scalable MLLM-based AIGC detection pipeline☆31Oct 27, 2025Updated 4 months ago
- Pytorch implementation for NeurIPS-23:"GNNEvaluator: Evaluating GNN Performance On Unseen Graphs Without Labels"☆19Mar 21, 2024Updated 2 years ago
- Source code for VLDB 2020 paper "Hypergraph Motifs: Concepts, Algorithms, and Discoveries."☆17Jun 4, 2024Updated last year
- ☆19Sep 21, 2018Updated 7 years ago
- Single-sequence and Profile-based Prediction of RNA Solvent Accessibility Using Dilated Convolution Neural Network☆13Jun 22, 2022Updated 3 years ago
- for all, home☆16Mar 6, 2026Updated 2 weeks ago
- Multi-scale Information Diffusion Prediction with Sequential Hypergraphs☆13Apr 13, 2024Updated last year
- BPfold: Deep generalizable prediction of RNA secondary structure via base pair motif energy.☆31Feb 24, 2026Updated last month
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- a WIP architecture designed to allow transformers to think in a manner without tokens☆20Apr 12, 2024Updated last year
- 基于BERT预训练模型使用pythorch训练文本分类模型☆18Dec 26, 2023Updated 2 years ago
- ☆14Mar 15, 2024Updated 2 years ago
- KDD 2025: "Predicting the Dynamics of Complex Systems via Multiscale Diffusion Autoencoder"☆19Jun 2, 2025Updated 9 months ago
- DISCO: Influence Maximization Meets Graph Embedding and Deep Learning (a.k.a. PIANO)☆16Apr 1, 2022Updated 3 years ago
- Using xml to define pytorch neural networks☆15Jan 24, 2019Updated 7 years ago
- visualization program for vlp-16 based on a viz class☆11Feb 8, 2017Updated 9 years ago