Train toy models using a multi-token prediction objective
☆14 May 8, 2024 Updated last year
Alternatives and similar repositories for multi-token-pred
Users who are interested in multi-token-pred compare it to the repositories listed below
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for… ☆28 Dec 10, 2024 Updated last year
- ☆10 Oct 29, 2020 Updated 5 years ago
- ☆12 May 26, 2022 Updated 3 years ago
- lncRNA-Py is a development package for applying machine learning and deep learning to the problem of lncRNA classification, i.e. predicti… ☆12 Jan 24, 2025 Updated last year
- Scratchpad/Chain-of-Thought Prompts ☆12 Jun 6, 2022 Updated 3 years ago
- [TGRS 2023] Official code for "EARL: An Elliptical Distribution aided Adaptive Label Assignment for Oriented Object Detection in Remote S… ☆14 Oct 11, 2023 Updated 2 years ago
- ☆10 May 23, 2022 Updated 3 years ago
- ☆18 Aug 20, 2024 Updated last year
- Implementation of the Influence Maximization Benchmarker (IMB) ☆14 Aug 10, 2023 Updated 2 years ago
- 2D Gaussian splatting for image compression ☆18 Nov 29, 2024 Updated last year
- Implementation of a recommender system using RQ-VAE Semantic IDs ☆16 Aug 11, 2025 Updated 6 months ago
- ☆10 Nov 3, 2023 Updated 2 years ago
- ☆15 Apr 11, 2023 Updated 2 years ago
- ☆10 Jun 28, 2025 Updated 8 months ago
- ☆13 Apr 3, 2024 Updated last year
- P4Control: Line-Rate Cross-Host Attack Prevention via In-Network Information Flow Control Enabled by Programmable Switches and eBPF ☆11 May 20, 2024 Updated last year
- ☆15 Jan 25, 2025 Updated last year
- Lane segmentation model trained with a TensorFlow implementation of a MobileNetV2-based U-Net ☆11 Mar 24, 2023 Updated 2 years ago
- Q&A dataset for many-shot jailbreaking ☆14 Jul 19, 2024 Updated last year
- A2C, ACKTR and A2T implementations for ViZDoom ☆10 Dec 18, 2017 Updated 8 years ago
- Hand Written Blots augmentation ☆12 Aug 28, 2025 Updated 6 months ago
- A PyTorch implementation of Proxy Anchor Loss based on CVPR 2020 paper "Proxy Anchor Loss for Deep Metric Learning" ☆11 Jan 16, 2021 Updated 5 years ago
- ☆11 May 27, 2020 Updated 5 years ago
- ☆10 Apr 20, 2018 Updated 7 years ago
- Using DTensor on Google Cloud ☆18 Sep 18, 2022 Updated 3 years ago
- This directory contains the MATLAB code for the paper Reconstructing higher-order interactions in coupled dynamical systems by Federico M… ☆12 May 2, 2024 Updated last year
- [ICLR 2025 Spotlight] Code release for "Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late In Training" ☆18 Feb 20, 2025 Updated last year
- Official Implementation for Oriented Object Detection via Contextual Dependence Mining and Penalty-Incentive Allocation ☆10 Dec 20, 2023 Updated 2 years ago
- ☆16 Mar 22, 2025 Updated 11 months ago
- Official implementation of our NeurIPS 2024 Paper "Boundary Matters: A Bi-Level Active Finetuning Method" ☆14 Feb 11, 2025 Updated last year
- Weighted-Boxes-Fusion method implementation with YOLOv4 and YOLOv5 ☆11 Jul 14, 2022 Updated 3 years ago
- Reinforcement Learning from Hierarchical Critics ☆13 Jul 30, 2020 Updated 5 years ago
- HSTU-BLaIR: Lightweight Contrastive Text Embedding for Generative Recommender 🌱 ☆21 Jul 4, 2025 Updated 8 months ago
- A sample client code for capturing panorama images by a modified AirSim ☆14 Aug 20, 2022 Updated 3 years ago
- ☆13 Jun 8, 2019 Updated 6 years ago
- PyTorch ImageNet1k Loader with Bounding Boxes. ☆13 Jan 23, 2022 Updated 4 years ago
- [ICML 2024] Code release for "On the Emergence of Cross-Task Linearity in Pretraining-Finetuning Paradigm" ☆11 Feb 20, 2025 Updated last year
- Cute layout visualization ☆30 Jan 18, 2026 Updated last month
- The official repo of "Knowledge Bridger: Towards Training-Free Missing Modality Completion" ☆21 Jun 30, 2025 Updated 8 months ago