Enable Next-sentence Prediction for Large Language Models with Faster Speed, Higher Accuracy and Longer Context
☆41Aug 16, 2024Updated last year
Alternatives and similar repositories for SentenceVAE
Users that are interested in SentenceVAE are comparing it to the libraries listed below
Sorting:
- Code and Dataset release of "Carpe Diem: On the Evaluation of World Knowledge in Lifelong Language Models" (NAACL 2024)☆10Oct 16, 2024Updated last year
- code for EMNLP 2024 paper: How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for M…☆13Nov 17, 2024Updated last year
- ☆16Dec 7, 2025Updated 2 months ago
- ☆13Oct 3, 2022Updated 3 years ago
- some object detection algo☆14Jul 25, 2024Updated last year
- The repository for our paper: Neighboring Perturbations of Knowledge Editing on Large Language Models☆16May 4, 2024Updated last year
- [NeurIPS '25] Multi-Token Prediction Needs Registers☆27Dec 14, 2025Updated 2 months ago
- ☆18Dec 12, 2025Updated 2 months ago
- ☆25Oct 31, 2024Updated last year
- QuST: QuPath Extension for Integrative Whole Slide Image and Spatial Transcriptomics Analysis☆34Jun 22, 2025Updated 8 months ago
- ☆23Dec 17, 2024Updated last year
- Non-official implementation of "Attention as an RNN" from https://arxiv.org/pdf/2405.13956, efficient associative parallel prefix scan an…☆27Jul 27, 2024Updated last year
- ☆11Mar 13, 2025Updated 11 months ago
- [SIGIR'24] Generative Retrieval as Multi-Vector Dense Retrieval☆36Oct 18, 2024Updated last year
- ☆32Nov 11, 2024Updated last year
- [ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models☆35Jun 12, 2024Updated last year
- This repo contains code to reproduce some of the results presented in the paper "SentenceMIM: A Latent Variable Language Model"☆28Jun 22, 2022Updated 3 years ago
- Large Language Models Can Self-Improve in Long-context Reasoning☆72Nov 24, 2024Updated last year
- RL with Experience Replay☆55Jul 27, 2025Updated 7 months ago
- ☆37Dec 19, 2024Updated last year
- Convolutional Channel-wise Competitive Learning for the Forward-Forward Algorithm. AAAI 2024☆11Jun 27, 2024Updated last year
- ☆13Dec 13, 2024Updated last year
- ☆10Aug 9, 2023Updated 2 years ago
- Deno Library to upload files to GCS and obtain signed url☆11Jan 16, 2024Updated 2 years ago
- [EMNLP 2024] RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization☆38Sep 24, 2024Updated last year
- RecGPT: Generative Pre-training for Text-based Recommendation (ACL 2024)☆40Sep 22, 2024Updated last year
- Official implementation of the ICLR paper "Streamlining Redundant Layers to Compress Large Language Models"☆40May 1, 2025Updated 10 months ago
- Spark projects. Learning book "Machine Learning with Spark"☆10Jun 3, 2017Updated 8 years ago
- Base implementation of the Multi-Encoder Variational AutoEncoder (ME-VAE)☆10Feb 28, 2022Updated 4 years ago
- A Generative Adversarial Network Model Alternative to Animal Studies for Clinical Pathology Assessment☆14Jan 10, 2024Updated 2 years ago
- EmotionCircuits-LLM: A complete, reproducible framework for discovering and controlling emotion circuits in large language models.☆25Oct 20, 2025Updated 4 months ago
- The this is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"☆41Oct 11, 2024Updated last year
- 《大语言模型》综述全书学习笔记☆13Aug 2, 2024Updated last year
- ☆12Jun 19, 2024Updated last year
- ☆35Mar 25, 2024Updated last year
- ☆47Nov 8, 2024Updated last year
- Source code for "A Simple but Effective Pluggable Entity Lookup Table for Pre-trained Language Models"☆44Nov 27, 2022Updated 3 years ago
- ☆10Feb 12, 2024Updated 2 years ago
- Smart contracts for a home rental network with IoT doorlocks☆11Jun 5, 2018Updated 7 years ago