Enable Next-sentence Prediction for Large Language Models with Faster Speed, Higher Accuracy and Longer Context
☆42Aug 16, 2024Updated last year
Alternatives and similar repositories for SentenceVAE
Users that are interested in SentenceVAE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆17Dec 7, 2025Updated 3 months ago
- code for EMNLP 2024 paper: How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for M…☆13Nov 17, 2024Updated last year
- ☆18Dec 12, 2025Updated 3 months ago
- Source code for ScaleGrad☆19Dec 28, 2021Updated 4 years ago
- Unofficial implementation of Google's Nested Learning framework in Pytorch☆29Updated this week
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Large Language Models Can Self-Improve in Long-context Reasoning☆73Nov 24, 2024Updated last year
- [ICLR 2025 Oral] "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free"☆90Oct 15, 2024Updated last year
- Position Coupling: Improving Length Generalization of Arithmetic Transformers Using Task Structure (NeurIPS 2024) + Arithmetic Transfor…☆14Oct 26, 2025Updated 5 months ago
- [EMNLP 2024] Quantize LLM to extremely low-bit, and finetune the quantized LLMs☆15Jul 18, 2024Updated last year
- ☆23Dec 17, 2024Updated last year
- The repository for our paper: Neighboring Perturbations of Knowledge Editing on Large Language Models☆16May 4, 2024Updated last year
- [SIGIR'24] Generative Retrieval as Multi-Vector Dense Retrieval☆36Oct 18, 2024Updated last year
- The this is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"☆41Oct 11, 2024Updated last year
- This repo contains code to reproduce some of the results presented in the paper "SentenceMIM: A Latent Variable Language Model"☆28Jun 22, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Language Models as Semantic Indexers (ICML 2024)☆40May 2, 2024Updated last year
- [NeurIPS '25] Multi-Token Prediction Needs Registers☆28Dec 14, 2025Updated 3 months ago
- Code for the paper "PLASTIC: Improving Input and Label Plasticity for Sample Efficient Reinforcement Learning" (NeurIPS 2023)☆22Dec 8, 2023Updated 2 years ago
- ☆25Oct 31, 2024Updated last year
- 基于蓝图系统的人工神经网络可视化集成开发环境。化繁为简,简单拖拽,就能完成复杂的任务。☆10Jun 8, 2023Updated 2 years ago
- EmotionCircuits-LLM: A complete, reproducible framework for discovering and controlling emotion circuits in large language models.☆27Oct 20, 2025Updated 5 months ago
- CGCOD: Class-Guided Camouflaged Object Detection☆54Oct 16, 2025Updated 5 months ago
- ☆11Jul 15, 2020Updated 5 years ago
- Official PyTorch implementation of CVPR2022 paper “Learning to Imagine: Diversify Memory for Incremental Learning using Unlabeled Data”☆13Jul 25, 2022Updated 3 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- PyTorch implementation of "Variational Autoencoders with Jointly Optimized Latent Dependency Structure" [ICLR 2019]☆13Jul 14, 2019Updated 6 years ago
- ☆10Mar 4, 2024Updated 2 years ago
- ☆33Nov 11, 2024Updated last year
- CUDA implementation of Multidimensional Scaling☆15May 8, 2021Updated 4 years ago
- Bridge Claude Code CLI with Feishu/Lark via WebSocket. 飞书 × Claude Code 实时对话。☆29Updated this week
- Official repo of Knowledge or Reasoning? A Close Look at How LLMs Think Across Domains.☆44Jun 6, 2025Updated 9 months ago
- QuST: QuPath Extension for Integrative Whole Slide Image and Spatial Transcriptomics Analysis☆34Jun 22, 2025Updated 9 months ago
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"☆17Mar 31, 2025Updated 11 months ago
- Official PyTorch implementation of CD-MOE☆12Mar 18, 2026Updated last week
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆21Jun 1, 2025Updated 9 months ago
- [AAAI 2024] DTF-AT: Decoupled Time-Frequency Audio Transformer for Event Classification☆12Mar 10, 2025Updated last year
- PreAct: Prediction Enhances Agent's Planning Ability (Coling2025)☆30Dec 12, 2024Updated last year
- A Topic Model for Document Comparison☆14Aug 23, 2019Updated 6 years ago
- ☆12Feb 28, 2025Updated last year
- This is a repo consisting of papers about LLMs' perception of their knowledge boundaries; Uncertainty Quantification; Honesty Alignment; …☆24Nov 25, 2025Updated 4 months ago
- Tools for optimizing steering vectors in LLMs.☆20Apr 10, 2025Updated 11 months ago