BestAnHongjun/SentenceVAE

BestAnHongjun / SentenceVAE

Enable Next-sentence Prediction for Large Language Models with Faster Speed, Higher Accuracy and Longer Context

☆42

Alternatives and similar repositories for SentenceVAE

Users who are interested in SentenceVAE are comparing it to the libraries listed below.

gkevinyen5418 / LoRA-RITE
☆17 · Dec 7, 2025 · Updated 3 months ago
zepingyu0512 / in-context-mechanism
code for EMNLP 2024 paper: How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for M…
☆13 · Nov 17, 2024 · Updated last year
TransluceAI / .github
☆18 · Dec 12, 2025 · Updated 3 months ago
shawnlimn / ScaleGrad
Source code for ScaleGrad
☆19 · Dec 28, 2021 · Updated 4 years ago
weipeilun / Nested-Learning-Pytorch
Unofficial implementation of Google's Nested Learning framework in Pytorch
☆29 · Updated this week
SihengLi99 / SEALONG
Large Language Models Can Self-Improve in Long-context Reasoning
☆73 · Nov 24, 2024 · Updated last year
tianyi-lab / MoE-Embedding
[ICLR 2025 Oral] "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free"
☆90 · Oct 15, 2024 · Updated last year
HanseulJo / position-coupling
Position Coupling: Improving Length Generalization of Arithmetic Transformers Using Task Structure (NeurIPS 2024) + Arithmetic Transfor…
☆14 · Oct 26, 2025 · Updated 5 months ago
BaohaoLiao / ApiQ
[EMNLP 2024] Quantize LLM to extremely low-bit, and finetune the quantized LLMs
☆15 · Jul 18, 2024 · Updated last year
technion-cs-nlp / hallucination-mitigation
☆23 · Dec 17, 2024 · Updated last year
mjy1111 / PEAK
The repository for our paper: Neighboring Perturbations of Knowledge Editing on Large Language Models
☆16 · May 4, 2024 · Updated last year
Furyton / GR-as-MVDR
[SIGIR'24] Generative Retrieval as Multi-Vector Dense Retrieval
☆36 · Oct 18, 2024 · Updated last year
chuanyang-Zheng / DAPE
This is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"
☆41 · Oct 11, 2024 · Updated last year
seraphlabs-ca / SentenceMIM-demo
This repo contains code to reproduce some of the results presented in the paper "SentenceMIM: A Latent Variable Language Model"
☆28 · Jun 22, 2022 · Updated 3 years ago
PeterGriffinJin / LMIndexer
Language Models as Semantic Indexers (ICML 2024)
☆40 · May 2, 2024 · Updated last year
nasosger / MuToR
[NeurIPS '25] Multi-Token Prediction Needs Registers
☆28 · Dec 14, 2025 · Updated 3 months ago
dojeon-ai / PLASTIC
Code for the paper "PLASTIC: Improving Input and Label Plasticity for Sample Efficient Reinforcement Learning" (NeurIPS 2023)
☆22 · Dec 8, 2023 · Updated 2 years ago
Intelligent-Computing-Lab-Panda / TesseraQ
☆25 · Oct 31, 2024 · Updated last year
zzc-1024 / Visual-ANN
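A visual integrated development environment for artificial neural networks, based on a blueprint system. It turns the complex into the simple: dragging and dropping is enough to complete complex tasks.
☆10 · Jun 8, 2023 · Updated 2 years ago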
Aurora-cx / EmotionCircuits-LLM
EmotionCircuits-LLM: A complete, reproducible framework for discovering and controlling emotion circuits in large language models.
☆27 · Oct 20, 2025 · Updated 5 months ago
bbdjj / CGCOD
CGCOD: Class-Guided Camouflaged Object Detection
☆54 · Oct 16, 2025 · Updated 5 months ago
karlstratos / ammi
☆11 · Jul 15, 2020 · Updated 5 years ago
TOM-tym / Learn-to-Imagine
Official PyTorch implementation of CVPR2022 paper “Learning to Imagine: Diversify Memory for Incremental Learning using Unlabeled Data”
☆13 · Jul 25, 2022 · Updated 3 years ago
ys1998 / vae-latent-structure
PyTorch implementation of "Variational Autoencoders with Jointly Optimized Latent Dependency Structure" [ICLR 2019]
☆13 · Jul 14, 2019 · Updated 6 years ago
forwchen / LLaVA-MoLE
☆10 · Mar 4, 2024 · Updated 2 years ago
ScalingIntelligence / CATS
☆33 · Nov 11, 2024 · Updated last year
SethEBaldwin / mdscuda
CUDA implementation of Multidimensional Scaling
☆15 · May 8, 2021 · Updated 4 years ago
joewongjc / feishu-claude-code
Bridge Claude Code CLI with Feishu/Lark via WebSocket. Feishu × Claude Code real-time conversation.
☆29 · Updated this week
UCSC-VLAA / ReasoningEval
Official repo of Knowledge or Reasoning? A Close Look at How LLMs Think Across Domains.
☆44 · Jun 6, 2025 · Updated 9 months ago
huangch / qust
QuST: QuPath Extension for Integrative Whole Slide Image and Spatial Transcriptomics Analysis
☆34 · Jun 22, 2025 · Updated 9 months ago
matchten / LoRA-Models-for-SAEs
Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"
☆17 · Mar 31, 2025 · Updated 11 months ago
duterscmy / CD-MoE
Official PyTorch implementation of CD-MOE
☆12 · Mar 18, 2026 · Updated last week
Shuvomoy / BnB-PEP-code
☆21 · Jun 1, 2025 · Updated 9 months ago
ta012 / DTFAT
[AAAI 2024] DTF-AT: Decoupled Time-Frequency Audio Transformer for Event Classification
☆12 · Mar 10, 2025 · Updated last year
Fu-Dayuan / PreAct
PreAct: Prediction Enhances Agent's Planning Ability (COLING 2025)
☆30 · Dec 12, 2024 · Updated last year
PreferredAI / compare-lda
A Topic Model for Document Comparison
☆14 · Aug 23, 2019 · Updated 6 years ago
BugMakerzzz / toxic_cot
☆12 · Feb 28, 2025 · Updated last year
ShiyuNee / Awesome-LMs-Perception-of-Their-Knowledge-Boundaries-Papers
This is a repo consisting of papers about LLMs' perception of their knowledge boundaries; Uncertainty Quantification; Honesty Alignment; …
☆24 · Nov 25, 2025 · Updated 4 months ago
jacobdunefsky / llm-steering-opt
Tools for optimizing steering vectors in LLMs.
☆20 · Apr 10, 2025 · Updated 11 months ago