cavedweller509/SentenceVAE

Enable Next-sentence Prediction for Large Language Models with Faster Speed, Higher Accuracy and Longer Context

☆42

Alternatives and similar repositories for SentenceVAE

Users who are interested in SentenceVAE are comparing it to the repositories listed below.

kimyuji / EvolvingQA_benchmark
View on GitHub
Code and Dataset release of "Carpe Diem: On the Evaluation of World Knowledge in Lifelong Language Models" (NAACL 2024)
☆10Oct 16, 2024Updated last year
TransluceAI / .github
View on GitHub
☆19Dec 12, 2025Updated 7 months ago
zepingyu0512 / arithmetic-mechanism
View on GitHub
code for EMNLP 2024 paper: Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron Analysis
☆12Nov 17, 2024Updated last year
shawnlimn / ScaleGrad
View on GitHub
Source code for ScaleGrad
☆19Dec 28, 2021Updated 4 years ago
gkevinyen5418 / LoRA-RITE
View on GitHub
☆19Dec 7, 2025Updated 7 months ago
SihengLi99 / SEALONG
View on GitHub
Large Language Models Can Self-Improve in Long-context Reasoning
☆72Nov 24, 2024Updated last year
BaohaoLiao / ApiQ
View on GitHub
[EMNLP 2024] Quantize LLM to extremely low-bit, and finetune the quantized LLMs
☆15Jul 18, 2024Updated 2 years ago
tianyi-lab / MoE-Embedding
View on GitHub
[ICLR 2025 Oral] "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free"
☆92Oct 15, 2024Updated last year
ForrestPi / ObjectDetection
View on GitHub
some object detection algo
☆14Jul 25, 2024Updated last year
mjy1111 / PEAK
View on GitHub
The repository for our paper: Neighboring Perturbations of Knowledge Editing on Large Language Models
☆16May 4, 2024Updated 2 years ago
Furyton / GR-as-MVDR
View on GitHub
[SIGIR'24] Generative Retrieval as Multi-Vector Dense Retrieval
☆36Oct 18, 2024Updated last year
seraphlabs-ca / SentenceMIM-demo
View on GitHub
This repo contains code to reproduce some of the results presented in the paper "SentenceMIM: A Latent Variable Language Model"
☆28Jun 22, 2022Updated 4 years ago
chuanyang-Zheng / DAPE
View on GitHub
This is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"
☆41Oct 11, 2024Updated last year
HanseulJo / position-coupling
View on GitHub
Position Coupling: Improving Length Generalization of Arithmetic Transformers Using Task Structure (NeurIPS 2024) + Arithmetic Transfor…
☆14Oct 26, 2025Updated 8 months ago
Intelligent-Computing-Lab-Panda / TesseraQ
View on GitHub
☆25Oct 31, 2024Updated last year
zzc-1024 / Visual-ANN
View on GitHub
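A visual integrated development environment for artificial neural networks, based on a blueprint system. It simplifies complexity: with simple drag-and-drop, you can complete complex tasks.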
☆10Jun 8, 2023Updated 3 years ago
SciFracX / FractionalTransforms.jl
View on GitHub
FractionalTransforms.jl: A Julia package aiming at providing fractional order transforms with high performance.
☆16Jun 17, 2026Updated last month
etimush / ARC_NCA
View on GitHub
Repo for solving ARC problems with a Neural Cellular Automata
☆27Mar 9, 2026Updated 4 months ago
ys1998 / vae-latent-structure
View on GitHub
PyTorch implementation of "Variational Autoencoders with Jointly Optimized Latent Dependency Structure" [ICLR 2019]
☆13Jul 14, 2019Updated 7 years ago
cvenhoff / vlm-mapping
View on GitHub
☆19Jun 20, 2025Updated last year
aicheung / option-data-service
View on GitHub
Obtain options data from Interactive Brokers (IBKR) API
☆10Nov 11, 2022Updated 3 years ago
ScalingIntelligence / CATS
View on GitHub
☆33Nov 11, 2024Updated last year
UCSB-AI / ComCLIP
View on GitHub
Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"
☆37Aug 18, 2024Updated last year
GATECH-EIC / Linearized-LLM
View on GitHub
[ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models
☆35Jun 12, 2024Updated 2 years ago
OPTML-Group / DP4TL
View on GitHub
[NeurIPS2023] "Selectivity Drives Productivity: Efficient Dataset Pruning for Enhanced Transfer Learning" by Yihua Zhang*, Yimeng Zhang*,…
☆14Oct 12, 2023Updated 2 years ago
SethEBaldwin / mdscuda
View on GitHub
CUDA implementation of Multidimensional Scaling
☆15May 8, 2021Updated 5 years ago
sungnyun / cav2vec
View on GitHub
(ICLR 2025) Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation
☆16Apr 29, 2025Updated last year
claCase / Attention-as-RNN
View on GitHub
Non-official implementation of "Attention as an RNN" from https://arxiv.org/pdf/2405.13956, efficient associative parallel prefix scan an…
☆28Jul 27, 2024Updated last year
jugechengzi / Rationalization-MGR
View on GitHub
ACL 2023 *oral* paper "MGR: Multi-generator based Rationalization"
☆10Nov 21, 2024Updated last year
Shuvomoy / BnB-PEP-code
View on GitHub
☆21Jun 1, 2025Updated last year
matchten / LoRA-Models-for-SAEs
View on GitHub
Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"
☆17Mar 31, 2025Updated last year
SukerZ / The-PyTorch-Self-Driving-Experiment-by-DDPG-on-TORCS
View on GitHub
A PyTorch reimplementation of the most widely circulated TORCS experiment, originally built with Keras and TensorFlow. Trains a DDPG model.
☆12Dec 23, 2018Updated 7 years ago
Sammy20207109 / DyCo-RL
View on GitHub
DyCo-RL: Dynamic Cross-Modal Coordination for Visual Reasoning
☆18Jun 14, 2026Updated last month
BugMakerzzz / toxic_cot
View on GitHub
☆12Feb 28, 2025Updated last year
IgorWounds / Deribit-API-Algotrading101
View on GitHub
Deribit API article code
☆11Sep 4, 2022Updated 3 years ago
ustctf-zz / delibnet
View on GitHub
☆14Nov 16, 2022Updated 3 years ago
locuslab / acr-memorization
View on GitHub
☆41Dec 19, 2024Updated last year
kelechi-c / dit_flow
View on GitHub
DiT (training + flow matching) in Jax
☆12Jan 5, 2025Updated last year
Rh-Dang / DAT
View on GitHub
Dual Adaptive Thinking (DAT) for object navigation
☆14Sep 10, 2022Updated 3 years ago