seongminp / hyperseg
Code for HyperSeg and HyperSum
☆15 · Updated 3 months ago
Alternatives and similar repositories for hyperseg
Users interested in hyperseg are comparing it to the libraries listed below.
- ☆119 · Updated last year
- First-principle implementations of groundbreaking AI algorithms using a wide range of deep learning frameworks, accompanied by supporting… ☆178 · Updated 3 months ago
- Code for ExploreToM ☆86 · Updated 4 months ago
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024☆344Updated 6 months ago
- Notebooks and scripts that showcase running quantized diffusion models on consumer GPUs ☆38 · Updated last year
- Block Transformer: Global-to-Local Language Modeling for Fast Inference (NeurIPS 2024) ☆162 · Updated 6 months ago
- RetroLLM: Empowering LLMs to Retrieve Fine-grained Evidence within Generation [ACL 2025] ☆119 · Updated 9 months ago
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients. ☆202 · Updated last year
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM ☆59 · Updated last year
- Code for training and evaluating Contextual Document Embedding models ☆199 · Updated 5 months ago
- ☆96 · Updated 7 months ago
- ☆146 · Updated last year
- Banishing LLM Hallucinations Requires Rethinking Generalization ☆275 · Updated last year
- Survey: A collection of AWESOME papers and resources on the latest research in Mixture of Experts. ☆137 · Updated last year
- Repo hosting code and materials related to speeding up LLM inference using token merging. ☆37 · Updated 3 weeks ago
- MatFormer repo ☆64 · Updated 10 months ago
- The code repository of the paper: Competition and Attraction Improve Model Fusion ☆163 · Updated 2 months ago
- An open source implementation of LFMs from Liquid AI: Liquid Foundation Models ☆113 · Updated last year
- Getting started with TensorRT-LLM using BLOOM as a case study ☆23 · Updated last year
- Prune transformer layers ☆69 · Updated last year
- An unofficial PyTorch implementation of 'Efficient Infinite Context Transformers with Infini-attention' ☆53 · Updated last year
- Landing repository for the paper "Softpick: No Attention Sink, No Massive Activations with Rectified Softmax" ☆85 · Updated last month
- An efficient implementation of the method proposed in "The Era of 1-bit LLMs" ☆154 · Updated last year
- Enhancing Translation with RAG-Powered Large Language Models ☆84 · Updated last month
- Fine-tuning large language models (LLMs) is crucial for enhancing performance on domain-specific tasks. This comprehensiv… ☆12 · Updated last year
- Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation (NeurIPS 2025) ☆493 · Updated last month
- ☆136 · Updated last year
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F… ☆68 · Updated last year
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free ☆231 · Updated last year
- ☆266 · Updated 4 months ago