seongminp / hypersegLinks
Code for HyperSeg and HyperSum
☆16Updated 4 months ago
Alternatives and similar repositories for hyperseg
Users that are interested in hyperseg are comparing it to the libraries listed below
Sorting:
- ☆120Updated last year
- code for training & evaluating Contextual Document Embedding models☆200Updated 6 months ago
- Code for ExploreTom☆87Updated 4 months ago
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024☆346Updated 6 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalization☆275Updated last year
- Block Transformer: Global-to-Local Language Modeling for Fast Inference (NeurIPS 2024)☆162Updated 7 months ago
- ☆200Updated 11 months ago
- ☆146Updated last year
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs☆38Updated last year
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆98Updated 6 months ago
- First-principle implementations of groundbreaking AI algorithms using a wide range of deep learning frameworks, accompanied by supporting…☆179Updated 4 months ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆60Updated last year
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆202Updated last year
- Research projects built on top of Transformers☆100Updated 8 months ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆37Updated last month
- LongRoPE is a novel method that can extends the context window of pre-trained LLMs to an impressive 2048k tokens.☆271Updated 3 weeks ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆48Updated last year
- [ACL 2025] RetroLLM: Empowering LLMs to Retrieve Fine-grained Evidence within Generation☆119Updated 10 months ago
- ☆138Updated 3 months ago
- PyTorch implementation of models from the Zamba2 series.☆185Updated 10 months ago
- "Pooling And Attention: What Are Effective Designs For LLM-Based Embedding Models?"☆37Updated last year
- Prune transformer layers☆74Updated last year
- MathPrompter Implementation: This repository hosts an implementation based on the 'MathPrompter: Mathematical Reasoning Using Large Langu…☆14Updated 7 months ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆242Updated last year
- Set of scripts to finetune LLMs☆38Updated last year
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆291Updated 8 months ago
- Train LLM on Hugging Face infra☆67Updated last week
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F…☆69Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆51Updated last year
- MatFormer repo☆65Updated 11 months ago