seongminp / hypersegLinks
Code for HyperSeg and HyperSum
☆16Updated 5 months ago
Alternatives and similar repositories for hyperseg
Users that are interested in hyperseg are comparing it to the libraries listed below
Sorting:
- ☆120Updated last year
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024☆349Updated 7 months ago
- ☆148Updated last year
- ☆103Updated 8 months ago
- [ACL 2025] RetroLLM: Empowering LLMs to Retrieve Fine-grained Evidence within Generation☆118Updated 10 months ago
- Block Transformer: Global-to-Local Language Modeling for Fast Inference (NeurIPS 2024)☆162Updated 8 months ago
- Train LLM on Hugging Face infra☆67Updated last month
- Library to facilitate pruning of LLMs based on context☆32Updated last year
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs☆38Updated last year
- Code for ExploreTom☆89Updated 5 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalization☆275Updated last year
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆167Updated 3 months ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆37Updated 2 months ago
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F…☆69Updated last year
- flow-merge is a powerful Python library that enables seamless merging of multiple transformer-based language models using the most popula…☆20Updated 10 months ago
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆103Updated 7 months ago
- An open source implementation of LFMs from Liquid AI: Liquid Foundation Models☆114Updated last year
- Set of scripts to finetune LLMs☆38Updated last year
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆124Updated 10 months ago
- ☆113Updated 3 months ago
- Fine tune Gemma 3 on an object detection task☆92Updated 5 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆241Updated last week
- This is the repo for the paper "PANGEA: A FULLY OPEN MULTILINGUAL MULTIMODAL LLM FOR 39 LANGUAGES"☆117Updated 5 months ago
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆202Updated last year
- ☆138Updated 4 months ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆245Updated last year
- Reproduction of DeepSeek-R1☆242Updated 8 months ago
- code for training & evaluating Contextual Document Embedding models☆201Updated 7 months ago
- This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets.☆77Updated 5 months ago
- PyTorch implementation of models from the Zamba2 series.☆186Updated 10 months ago