seongminp / hypersegLinks
Code for HyperSeg and HyperSum
☆15Updated 3 weeks ago
Alternatives and similar repositories for hyperseg
Users that are interested in hyperseg are comparing it to the libraries listed below
Sorting:
- Code for ExploreTom☆84Updated last month
- An open source implementation of LFMs from Liquid AI: Liquid Foundation Models☆107Updated 10 months ago
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024☆326Updated 3 months ago
- ☆118Updated 11 months ago
- Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation☆389Updated last week
- Block Transformer: Global-to-Local Language Modeling for Fast Inference (NeurIPS 2024)☆160Updated 4 months ago
- MathPrompter Implementation: This repository hosts an implementation based on the 'MathPrompter: Mathematical Reasoning Using Large Langu…☆13Updated 4 months ago
- code for training & evaluating Contextual Document Embedding models☆196Updated 3 months ago
- ☆145Updated last year
- Banishing LLM Hallucinations Requires Rethinking Generalization☆276Updated last year
- ☆131Updated 4 months ago
- Implementations of attention with the softpick function, naive and FlashAttention-2☆81Updated 3 months ago
- This repository contains the source code for the Saving 77% of the Parameters in Large Language Models Technical Report☆30Updated 5 months ago
- documentation for content creation☆211Updated this week
- Fine tune Gemma 3 on an object detection task☆74Updated 3 weeks ago
- Code repository for Black Mamba☆253Updated last year
- Unofficial implementation of https://arxiv.org/pdf/2407.14679☆48Updated 11 months ago
- Research projects built on top of Transformers☆71Updated 5 months ago
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆199Updated last year
- Set of scripts to finetune LLMs☆37Updated last year
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆36Updated 3 weeks ago
- Getting started with TensorRT-LLM using BLOOM as a case study☆20Updated last year
- ☆190Updated 8 months ago
- Building GPT ...☆18Updated 8 months ago
- ☆93Updated 4 months ago
- PyTorch implementation of models from the Zamba2 series.☆184Updated 6 months ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆78Updated 9 months ago
- Google TPU optimizations for transformers models☆118Updated 6 months ago
- The first dense retrieval model that can be prompted like an LM☆82Updated 3 months ago
- Notes on quantization in neural networks☆96Updated last year