spcl / CheckEmbed
Official Implementation of "CheckEmbed: Effective Verification of LLM Solutions to Open-Ended Tasks"
☆22Updated 7 months ago
Alternatives and similar repositories for CheckEmbed
Users who are interested in CheckEmbed are comparing it to the libraries listed below.
- ☆44Updated 8 months ago
- Utilities for constructing a large dataset of LLVM IR☆25Updated 7 months ago
- Information and artifacts for "LoRA Learns Less and Forgets Less" (TMLR, 2024)☆19Updated last year
- ☆83Updated last year
- Official Implementation of "DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucination"☆27Updated last year
- Library to interface Compilers and ML models for ML-Enabled Compiler Optimizations☆20Updated 3 months ago
- ☆10Updated last year
- Parallel framework for training and fine-tuning deep neural networks☆70Updated 2 months ago
- ☆12Updated 9 months ago
- Sparsity support for PyTorch☆38Updated 10 months ago
- [NAACL 2025] Official Implementation of "HMT: Hierarchical Memory Transformer for Long Context Language Processing"☆80Updated 3 weeks ago
- ☆69Updated last week
- some mixture of experts architecture implementations☆25Updated last year
- [ICML 2024] "LoCoCo: Dropping In Convolutions for Long Context Compression", Ruisi Cai, Yuandong Tian, Zhangyang Wang, Beidi Chen☆18Updated last year
- ☆19Updated 9 months ago
- Source code for "BenchPress: A Deep Active Benchmark Generator", PACT 2022☆21Updated 2 years ago
- ☆38Updated last year
- ☆57Updated last year
- A library for code transformations with guaranteed legality☆20Updated last week
- An extension to the GaLore paper, to perform Natural Gradient Descent in a low-rank subspace☆18Updated last year
- AskIt: Unified programming interface for programming with LLMs (GPT-3.5, GPT-4, Gemini, Claude, Cohere, Llama 2)☆80Updated last year
- [CoLM 24] Official Repository of MambaByte: Token-free Selective State Space Model☆24Updated last year
- Source code for Activated LoRA☆23Updated 2 months ago
- Beyond KV Caching: Shared Attention for Efficient LLMs☆20Updated last year
- FlexAttention w/ FlashAttention3 Support☆27Updated last year
- train with kittens!☆63Updated last year
- Computing the greatest common divisor with transformers, source code for the paper https://arxiv.org/abs/2308.15594☆14Updated 5 months ago
- Intel Gaudi's Megatron DeepSpeed Large Language Models for training☆16Updated last year
- A Data-Centric Compiler for Machine Learning☆85Updated last month
- Code for the EMNLP24 paper "A simple and effective L2 norm based method for KV Cache compression."☆18Updated last year