spcl / CheckEmbed
Official Implementation of "CheckEmbed: Effective Verification of LLM Solutions to Open-Ended Tasks"
☆17Updated last month
Alternatives and similar repositories for CheckEmbed:
Users that are interested in CheckEmbed are comparing it to the libraries listed below
- Utilities for constructing a large dataset of LLVM IR☆16Updated 5 months ago
- An evaluation suite for Retrieval-Augmented Generation (RAG).☆14Updated 3 months ago
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches☆14Updated 5 years ago
- Repository for "GIST: Distributed training for large-scale graph convolutional networks"☆14Updated 2 years ago
- ☆43Updated 2 months ago
- Explore training for quantized models☆12Updated last week
- ☆20Updated last year
- Fast and memory-efficient exact attention☆52Updated last month
- LLVM-Canon aims to transform LLVM modules into a canonical form by reordering and renaming instructions while preserving the same semanti…☆14Updated 8 months ago
- Personal solutions to the Triton Puzzles☆18Updated 6 months ago
- Code repository for the public reproduction of the language modelling experiments on "MatFormer: Nested Transformer for Elastic Inference…☆18Updated last year
- FlexAttention w/ FlashAttention3 Support☆27Updated 3 months ago
- A minimal implementation of vllm.☆32Updated 5 months ago
- [EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs☆22Updated 2 months ago
- TidalDecode: A Fast and Accurate LLM Decoding with Position Persistent Sparse Attention☆26Updated last month
- Minimum Description Length probing for neural network representations☆18Updated last week
- ☆16Updated 2 years ago
- Code for paper "W-RAG: Weakly Supervised Dense Retrieval in RAG for Open-domain Question Answering"☆11Updated 4 months ago
- Benchmark tests supporting the TiledCUDA library.☆12Updated last month
- Implementation of Hyena Hierarchy in JAX☆10Updated last year
- Training and Benchmarking LLMs for Code Preference.☆28Updated 2 months ago
- PipeRAG: Fast Retrieval-Augmented Generation via Algorithm-System Co-design (KDD 2025)☆16Updated 7 months ago
- TensorRT LLM Benchmark Configuration☆12Updated 5 months ago
- ☆21Updated last year
- The official evaluation suite and dynamic data release for MixEval.☆10Updated 3 months ago
- The Efficiency Spectrum of LLM☆52Updated last year
- Example ML projects that use the Determined library.☆25Updated 4 months ago
- A testbed for agents and environments that can automatically improve models through data generation.☆15Updated last month
- Awesome Triton Resources☆19Updated last month
- ☆12Updated 3 years ago