spcl / CheckEmbed
Official Implementation of "CheckEmbed: Effective Verification of LLM Solutions to Open-Ended Tasks"
☆14Updated last week
Related projects ⓘ
Alternatives and complementary repositories for CheckEmbed
- Utilities for constructing a large dataset of LLVM IR☆15Updated 3 months ago
- Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs☆34Updated 9 months ago
- SMT-LIB benchmarks for shape computations from deep learning models in PyTorch☆18Updated last year
- LLM-Inference-Bench☆11Updated 2 weeks ago
- ☆11Updated 3 years ago
- ☆20Updated 8 months ago
- An Architecture-level Fault Injection Tool for GPU Application Resilience Evaluations☆16Updated 4 years ago
- A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code☆13Updated last year
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches☆14Updated 5 years ago
- LLVM-Canon aims to transform LLVM modules into a canonical form by reordering and renaming instructions while preserving the same semanti…☆12Updated 6 months ago
- Training and Benchmarking LLMs for Code Preference.☆25Updated last week
- SatLM: SATisfiability-Aided Language Models using Declarative Prompting (NeurIPS 2023)☆42Updated 4 months ago
- TensorRT LLM Benchmark Configuration☆11Updated 4 months ago
- Compression for Foundation Models☆19Updated last month
- ☆19Updated last year
- [preprint] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆17Updated this week
- An Attention Superoptimizer☆20Updated 6 months ago
- Source code for "BenchPress: A Deep Active Benchmark Generator", PACT 2022☆21Updated last year
- Repo for the research paper "Aligning LLMs to Be Robust Against Prompt Injection"☆19Updated 3 weeks ago
- Hydragen: High-Throughput LLM Inference with Shared Prefixes☆25Updated 6 months ago
- ☆31Updated 2 months ago
- [ACL 2024] RelayAttention for Efficient Large Language Model Serving with Long System Prompts☆34Updated 8 months ago
- Cascade Speculative Drafting☆26Updated 7 months ago
- Evaluation of neuro-symbolic engines☆33Updated 3 months ago
- Computing the greatest common divisor with transformers, source code for the paper https//arxiv.org/abs/2308.15594☆12Updated 8 months ago
- Code for Large Graph Convolutional Network Training with GPU-Oriented Data Communication Architecture (accepted by PVLDB).The outdated wr…☆8Updated last year
- FlexAttention w/ FlashAttention3 Support☆27Updated last month
- ☆35Updated 3 weeks ago
- Code repository for the public reproduction of the language modelling experiments on "MatFormer: Nested Transformer for Elastic Inference…☆18Updated last year
- Beyond KV Caching: Shared Attention for Efficient LLMs☆13Updated 4 months ago