mkuchnik / relm
ReLM is a Regular Expression engine for Language Models
☆104Updated last year
Related projects ⓘ
Alternatives and complementary repositories for relm
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆93Updated 5 months ago
- ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward exp…☆216Updated 7 months ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆63Updated last year
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆98Updated 10 months ago
- Attribute (or cite) statements generated by LLMs back to in-context information.☆148Updated last month
- experiments with inference on llama☆105Updated 5 months ago
- Mixing Language Models with Self-Verification and Meta-Verification☆97Updated last year
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆203Updated 6 months ago
- Functional Benchmarks and the Reasoning Gap☆78Updated last month
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆128Updated last month
- Evaluating LLMs with fewer examples☆135Updated 7 months ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆81Updated last year
- The official repo for "LLoCo: Learning Long Contexts Offline"☆113Updated 5 months ago
- Evaluating LLMs with CommonGen-Lite☆85Updated 8 months ago
- ☆94Updated 2 months ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆81Updated this week
- Manage scalable open LLM inference endpoints in Slurm clusters☆238Updated 4 months ago
- Retrieval Augmented Generation Generalized Evaluation Dataset☆51Updated this week
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆48Updated 4 months ago
- Reimplementation of the task generation part from the Alpaca paper☆119Updated last year
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆122Updated 8 months ago
- minimal pytorch implementation of bm25 (with sparse tensors)☆90Updated 8 months ago
- Fast & more realistic evaluation of chat language models. Includes leaderboard.☆183Updated 11 months ago
- ☆91Updated last year
- ☆200Updated 9 months ago
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆161Updated 10 months ago
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…☆145Updated last year
- ☆72Updated last year
- Code accompanying "How I learned to start worrying about prompt formatting".☆95Updated last month