phunterlau / code-in-blogLinks
all code examples in the blog posts
☆21Updated 10 months ago
Alternatives and similar repositories for code-in-blog
Users that are interested in code-in-blog are comparing it to the libraries listed below
Sorting:
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆51Updated last year
- A framework for fine-tuning retrieval-augmented generation (RAG) systems.☆136Updated this week
- ☆98Updated 8 months ago
- ☆146Updated last year
- Codebase accompanying the Summary of a Haystack paper.☆79Updated last year
- ☆48Updated last year
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆122Updated 9 months ago
- Source code for the collaborative reasoner research project at Meta FAIR.☆106Updated 7 months ago
- Fine-tune an LLM to perform batch inference and online serving.☆113Updated 6 months ago
- ☆80Updated last year
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated 11 months ago
- Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate.☆115Updated 4 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalization☆275Updated last year
- ☆79Updated last month
- Train LLM on Hugging Face infra☆67Updated 2 weeks ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆113Updated 7 months ago
- SynthGenAI - Package for Generating Synthetic Datasets using LLMs.☆50Updated this week
- RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo ranker☆123Updated 3 weeks ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆106Updated this week
- Leveraging Base Language Models for Few-Shot Synthetic Data Generation☆38Updated last month
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆117Updated 8 months ago
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use…☆154Updated this week
- Collection of resources for RL and Reasoning☆26Updated 9 months ago
- Model, Code & Data for the EMNLP'23 paper "Making Large Language Models Better Data Creators"☆137Updated 2 years ago
- Seemless interface of using PyTOrch distributed with Jupyter notebooks☆56Updated 2 months ago
- [ACL'25] Official Code for LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs☆314Updated 4 months ago
- EvalAssist is an open-source project that simplifies using large language models as evaluators (LLM-as-a-Judge) of the output of other la…☆92Updated last week
- ☆120Updated last year
- ☆20Updated 10 months ago
- ☆80Updated 2 weeks ago