phunterlau / code-in-blogLinks

all code examples in the blog posts

☆22

Alternatives and similar repositories for code-in-blog

Users that are interested in code-in-blog are comparing it to the libraries listed below

Sorting:

VectorInstitute / fed-rag
A framework for fine-tuning retrieval-augmented generation (RAG) systems.
☆122Updated this week
deshwalmahesh / PHUDGE
Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…
☆49Updated last year
apple / ml-superposition-prompting
☆145Updated 11 months ago
ali-bahrainian / RAG_best_practices
☆94Updated 3 months ago
davanstrien / data-for-fine-tuning-llms
☆77Updated last year
PrithivirajDamodaran / Route0x
Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da
☆105Updated 3 months ago
weaviate / structured-rag
Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models
☆108Updated 3 months ago
AgnostiqHQ / multi-agent-llm
Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)
☆115Updated 5 months ago
Pleias / RL-Reasoning
Collection of resources for RL and Reasoning
☆25Updated 5 months ago
anyscale / e2e-llm-workflows
Fine-tune an LLM to perform batch inference and online serving.
☆112Updated last month
salesforce / summary-of-a-haystack
Codebase accompanying the Summary of a Haystack paper.
☆79Updated 9 months ago
HishamAlyahya / semantic_backprop
Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖
☆71Updated 7 months ago
LLMSELECTOR / LLMSELECTOR
☆71Updated 4 months ago
marib00 / llamaindex-embedding-lora
☆29Updated last year
lamini-ai / Lamini-Memory-Tuning
Banishing LLM Hallucinations Requires Rethinking Generalization
☆276Updated last year
cvs-health / langfair
LangFair is a Python library for conducting use-case level LLM bias and fairness assessments
☆219Updated this week
phunterlau / paper_without_code
LLM reads a paper and produce a working prototype
☆58Updated 3 months ago
Shekswess / synthgenai
SynthGenAI - Package for Generating Synthetic Datasets using LLMs.
☆37Updated 5 months ago
flowaicom / flow-judge
Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…
☆73Updated 8 months ago
alopatenko / LLMEvaluation
A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use…
☆123Updated last week
microsoft / eureka-ml-insights
A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings.
☆163Updated this week
sujitpal / llm-rag-eval
Large Language Model (LLM) powered evaluator for Retrieval Augmented Generation (RAG) pipelines.
☆29Updated last year
ALucek / GRPO-Training
An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning
☆34Updated last month
orionw / promptriever
The first dense retrieval model that can be prompted like an LM
☆81Updated 2 months ago
rungalileo / hallucination-index
Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate.
☆111Updated 10 months ago
spcl / knowledge-graph-of-thoughts
Official Implementation of "Affordable AI Assistants with Knowledge Graph of Thoughts"
☆131Updated 3 weeks ago
LitLLM / litllms-for-literature-review-tmlr
Code for LitLLMs, LLMs for Literature Review: Are we there yet? (TMLR 2025)
☆33Updated 2 months ago
stephenleo / llm-structured-output-benchmarks
Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…
☆173Updated 9 months ago
writer / writing-in-the-margins
☆118Updated 10 months ago
argilla-io / argilla-cookbook
Simple examples using Argilla tools to build AI
☆53Updated 7 months ago