logikon-ai / logikon
Analyzing and scoring reasoning traces of LLMs
☆37 · Updated 2 weeks ago
Related projects:
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper. ☆44 · Updated 3 months ago
- Sphynx Hallucination Induction ☆44 · Updated last month
- A re-implementation of Meta-Prompt in LangChain for building self-improving agents. ☆57 · Updated last year
- ☆38 · Updated this week
- End-to-end zero-shot entity and relation extraction ☆50 · Updated last month
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks. ☆26 · Updated 7 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆58 · Updated 2 weeks ago
- A set of utilities for running few-shot prompting experiments on large-language models ☆106 · Updated 10 months ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners" ☆68 · Updated last week
- Code interpreter support for o1 ☆24 · Updated last week
- An attribution library for LLMs ☆31 · Updated this week
- AI Evaluation Platform ☆39 · Updated 3 months ago
- A Ruby on Rails style framework for the DSPy (Demonstrate, Search, Predict) project for Language Models like GPT, BERT, and LLama. ☆101 · Updated this week
- A strongly typed Python DSL for developing message passing multi agent systems ☆50 · Updated 5 months ago
- Hosted embedding platform to discover, evaluate, and retrieve embeddings ☆72 · Updated 11 months ago
- Enhancing AI Software Engineering with Repository-level Code Graph ☆60 · Updated 3 weeks ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models. ☆99 · Updated 4 months ago
- AI search: your data + 10 lines of code. ☆73 · Updated last month
- This repo is for handling Question Answering, especially for Multi-hop Question Answering ☆59 · Updated 9 months ago
- LLM finetuning ☆41 · Updated last year
- LLM prompt language based on Jinja ☆52 · Updated 2 weeks ago
- 🐤 Canary provides UI primitives for building modern search-bar for docs with self-hostable infrastructure. ☆41 · Updated this week
- ☆34 · Updated 2 months ago
- Track the progress of LLM context utilisation ☆53 · Updated 2 months ago
- Simple Graph Memory for AI applications ☆76 · Updated last month
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine ☆31 · Updated 2 years ago
- Harness used to benchmark aider against SWE Bench benchmarks ☆44 · Updated 2 months ago
- A Python library to chunk/group your texts based on semantic similarity. ☆77 · Updated 2 months ago
- Functional Benchmarks and the Reasoning Gap ☆74 · Updated last month
- Convert a web page to markdown ☆50 · Updated 3 weeks ago