SpellcraftAI / oaib
Use the OpenAI Batch tool to make async batch requests to the OpenAI API.
☆90Updated 6 months ago
Related projects: ⓘ
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆93Updated 5 months ago
- A strongly typed Python DSL for developing message passing multi agent systems☆50Updated 5 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆192Updated 4 months ago
- A simple Python sandbox for helpful LLM data agents☆143Updated 3 months ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆223Updated 4 months ago
- ☆172Updated 4 months ago
- Aidan Bench attempts to measure <big_model_smell> in LLMs.☆64Updated last week
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.☆45Updated last week
- Just a bunch of benchmark logs for different LLMs☆112Updated last month
- Synthetic Data for LLM Fine-Tuning☆78Updated 9 months ago
- Sphynx Hallucination Induction☆44Updated last month
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆260Updated last month
- Logging and caching superpowers for the openai sdk☆98Updated 6 months ago
- ☆48Updated 11 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆117Updated 8 months ago
- GPT-based Conversation Summarizer☆144Updated last year
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆99Updated 8 months ago
- This package, developed as part of our research detailed in the Chroma Technical Report, provides tools for text chunking and evaluation.…☆91Updated 2 months ago
- Turn a Github Repo's contents into a big prompt for long-context models like Claude 3 Opus.☆143Updated 5 months ago
- ⛓️ build cognitive systems, pythonic☆321Updated 2 months ago
- they've simulated websites, worlds, and imaginary CLIs... but what if they simulated *you*?☆71Updated 3 weeks ago
- Attribute (or cite) statements generated by LLMs back to in-context information.☆107Updated 2 weeks ago
- ☆29Updated 5 months ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆81Updated last year
- Track the progress of LLM context utilisation☆53Updated 2 months ago
- Functional Benchmarks and the Reasoning Gap☆74Updated last month
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆49Updated 3 weeks ago
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)☆72Updated last week
- Simple embedding -> text model trained on a small subset of Wikipedia sentences.☆152Updated last year
- Chat Markup Language conversation library☆53Updated 8 months ago