taylorai / galacticLinks
data cleaning and curation for unstructured text
β328Updated last year
Alternatives and similar repositories for galactic
Users that are interested in galactic are comparing it to the libraries listed below
Sorting:
- Domain Adapted Language Modeling Toolkit - E2E RAGβ327Updated 9 months ago
- π Reference-Free automatic summarization evaluation with potential hallucination detectionβ103Updated last year
- π Datasets and models for instruction-tuningβ238Updated last year
- Synthetic Data for LLM Fine-Tuningβ120Updated last year
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAIβ222Updated last year
- β210Updated last month
- Generate textbook-quality synthetic LLM pretraining dataβ503Updated last year
- Fast & more realistic evaluation of chat language models. Includes leaderboard.β188Updated last year
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.β434Updated last year
- Use the OpenAI Batch tool to make async batch requests to the OpenAI API.β99Updated last year
- Efficient vector database for hundred millions of embeddings.β207Updated last year
- Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning)β398Updated last year
- β196Updated last year
- Small finetuned LLMs for a diverse set of useful tasksβ128Updated 2 years ago
- βοΈ build cognitive systems, pythonicβ339Updated 9 months ago
- This repo contains data and code for the paper "Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Daβ¦β489Updated last year
- β199Updated last year
- This is the reproduction repository for my π€ Hugging Face blog post on synthetic dataβ68Updated last year
- awesome synthetic (text) datasetsβ293Updated last month
- Doing simple retrieval from LLM models at various context lengths to measure accuracyβ102Updated last year
- Neural Searchβ363Updated 5 months ago
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busineβ¦β151Updated 10 months ago
- RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo rankerβ114Updated last month
- an implementation of Self-Extend, to expand the context window via grouped attentionβ119Updated last year
- β155Updated 8 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vectoβ¦β243Updated 6 months ago
- β222Updated last year
- A comprehensive deep dive into the world of tokensβ226Updated last year
- A pythonic library providing light-weighted interface with LLMsβ128Updated 3 months ago
- A collection of LLM services you can self host via docker or modal labs to support your applications developmentβ193Updated last year