taylorai / galactic
data cleaning and curation for unstructured text
★329 · Updated 9 months ago
Alternatives and similar repositories for galactic:
Users who are interested in galactic are comparing it to the libraries listed below.
- Synthetic Data for LLM Fine-Tuning ★115 · Updated last year
- Datasets and models for instruction-tuning ★238 · Updated last year
- Generate textbook-quality synthetic LLM pretraining data ★498 · Updated last year
- Fast & more realistic evaluation of chat language models. Includes leaderboard. ★186 · Updated last year
- build cognitive systems, pythonic ★336 · Updated 5 months ago
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples. ★421 · Updated last year
- Use the OpenAI Batch tool to make async batch requests to the OpenAI API. ★98 · Updated last year
- Reference-Free automatic summarization evaluation with potential hallucination detection ★100 · Updated last year
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI ★223 · Updated last year
- Neural Search ★355 · Updated last month
- ★195 · Updated last year
- ★199 · Updated last year
- A comprehensive deep dive into the world of tokens ★222 · Updated 10 months ago
- Domain Adapted Language Modeling Toolkit - E2E RAG ★320 · Updated 5 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto… ★235 · Updated 2 months ago
- Python tools for easily translating your blog content to podcasts & YouTube ★206 · Updated 8 months ago
- An implementation of Self-Extend, to expand the context window via grouped attention ★119 · Updated last year
- AgentSearch is a framework for powering search agents and enabling customizable local search. ★483 · Updated last year
- Automatically evaluate your LLMs in Google Colab ★620 · Updated 11 months ago
- ★153 · Updated 9 months ago
- Notebooks for training universal 0-shot classifiers on many different tasks ★124 · Updated 4 months ago
- awesome synthetic (text) datasets ★278 · Updated 6 months ago
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes ★199 · Updated last week
- ★219 · Updated last year
- Create repos and commits with AI. ★293 · Updated last year
- Small finetuned LLMs for a diverse set of useful tasks ★126 · Updated last year
- Fine-Tuning Embedding for RAG with Synthetic Data ★494 · Updated last year
- Tuning and Evaluation of RAG pipeline. (Automated optimization to be added soon) ★263 · Updated last year
- Solving data for LLMs - Create quality synthetic datasets! ★146 · Updated 3 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters ★254 · Updated 9 months ago