taylorai / galacticLinks
data cleaning and curation for unstructured text
β328Updated last year
Alternatives and similar repositories for galactic
Users that are interested in galactic are comparing it to the libraries listed below
Sorting:
- Domain Adapted Language Modeling Toolkit - E2E RAGβ327Updated 10 months ago
- π Reference-Free automatic summarization evaluation with potential hallucination detectionβ103Updated last year
- π Datasets and models for instruction-tuningβ238Updated last year
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.β434Updated last year
- Synthetic Data for LLM Fine-Tuningβ120Updated last year
- Generate textbook-quality synthetic LLM pretraining dataβ505Updated last year
- β196Updated last year
- β210Updated 2 months ago
- Fast & more realistic evaluation of chat language models. Includes leaderboard.β188Updated last year
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAIβ222Updated last year
- βοΈ build cognitive systems, pythonicβ339Updated 9 months ago
- Use the OpenAI Batch tool to make async batch requests to the OpenAI API.β100Updated last year
- Logging and caching superpowers for the openai sdkβ104Updated last year
- β199Updated last year
- This repo contains data and code for the paper "Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Daβ¦β490Updated last year
- β461Updated last year
- β155Updated 9 months ago
- Neural Searchβ364Updated 6 months ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.β83Updated 2 years ago
- Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning)β399Updated last year
- Small finetuned LLMs for a diverse set of useful tasksβ128Updated 2 years ago
- Python tools for easily translating your blog content to podcasts & YouTubeβ206Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracyβ102Updated last year
- Efficient vector database for hundred millions of embeddings.β207Updated last year
- A pythonic library providing light-weighted interface with LLMsβ129Updated 3 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for freeβ232Updated 10 months ago
- Notebooks for training universal 0-shot classifiers on many different tasksβ136Updated 8 months ago
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on taskβ¦β177Updated 11 months ago
- an implementation of Self-Extend, to expand the context window via grouped attentionβ118Updated last year
- A collection of LLM services you can self host via docker or modal labs to support your applications developmentβ194Updated last year