taylorai / galactic
data cleaning and curation for unstructured text
β329Updated 7 months ago
Alternatives and similar repositories for galactic:
Users that are interested in galactic are comparing it to the libraries listed below
- π Datasets and models for instruction-tuningβ234Updated last year
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.β410Updated last year
- π Reference-Free automatic summarization evaluation with potential hallucination detectionβ100Updated last year
- Domain Adapted Language Modeling Toolkit - E2E RAGβ316Updated 4 months ago
- β195Updated 10 months ago
- Neural Searchβ351Updated this week
- Generate textbook-quality synthetic LLM pretraining dataβ498Updated last year
- βοΈ build cognitive systems, pythonicβ331Updated 3 months ago
- Synthetic Data for LLM Fine-Tuningβ111Updated last year
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vectoβ¦β227Updated 3 weeks ago
- A comprehensive deep dive into the world of tokensβ222Updated 8 months ago
- Fast & more realistic evaluation of chat language models. Includes leaderboard.β184Updated last year
- β199Updated last year
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAIβ224Updated 10 months ago
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busineβ¦β148Updated 5 months ago
- Logging and caching superpowers for the openai sdkβ102Updated 11 months ago
- Prompt programming with FMs.β440Updated 7 months ago
- Convert all of libgen to high quality markdownβ248Updated last year
- batched lorasβ338Updated last year
- β447Updated last year
- Use the OpenAI Batch tool to make async batch requests to the OpenAI API.β95Updated last year
- FastFit β‘ When LLMs are Unfit Use FastFit β‘ Fast and Effective Text Classification with Many Classesβ189Updated 5 months ago
- A collection of LLM services you can self host via docker or modal labs to support your applications developmentβ186Updated 10 months ago
- A bagel, with everything.β317Updated 11 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracyβ99Updated 11 months ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platformβ83Updated last month
- AgentSearch is a framework for powering search agents and enabling customizable local search.β475Updated 10 months ago
- β149Updated 3 months ago
- Late Interaction Models Training & Retrievalβ254Updated this week
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, impβ¦β171Updated 6 months ago