Nkluge-correa / TeenyTinyLlama
A pair of tiny foundational models trained in Brazilian Portuguese. 🦙🦙
★30, updated this week
Alternatives and similar repositories for TeenyTinyLlama:
Users who are interested in TeenyTinyLlama are comparing it to the repositories listed below:
- ★46, updated 11 months ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy. ★76, updated last year
- Explore the use of DSPy for extracting features from PDFs. ★37, updated 10 months ago
- Using short models to classify long texts. ★21, updated last year
- Unofficial python bindings for the rust llm library. ★73, updated last year
- Library to facilitate pruning of LLMs based on context. ★31, updated 11 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024). ★74, updated 3 months ago
- Repository for deepdoctection tutorial notebooks. ★40, updated last month
- ★34, updated last year
- A Natural Portuguese Language Benchmark (Napolab) for the evaluation of language models. ★65, updated 4 months ago
- Efficient few-shot learning with cross-encoders. ★42, updated 11 months ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions." ★63, updated last year
- Generate Structured JSON with probs from Language Models. ★16, updated 7 months ago
- 🤗 Disaggregators: Curated data labelers for in-depth analysis. ★65, updated last year
- A TextTiling-based algorithm for text segmentation (aka topic segmentation) that uses neural sentence encoders, as well as extractive sum… ★44, updated last year
- ★24, updated last year
- Generalist and Lightweight Model for Text Classification. ★58, updated 2 weeks ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing. ★63, updated 2 months ago
- ★20, updated 11 months ago
- Fine-tune Mistral 7B to generate fashion style suggestions. ★33, updated last year
- A spaCy wrapper for GliNER. ★101, updated 6 months ago
- Natively pre-trained open-source Portuguese language models. ★48, updated this week
- Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub. ★156, updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization. ★49, updated 10 months ago
- A Multilingual Dataset for Parsing Realistic Task-Oriented Dialogs. ★114, updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers. ★34, updated last month
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts. ★24, updated 10 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ★48, updated 6 months ago
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr… ★26, updated 3 weeks ago
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial). ★17, updated 9 months ago