Nkluge-correa / TeenyTinyLlama
A pair of tiny foundational models trained in Brazilian Portuguese.
★39 · Updated last month
Alternatives and similar repositories for TeenyTinyLlama
Users who are interested in TeenyTinyLlama are comparing it to the libraries listed below.
- A Natural Portuguese Language Benchmark (Napolab) for the evaluation of language models. ★68 · Updated last week
- ★48 · Updated last year
- Finetuning Stanford Alpaca (LLaMA) with Brazilian Portuguese data ★39 · Updated 2 years ago
- Code and data to evaluate LLMs on the ENEM, the main standardized Brazilian university admission exam. ★47 · Updated 7 months ago
- The evaluation suite for the Open Portuguese LLM Leaderboard ★20 · Updated 3 months ago
- Natively pre-trained open-source Portuguese language models. ★66 · Updated last month
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets. ★77 · Updated 8 months ago
- Unofficial Python bindings for the Rust llm library. ★75 · Updated last year
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app… ★168 · Updated last year
- Pre-train Static Word Embeddings ★84 · Updated last month
- pre-trained Language Models ★305 · Updated 2 months ago
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, including… ★66 · Updated 2 weeks ago
- ★17 · Updated last year
- Completion After Prompt Probability. Make your LLM make a choice ★79 · Updated 8 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization ★65 · Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024) ★75 · Updated 8 months ago
- ★87 · Updated last year
- Efficient few-shot learning with cross-encoders. ★54 · Updated last year
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free ★232 · Updated 8 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ★108 · Updated 3 months ago
- nanogpt turned into a chat model ★69 · Updated last year
- ★48 · Updated 5 months ago
- Small and Efficient Mathematical Reasoning LLMs ★71 · Updated last year
- Train your own small bitnet model ★74 · Updated 8 months ago
- An all-new Language Model That Processes Ultra-Long Sequences of 100,000+ Ultra-Fast ★151 · Updated 10 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ★49 · Updated last year
- Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub ★162 · Updated last year
- an implementation of Self-Extend, to expand the context window via grouped attention ★119 · Updated last year
- Fine-tuning OpenLlama-Instruct with Portuguese data, for commercial use. ★20 · Updated last year
- Dataset collection and preprocessing framework for NLP extreme multitask learning ★184 · Updated last week