Nkluge-correa / TeenyTinyLlamaLinks
A pair of tiny foundational models trained in Brazilian Portuguese.π¦π¦
β43Updated 6 months ago
Alternatives and similar repositories for TeenyTinyLlama
Users that are interested in TeenyTinyLlama are comparing it to the libraries listed below
Sorting:
- β48Updated last year
- The Natural Portuguese Language Benchmark (Napolab). Stay up to date with the latest advancements in Portuguese language models and theirβ¦β71Updated 5 months ago
- Pre-train Static Word Embeddingsβ94Updated 3 months ago
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, includingβ¦β68Updated 3 weeks ago
- Natively pre-trained open-source Portuguese language models.β79Updated 4 months ago
- β17Updated last week
- Extract-0: A Specialized Language Model for Document Informationβ128Updated 3 months ago
- Code for training and evaluating T5 on Portuguese data.β91Updated 3 years ago
- The evalution suite for the π Open Portuguese LLM Leaderboardβ26Updated 4 months ago
- pre-trained Language Modelsβ309Updated 7 months ago
- Efficient few-shot learning with cross-encoders.β60Updated last year
- Unofficial python bindings for the rust llm library. πβ€οΈπ¦β76Updated 2 years ago
- Finetuning Stanford Alpaca (LLaMA) with Brazilian Portuguese dataβ39Updated 2 years ago
- Using open source LLMs to build synthetic datasets for direct preference optimizationβ71Updated last year
- Code and data to evaluate LLMs on the ENEM, the main standardized Brazilian university admission exams.β50Updated last year
- β28Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)β75Updated last year
- Universal text classifier for generative modelsβ25Updated last year
- Fine-tune ModernBERT on a large Dataset with Custom Tokenizer Trainingβ74Updated 2 months ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.β111Updated last year
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languagesβ80Updated 3 years ago
- Micro Llama is a small Llama based model with 300M parameters trained from scratch with $500 budgetβ163Updated 4 months ago
- Online Inference API for NLP Transformer models - summarization, text classification, sentiment analysis and moreβ45Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Modelsβ115Updated 8 months ago
- HateBR is the first large-scale expert annotated dataset of Brazilian Instagram comments for hate speech and offensive language detectionβ¦β39Updated last month
- An all-new Language Model That Processes Ultra-Long Sequences of 100,000+ Ultra-Fastβ150Updated last year
- a unified framework for leveraging LLMsβ77Updated 2 weeks ago
- Train your own small bitnet modelβ76Updated last year
- FastFit β‘ When LLMs are Unfit Use FastFit β‘ Fast and Effective Text Classification with Many Classesβ212Updated 3 months ago
- πΉοΈ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.β139Updated last year