penfever/wildchat-50m

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/penfever/wildchat-50m)

penfever / wildchat-50m

Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.

☆39

Alternatives and similar repositories for wildchat-50m

Users that are interested in wildchat-50m are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

HCY123902 / atg-w-fg-rw
View on GitHub
☆10May 27, 2024Updated 2 years ago
allenai / gpv2-web10k
View on GitHub
Download Web-10K data by querying Bing Image Search
☆10Feb 1, 2022Updated 4 years ago
Ego4DSounds / Ego4DSounds
View on GitHub
Ego4DSounds: A diverse egocentric dataset with high action-audio correspondence
☆21Jun 14, 2024Updated 2 years ago
tangzhy / RealCritic
View on GitHub
☆15Jan 27, 2025Updated last year
InternLM / StarBench
View on GitHub
[ICLR 2026] An official implementation of "STAR-Bench: Probing Deep Spatio-Temporal Reasoning as Audio 4D Intelligence"
☆42Apr 19, 2026Updated 3 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
FreedomIntelligence / MTalk-Bench
View on GitHub
MTalk-Bench: Evaluating Speech-to-Speech Models in Multi-Turn Dialogues via Arena-style and Rubrics Protocols
☆20Nov 19, 2025Updated 8 months ago
dame-cell / Triformer
View on GitHub
Transformers components but in Triton
☆34May 9, 2025Updated last year
HypherX / Evolution-Analysis
View on GitHub
☆25Dec 13, 2024Updated last year
elicit / fave-dataset
View on GitHub
Paper dataset for "Factored Verification: Detecting and Reducing Hallucination in Summaries of Academic Papers"
☆14Oct 20, 2024Updated last year
OpenGVLab / TPO
View on GitHub
Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment
☆65Jul 22, 2025Updated last year
tianyi-lab / RuleR
View on GitHub
[NAACL'25] RuleR: Improving LLM Controllability by Rule-based Data Recycling
☆14Sep 27, 2025Updated 9 months ago
bothe / dialogue-act-recognition
View on GitHub
Context-based Dialogue Act Recognition using Recurrent Neural Networks
☆13Nov 13, 2021Updated 4 years ago
keeeeenw / TinyLlama
View on GitHub
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
☆14Mar 30, 2024Updated 2 years ago
Extrality / nvidia-dind
View on GitHub
docker:dind with NVIDIA GPU support via NVIDIA container toolkit
☆14Jul 1, 2026Updated 3 weeks ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
g-luo / vlm_cross_modal_reps
View on GitHub
Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025
☆34May 1, 2025Updated last year
frank-hutter / fanova
View on GitHub
Functional ANOVA
☆29Nov 17, 2014Updated 11 years ago
wyu-du / Controlled-Dialogue-Generation
View on GitHub
This repository contains the data and code for the paper "SideControl: Controlled Open-domain Dialogue Generation via Additive Side Netwo…
☆12Dec 1, 2021Updated 4 years ago
bethelmelesse / UnifiedCrawl
View on GitHub
☆17Nov 26, 2024Updated last year
john-hewitt / implicit-ins
View on GitHub
Codebase for Instruction Following without Instruction Tuning
☆36Sep 24, 2024Updated last year
SalesforceAIResearch / text2data
View on GitHub
☆22Jun 2, 2026Updated last month
AI-secure / PolyGuard
View on GitHub
☆23Jun 18, 2025Updated last year
mixedbread-ai / mxbai-rerank
View on GitHub
Crispy reranking models by Mixedbread
☆52Sep 17, 2025Updated 10 months ago
AbhilashaRavichander / HALoGEN
View on GitHub
Code for the paper "HALoGEN: Fantastic LLM Hallucinations and Where To Find Them"
☆25May 18, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
SalesforceAIResearch / GemFilter
View on GitHub
☆84Jun 2, 2026Updated last month
InternLM / Spark
View on GitHub
An official implementation of "SPARK: Synergistic Policy And Reward Co-Evolving Framework"
☆25Oct 23, 2025Updated 9 months ago
apple / ml-mmtoolsandbox
View on GitHub
MM-ToolSandBox: A Unified Framework for Evaluating Visual Tool-Calling Agents
☆19Jul 14, 2026Updated last week
severian42 / Proteus-The-Genesis-LLM
View on GitHub
Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine
☆25Dec 20, 2024Updated last year
tarik / pi-snm-qde
View on GitHub
Prediction Intervals: Split Normal Mixture from Quality-Driven Deep Ensembles. Published at Uncertainty in AI (UAI) 2020.
☆11Aug 31, 2020Updated 5 years ago
qagentur / texttunnel
View on GitHub
Python package for extractive NLP using the OpenAI API
☆17Aug 28, 2024Updated last year
jnwnlee / video-foley
View on GitHub
Official implementation of "Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound". IEEE TASLP 20…
☆19Feb 27, 2026Updated 4 months ago
sanand0 / uv-mega
View on GitHub
uv - MEGA. Make Environments Great Again (talk)
☆11Feb 22, 2025Updated last year
utter-project / fairseq
View on GitHub
This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.
☆21Nov 19, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
HowieHwong / Agentic-Guardian
View on GitHub
[ICLR'26] Building a Foundational Guardrail for General Agentic Systems via Synthetic Data
☆48Oct 26, 2025Updated 9 months ago
replicate / cog-safe-push
View on GitHub
Safely push a Cog model version by making sure it works and is backwards-compatible with previous versions.
☆17Dec 4, 2025Updated 7 months ago
GeorgeVern / lmcor
View on GitHub
Code for the EACL 2024 paper: "Small Language Models Improve Giants by Rewriting Their Outputs"
☆12Apr 20, 2024Updated 2 years ago
drarijitdas / Natural-GaLore
View on GitHub
An extention to the GaLore paper, to perform Natural Gradient Descent in low rank subspace
☆19Oct 21, 2024Updated last year
ytyz1307zzh / PLUG
View on GitHub
Code for the ACL 2024 paper "PLUG: Leveraging Pivot Language in Cross-Lingual Instruction Tuning"
☆13Aug 13, 2025Updated 11 months ago
niveck / LLMafia
View on GitHub
Asynchronous LLM Agent playing games of Mafia against human players
☆23Nov 12, 2025Updated 8 months ago
GLJS / AudioToolAgent
View on GitHub
GitHub repository for AudioToolAgent
☆20Feb 13, 2026Updated 5 months ago