Models Library
Discover amazing Large Language Models by the Gaia community!
All LLMs
0.5B|1.5B|7B|72B
Qwen2
Qwen2 is a new series of large language models from Alibaba Group. This is the base 0.5B parameter model.
34,928 Nodes
1.6B
StableLM2
Stable LM 2 1.6B is a state-of-the-art 1.6 billion parameter small language model trained on multilingual data in English, Spanish, German, Italian, French, Portuguese, and Dutch.
186 Nodes
8B|70B
Llama3
Meta released the Meta Llama 3 family of large language models (LLMs), including pretrained and instruction-tuned generative text models in 8B and 70B sizes. Optimized for dialogue, Llama 3 outperforms many open-source chat models on industry benchmarks, emphasizing helpfulness and safety.
487 Nodes
0.5B|1.8B|4B|7B|14B
Qwen1.5
Qwen1.5-0.5B is the beta version of Qwen2, a transformer-based decoder-only language model pretrained on a large amount of data. It is the smallest model in the Qwen1.5 series.
146,718 Nodes
3.8B|4B|14B
Phi3
Phi-3 models are highly capable and cost-effective small language models (SLMs), outperforming models of the same size and the next size up across a variety of language, reasoning, coding, and math benchmarks.
21,156 Nodes
6B|9B|34B
Yi-1.5
The Yi series models are large language models trained from scratch by developers at 01.AI. This is the base 9B parameter model.
25 Nodes
8B
Llama3.1
The Meta Llama 3.1 collection of multilingual large language models (LLMs) comprises pretrained and instruction-tuned generative models in 8B, 70B, and 405B sizes (text in/text out).
78 Nodes
9B|27B
Gemma2
Gemma 2 is the latest family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models.
139 Nodes
22B
Codestral
Codestral-22B-v0.1 is trained on a diverse dataset of 80+ programming languages, including the most popular ones, such as Python, Java, C, C++, JavaScript, and Bash.
32 Nodes
7B
CodeGemma
CodeGemma-7B-it is a lightweight open code model built on top of Gemma that specializes in code completion and code generation tasks.
11 Nodes
3B
Llama 3.2
The Llama 3.2 3B models support a context length of 128K tokens and are state-of-the-art in their class for on-device use cases such as summarization, instruction following, and rewriting, running locally at the edge.
4 Nodes