All Pages (59 articles)
GPT-4o Search Preview
GPT-4o Search Preview is an integrated real-time search functionality within OpenAI's multimodal GPT-4o model, designed to provide up-to-date information with direct source attribution.
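As a concrete illustration, the sketch below shows how a search-enabled chat model can be called through OpenAI's Python SDK. The model name and the `web_search_options` field follow OpenAI's published search-preview interface, but treat the exact request shape and annotation format as assumptions to verify against the current API docs.

```python
# Minimal sketch: calling a search-enabled chat model via the OpenAI SDK.
# Assumes the `openai` package and an OPENAI_API_KEY in the environment;
# web_search_options follows OpenAI's documented search-preview interface.
from openai import OpenAI

client = OpenAI()

response = client.chat.completions.create(
    model="gpt-4o-search-preview",
    web_search_options={},  # enable built-in web search with default settings
    messages=[{"role": "user", "content": "What changed in the latest CPython release?"}],
)

message = response.choices[0].message
print(message.content)

# Source attribution is returned as URL citations attached to the message.
for annotation in getattr(message, "annotations", None) or []:
    print(annotation)
```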
Qwen 3 Next 80B Instruct
Qwen 3 Next 80B Instruct is a high-efficiency large language model featuring an ultra-sparse Mixture-of-Experts (MoE) architecture and hybrid attention. It activates only 3 billion of its 80 billion parameters per inference step, enabling significant throughput advantages on long-context tasks across its native context window of up to 256,000 tokens.
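The sparse-activation idea is easiest to see in code. The toy routing function below is an illustrative sketch of generic top-k MoE dispatch, not Qwen's implementation; the dimensions and expert count are arbitrary.

```python
# Illustrative sketch of top-k Mixture-of-Experts routing (not Qwen's code):
# a router scores all experts per token, but only the k highest-scoring
# experts run, so most of the model's weights stay idle on any given step.
import numpy as np

rng = np.random.default_rng(0)
d_model, n_experts, top_k = 64, 16, 2

experts = [rng.standard_normal((d_model, d_model)) for _ in range(n_experts)]
router = rng.standard_normal((d_model, n_experts))

def moe_forward(x: np.ndarray) -> np.ndarray:
    scores = x @ router                      # one routing score per expert
    chosen = np.argsort(scores)[-top_k:]     # route to the top-k experts only
    weights = np.exp(scores[chosen])
    weights /= weights.sum()                 # softmax gate over chosen experts
    # Only k of n_experts matrices are multiplied; the rest are never touched.
    return sum(w * (x @ experts[i]) for w, i in zip(weights, chosen))

out = moe_forward(rng.standard_normal(d_model))
print(out.shape, f"active experts: {top_k}/{n_experts}")
```

Because only `top_k` of the expert matrices participate in each forward pass, compute per token scales with the active parameters rather than the 80-billion-parameter total.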
Mistral
Mistral AI is a French artificial intelligence company that develops high-performance generative AI and large language models, advocating for European technological sovereignty and decentralized AI development.
Qwen 3 Next 80B Thinking
Qwen 3 Next 80B Thinking is a specialized reasoning large language model developed by Alibaba Cloud, featuring a high-sparsity Mixture of Experts architecture and an internal chain-of-thought mechanism. It is designed for complex tasks in mathematics, logic, and software engineering, operating with 80 billion total parameters but only 3 billion active per token.
Moonshot AI
Moonshot AI is a prominent Beijing-based artificial intelligence startup specializing in large language models (LLMs) and multimodal systems, known for its flagship Kimi chatbot and long-context window technology. Established in 2023, it is recognized as one of China's four new 'AI tigers' and has achieved significant market valuation through rapid technological scaling and strategic investment.
QwQ 32B
QwQ 32B is a 32-billion-parameter reasoning large language model developed by Alibaba Cloud’s Qwen team, utilizing test-time compute and reinforcement learning to excel in complex mathematical and programming tasks.
Phi-4 Reasoning Plus
Phi-4 Reasoning Plus is a 14-billion parameter open-weight reasoning model developed by Microsoft, fine-tuned from Phi-4 with supervised training on chain-of-thought data and reinforcement learning. Part of the Phi family of small language models, it targets competition-level mathematics, science, and coding tasks while remaining efficient enough to run on modest hardware.
R1-1776
R1-1776 is a reasoning-focused large language model released by Perplexity in 2025, designed as a modified, uncensored version of DeepSeek-R1. It employs targeted post-training to eliminate geopolitical restrictions while maintaining high performance in logical reasoning and mathematics.
R1 Distill Llama 70B
DeepSeek-R1-Distill-Llama-70B is a 70.6-billion parameter open-weights reasoning model developed by distilling the capabilities of the larger DeepSeek-R1 into a Llama-3.3-70B foundation. It is specifically optimized for high-level mathematical reasoning, coding, and logical tasks using an explicit chain-of-thought processing mechanism.
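For readers unfamiliar with distillation, the classic logit-matching objective is sketched below as a worked example. One hedge: DeepSeek reports distilling R1 by fine-tuning smaller models on teacher-generated reasoning traces (sequence-level distillation), so this KL formulation illustrates the general family of techniques rather than the exact recipe.

```python
# Worked example of the classic knowledge-distillation objective: the student
# is trained to match the teacher's softened output distribution via KL
# divergence. Logits here are toy values for a 3-token vocabulary.
import numpy as np

def softmax(z: np.ndarray, T: float = 1.0) -> np.ndarray:
    z = z / T
    z = z - z.max()          # numerical stability
    e = np.exp(z)
    return e / e.sum()

teacher_logits = np.array([2.0, 0.5, -1.0])
student_logits = np.array([1.5, 0.0, -0.5])

T = 2.0  # temperature softens both distributions, exposing "dark knowledge"
p, q = softmax(teacher_logits, T), softmax(student_logits, T)
kl = np.sum(p * (np.log(p) - np.log(q)))  # the loss the student minimizes
print(f"KL(teacher || student) = {kl:.4f}")
```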
V3
DeepSeek-V3 is a 671-billion parameter Mixture-of-Experts (MoE) large language model developed by DeepSeek-AI and released in December 2024. It is designed for high-efficiency training and inference, achieving competitive performance with proprietary frontier models in technical domains like coding and mathematics.
Llama 4 Scout
Llama 4 Scout is a high-efficiency multimodal large language model released by Meta AI in April 2025, utilizing a mixture-of-experts (MoE) architecture with 109 billion total parameters. It is distinguished by its massive 10-million-token context window and its ability to natively process and integrate text and image inputs.
V3.1
V3.1 is an open-weights large language model developed by DeepSeek that integrates general-purpose conversational capabilities with advanced reasoning features. It utilizes a 671-billion parameter Mixture-of-Experts architecture and features a hybrid thinking mode allowing for chain-of-thought processing.
V3.2 Exp
V3.2 Exp is an experimental large language model developed by DeepSeek that introduced the DeepSeek Sparse Attention (DSA) mechanism to optimize long-context processing. Released in September 2025, it utilizes a Mixture-of-Experts architecture and serves as a hybrid model for both general-purpose instruction and complex reasoning.
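The sketch below shows generic top-k sparse attention, the broad family DSA belongs to; DeepSeek's actual selection mechanism differs, so treat this purely as an illustration of why sparsity cuts long-context cost.

```python
# Generic top-k sparse attention sketch (the general idea behind mechanisms
# like DSA, not DeepSeek's implementation): each query attends to only its
# k highest-scoring keys instead of the full sequence.
import numpy as np

rng = np.random.default_rng(1)
seq_len, d_head, k = 128, 32, 8

q = rng.standard_normal((seq_len, d_head))
kmat = rng.standard_normal((seq_len, d_head))
v = rng.standard_normal((seq_len, d_head))

scores = q @ kmat.T / np.sqrt(d_head)            # (seq_len, seq_len) logits
topk_idx = np.argsort(scores, axis=-1)[:, -k:]   # keep k best keys per query
mask = np.full_like(scores, -np.inf)
np.put_along_axis(mask, topk_idx, 0.0, axis=-1)  # 0 where kept, -inf elsewhere
probs = np.exp(scores + mask)
probs /= probs.sum(axis=-1, keepdims=True)       # softmax over surviving keys
out = probs @ v
print(out.shape)
```

After masking, each query's softmax runs over only k surviving keys, so attention cost grows with k rather than with the full sequence length.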
Perplexity
Perplexity is an artificial intelligence organization founded in 2022 that develops AI-native search and research tools, primarily known for its conversational 'answer engine' that uses Retrieval-Augmented Generation to provide cited responses.
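Retrieval-Augmented Generation is straightforward to sketch. Everything below is illustrative: the two-document corpus, the naive word-overlap retriever, and the prompt format are stand-ins for a real search index and a real LLM call, but the structure (retrieve, number the sources, instruct the model to cite them) is the core of any citation-producing answer engine.

```python
# Minimal retrieval-augmented generation loop with inline citations.
def retrieve(query: str, corpus: dict[str, str], k: int = 2) -> list[tuple[str, str]]:
    """Rank documents by naive word overlap with the query."""
    q_words = set(query.lower().split())
    scored = sorted(
        corpus.items(),
        key=lambda item: len(q_words & set(item[1].lower().split())),
        reverse=True,
    )
    return scored[:k]

corpus = {
    "https://example.com/a": "MoE models activate a subset of experts per token.",
    "https://example.com/b": "Dense models apply every parameter to every token.",
}

query = "How do MoE models use their parameters?"
sources = retrieve(query, corpus)

# Build a prompt that forces the model to ground its answer in numbered
# sources, which is what lets the final response carry citations like [1].
context = "\n".join(f"[{i+1}] {text} ({url})" for i, (url, text) in enumerate(sources))
prompt = f"Answer using only the numbered sources and cite them:\n{context}\n\nQ: {query}"
print(prompt)  # this prompt would then be sent to the generator model
```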
Phi-4 Multimodal
Phi-4 Multimodal is a 5.6-billion parameter open-weight multimodal model developed by Microsoft that integrates visual and audio perception with language understanding. Released in February 2025, it attaches modality-specific LoRA adapters to a frozen language backbone, enabling tasks such as speech recognition, document understanding, and chart reasoning on modest hardware.
Qwen 3 14B
Qwen 3 14B is a dense 14.8 billion parameter large language model developed by Alibaba Cloud, featuring a hybrid reasoning engine that allows toggling between thinking and non-thinking modes. Released in April 2025 under an Apache 2.0 license, it achieves performance parity with much larger previous-generation models in STEM, coding, and logical reasoning tasks.
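The thinking toggle is exposed through the model's chat template. The sketch below follows the `enable_thinking` switch documented for Qwen3 models on Hugging Face; the repository name and exact flag behavior are assumptions to check against the model card.

```python
# Sketch of toggling Qwen 3's hybrid reasoning via the chat template.
# Assumes the `transformers` package and the Qwen/Qwen3-14B Hub repo.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Qwen/Qwen3-14B"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

messages = [{"role": "user", "content": "Prove that the sum of two odd numbers is even."}]

# enable_thinking=True lets the model emit an internal chain of thought
# before its final answer; False forces a direct reply.
inputs = tokenizer.apply_chat_template(
    messages,
    add_generation_prompt=True,
    enable_thinking=True,
    return_tensors="pt",
).to(model.device)

output = model.generate(inputs, max_new_tokens=512)
print(tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True))
```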
Qwen 3 30B A3B
Qwen 3 30B A3B is a 30-billion parameter large language model developed by Alibaba Cloud, utilizing a sparse Mixture-of-Experts (MoE) architecture that activates 3 billion parameters per token for high efficiency. It is designed to provide mid-sized model reasoning capabilities with the speed and cost profile of a much smaller system, supporting a context window of up to 262,144 tokens.
R1-0528 Turbo
R1-0528 Turbo is a high-efficiency large language model developed by DeepSeek AI, optimized for throughput and complex reasoning using a Mixture-of-Experts (MoE) architecture. It is designed to provide advanced logic and coding capabilities with significantly reduced computational overhead and API costs.
