Back to prompts
Benchmark comparison heatmap

Example images

Benchmark comparison heatmap 1
Charts & InfographicswuyoscarGPT-Image2-Skillcharts-infographics图表信息图

Benchmark comparison heatmap

Landscape 16:9 heatmap matrix of models × benchmarks. Columns (rotated 45°): "MMLU", "HumanEval", "GSM8K", "MATH", "BBH", "ARC-C", "HellaSwag", "TruthfulQA". Rows (right-aligned s

Category
Charts & Infographics
Model
GPT Image 2
Creator
wuyoscar
Source language
en
Views0
Source ID
084
Use in StudioOpen source

Full prompt

Landscape 16:9 heatmap matrix of models × benchmarks.

Columns (rotated 45°): "MMLU", "HumanEval", "GSM8K", "MATH", "BBH", "ARC-C", "HellaSwag", "TruthfulQA".
Rows (right-aligned sans-serif): "GPT-4o", "Claude 4.7 Opus", "Gemini 3 Pro", "Llama 4 405B", "Qwen3-Next", "DeepSeek-V3.1", "Mistral-3 Large", "Yi-3 34B", "Phi-4 14B", "OLMo-2 7B".

Each cell filled with dusty-teal gradient proportional to score; numeric value in each cell (e.g. "72.3", "88.1"). Best score per column outlined in 1.5px soft-terracotta.

Vertical color bar on the right with ticks "0", "25", "50", "75", "100" and label "accuracy (%)".

Title: "Benchmark comparison across 10 frontier LLMs". Subtitle: "zero-shot accuracy; best per benchmark outlined in bold. Evaluated March 2026."
Translations

Benchmark comparison heatmap

en

Landscape 16:9 heatmap matrix of models × benchmarks. Columns (rotated 45°): "MMLU", "HumanEval", "GSM8K", "MATH", "BBH", "ARC-C", "HellaSwag", "TruthfulQA". Rows (right-aligned sans-serif): "GPT-4o", "Claude 4.7 Opus", "Gemini 3 Pro", "Llama 4 405B", "Qwen3-Next", "DeepSeek-V3.1", "Mistral-3 Large", "Yi-3 34B", "Phi-4 14B", "OLMo-2 7B". Each cell filled with dusty-teal gradient proportional to score; numeric value in each cell (e.g. "72.3", "88.1"). Best score per column outlined in 1.5px soft-terracotta. Vertical color bar on the right with ticks "0", "25", "50", "75", "100" and label "accuracy (%)". Title: "Benchmark comparison across 10 frontier LLMs". Subtitle: "zero-shot accuracy; best per benchmark outlined in bold. Evaluated March 2026."

Prompt/Image Similar

12

Multi-head attention heatmaps

Multi-head attention heatmaps

Landscape 16:9 figure of 4 attention heatmaps (2×2 grid), shared 12-token input. Token labels across X and Y (rotated 45° on X): "The", "quick", "brown", "fox", "jumped", "over",

Charts & InfographicswuyoscarGPT-Image2-Skill
GPT Image 20 Views
Manga Art Style Comparison

Manga Art Style Comparison

Creates a side-by-side classroom manga comparison showing a generic anime comic transformed into a distinctive gag-manga style.

Charts & InfographicsYouMindcharts-infographics
GPT Image 20 Views
Denoising diffusion forward/reverse chain

Denoising diffusion forward/reverse chain

Landscape 16:9 academic figure of diffusion forward + reverse chains, two horizontal chains stacked vertically. TOP chain (left→right) labeled "Forward diffusion q(x_t | x_{t-1})"

Charts & InfographicswuyoscarGPT-Image2-Skill
GPT Image 20 Views
ReAct reasoning trace

ReAct reasoning trace

Landscape 16:9 figure of a ReAct trace on a factual-QA task, vertical sequence of 7 alternating blocks. Top header: "Task — user asks: 'What year did the scientist who proved the

Charts & InfographicswuyoscarGPT-Image2-Skill
GPT Image 20 Views
Single-cell immune atlas reveals treatment-response states

Single-cell immune atlas reveals treatment-response states

Create a polished Nature / Cell style biomedical research figure, landscape 3:2 (1536×1024), soft minimal palette, publication-ready. Figure title: "Single-cell immune atlas revea

Charts & InfographicswuyoscarGPT-Image2-Skill
GPT Image 20 Views
Frontier Safety Eval Loop

Frontier Safety Eval Loop

Create a beautiful research flowchart for an AI safety benchmark pipeline called Frontier Safety Eval Loop. Landscape figure, white background, large typography, vector-like shapes

Charts & InfographicswuyoscarGPT-Image2-Skill
GPT Image 20 Views
Transformer encoder–decoder architecture

Transformer encoder–decoder architecture

Landscape 16:9 academic concept figure of the Transformer encoder-decoder architecture, NeurIPS camera-ready style. Two vertical column stacks side-by-side with a dashed divider.

Charts & InfographicswuyoscarGPT-Image2-Skill
GPT Image 20 Views
Small Multiples Climate Grid

Small Multiples Climate Grid

Produce a clean editorial data visualization poster showing a 4x3 small-multiples grid of monthly climate charts for 12 fictional cities. Use a white background, generous margins,

Charts & InfographicswuyoscarGPT-Image2-Skill
GPT Image 20 Views
Periodic Table Spectral Variant

Periodic Table Spectral Variant

Design a distinctive periodic table poster variant where each element tile is colored by fictional emission-spectrum families while preserving clean scientific layout. Use a dark n

Charts & InfographicswuyoscarGPT-Image2-Skill
GPT Image 20 Views
Patient cohort and multimodal biomarker workflow

Patient cohort and multimodal biomarker workflow

Create a Nature Medicine / Science Translational Medicine style research paper figure, landscape 3:2 (1536×1024), soft literature-science palette, minimal and elegant. Figure titl

Charts & InfographicswuyoscarGPT-Image2-Skill
GPT Image 20 Views
Minimalist bakery logo — Field & Flour

Minimalist bakery logo — Field & Flour

Create an original, non-infringing logo for a company called Field & Flour, a local bakery. The logo should feel warm, simple, and timeless. Use clean, vector-like shapes, a strong

Charts & InfographicswuyoscarGPT-Image2-Skill
GPT Image 20 Views
Rocket Cutaway Diagram

Rocket Cutaway Diagram

Generate a highly detailed vertical cutaway illustration of a fictional two-stage launch vehicle named Aster-9 on a clean white technical background. Show the full rocket from nose

Charts & InfographicswuyoscarGPT-Image2-Skill
GPT Image 20 Views