Life

Introduction

The Life task evaluates a model's ability to produce a coherent, step-by-step visual guide for daily life skills and household tasks.

Example

How to tie shoelaces with a bow knot? Show each step both visually and textually.

Ground truth image: [reference (ground truth) image]

Answer image: [model-generated image]