Paper

closed

Introduction

This task assesses whether a unified model can accurately explain complex concepts from cutting-edge computer science research in an accessible way. Given a user question about a particular method, the model must first understand the technical content and then provide a clear, coherent explanation.

Example

I'm studying Mixture of Experts (MoE). Can you explain the MoE layer architecture and how it works? Please answer with both visual and textual answers.

Ground truth image
reference (ground truth) image
Answer image
model generated image