Hasty Briefsbeta

Bilingual

Mirrors: The Blind Spot of Image and Video Generation Models

a year ago
  • #Reflections
  • #Image Generation
  • #AI
  • Recent advances in image generation models struggle with accurately rendering reflections in mirrors.
  • Five image generation models (Gemini, Adobe Firefly, Bing, Ideogram, Freepik) and four video generation models (veed.io, pollo.ai, ltx.studio, vidnoz.com) were evaluated.
  • Common issues include distorted, inconsistent, or missing reflections, particularly in human and object scenarios.
  • Gemini and Ideogram show recurring reflection errors, while Adobe Firefly and Bing exhibit severe misalignments.
  • Video models also fail in motion reflections, degrading realism.
  • Solutions proposed: improved architectures, enhanced training data, physics-based rendering, and explicit reflection modeling.
  • Reflection challenges highlight gaps in 3D scene understanding, affecting applications like medical imaging and autonomous systems.