Mirrors: The Blind Spot of Image and Video Generation Models
a year ago
- #Reflections
- #Image Generation
- #AI
- Despite recent advances, image generation models still struggle to render reflections in mirrors accurately.
- Five image generation models (Gemini, Adobe Firefly, Bing, Ideogram, Freepik) and four video generation models (veed.io, pollo.ai, ltx.studio, vidnoz.com) were evaluated.
- Common issues include distorted, inconsistent, or missing reflections, particularly in scenes containing people and objects.
- Gemini and Ideogram show recurring reflection errors, while Adobe Firefly and Bing exhibit severe misalignments.
- Video models also fail to render reflections of moving subjects correctly, which degrades realism.
- Proposed solutions include improved architectures, enhanced training data, physics-based rendering, and explicit reflection modeling.
- These reflection failures highlight gaps in 3D scene understanding, which affects applications such as medical imaging and autonomous systems.
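
To make "explicit reflection modeling" concrete, here is a minimal sketch of the geometry a flat mirror imposes: every scene point appears in the mirror at its position reflected across the mirror plane. This is not taken from any of the evaluated models. The function name `reflect_point` and its parameters are hypothetical and chosen only for illustration.

```python
from typing import Tuple

Vec3 = Tuple[float, float, float]


def reflect_point(point: Vec3, plane_point: Vec3, plane_normal: Vec3) -> Vec3:
    """Mirror `point` across the plane through `plane_point` with normal `plane_normal`.

    Hypothetical helper for illustration only; assumes an ideal flat mirror.
    """
    # Normalize the normal so the signed distance below is in scene units.
    length = sum(c * c for c in plane_normal) ** 0.5
    if length == 0.0:
        raise ValueError("plane_normal must be non-zero")
    n = tuple(c / length for c in plane_normal)

    # Signed distance from the point to the mirror plane.
    d = sum((p - q) * c for p, q, c in zip(point, plane_point, n))

    # Move the point twice that distance back through the plane.
    return tuple(p - 2.0 * d * c for p, c in zip(point, n))


# A point 3 units in front of a mirror lying in the z = 0 plane.
mirrored = reflect_point((1.0, 2.0, 3.0), (0.0, 0.0, 0.0), (0.0, 0.0, 1.0))
print(mirrored)
```

A consistency check along these lines could, in principle, compare where an object appears in a generated mirror against where this rule says it should appear, although doing so would first require estimating 3D positions and the mirror plane from the image.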