Conventional VLMs excel at recognizing objects and detecting anomalies, but cities need more, they need to understand why an event is happening and what might happen next. This is the gap that a ...