My research builds interactive AI by integrating high-level common sense (functionality, affordance, physics, causality) with raw sensory inputs (pixels and haptic signals), enabling richer representations of, and abstract reasoning about, objects, scenes, shapes, numbers, and agents.