Our robot will learn "affordances", sort of.
It will infer an object from its appearance, then learn how that appearance predicts the object's response to actions directed at it.
What it will learn is a mapping from object and action to consequence. As a final demo, it will play "golf" with the object, trying to move it to a target location. Hopefully, after learning, it will do this better than chance.
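To make the idea of a mapping from object and action to consequence concrete, here is a minimal sketch in Python. Everything in it is an assumption for illustration: the object labels ("ball", "box"), the four poke directions, the toy `simulate_poke` world, and the `AffordanceModel` class are hypothetical stand-ins, not the robot's actual perception, actions, or learning method. The model simply averages the displacement seen for each (object, action) pair, and "golf" picks the action whose predicted outcome lands closest to the target.

```python
import math
import random
from collections import defaultdict

# Hypothetical discrete action set: poke directions in degrees.
ACTIONS = [0, 90, 180, 270]


def simulate_poke(obj, action, rng):
    """Toy stand-in for the real world (an assumption, not real physics).

    A "ball" rolls far in the poke direction; a "box" barely slides.
    Returns the (dx, dy) displacement of the object.
    """
    distance = {"ball": 1.0, "box": 0.2}[obj] + rng.gauss(0.0, 0.02)
    rad = math.radians(action)
    return (distance * math.cos(rad), distance * math.sin(rad))


class AffordanceModel:
    """Maps (object, action) to the mean displacement observed so far."""

    def __init__(self):
        self.sums = defaultdict(lambda: [0.0, 0.0])
        self.counts = defaultdict(int)

    def update(self, obj, action, displacement):
        # Accumulate one observed consequence for this (object, action) pair.
        key = (obj, action)
        self.sums[key][0] += displacement[0]
        self.sums[key][1] += displacement[1]
        self.counts[key] += 1

    def predict(self, obj, action):
        # With no experience yet, predict that nothing moves.
        key = (obj, action)
        n = self.counts[key]
        if n == 0:
            return (0.0, 0.0)
        sx, sy = self.sums[key]
        return (sx / n, sy / n)

    def best_action(self, obj, position, target):
        # "Golf": choose the action predicted to end nearest the target.
        def distance_after(action):
            dx, dy = self.predict(obj, action)
            return math.hypot(position[0] + dx - target[0],
                              position[1] + dy - target[1])
        return min(ACTIONS, key=distance_after)


def train(model, rng, trials=200):
    # Explore by poking random objects in random directions.
    for _ in range(trials):
        obj = rng.choice(["ball", "box"])
        action = rng.choice(ACTIONS)
        model.update(obj, action, simulate_poke(obj, action, rng))


rng = random.Random(0)
model = AffordanceModel()
train(model, rng)
chosen = model.best_action("ball", position=(0.0, 0.0), target=(0.0, 1.0))
```

In this toy setup, an untrained model predicts no movement for every action, so its choice is arbitrary. A trained one should prefer the poke direction that points at the target. A real system would replace the object labels with appearance features and the lookup table with a learned predictor, but the shape of the mapping stays the same.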