Professional Documents
Culture Documents
Commonsense Reasoning
Commonsense Reasoning
presumptions about the type and essence of ordinary situations humans encounter every day. These
assumptions include judgments about the nature of physical objects, taxonomic properties, and
peoples' intentions. A device that exhibits commonsense reasoning might be capable of drawing
conclusions that are similar to humans' folk psychology (humans' innate ability to reason about
people's behavior and intentions) and naive physics (humans' natural understanding of the physical
world).[1]
A self-driving car system may use a neural network to determine which parts of the picture seem to
match previous training images of pedestrians, and then model those areas as slow-moving but
somewhat unpredictable rectangular prisms that must be avoided.
Compared with humans, existing AI lacks several features of human commonsense reasoning; most
notably, humans have powerful mechanisms for reasoning about "naïve physics" such as space,
time, and physical interactions. This enables even young children to easily make inferences like "If
I roll this pen off a table, it will fall on the floor". Humans also have a powerful mechanism of "folk
psychology" that helps them to interpret natural-language sentences such as "The city councilmen
refused the demonstrators a permit because they advocated violence". (A generic AI has difficulty
discerning whether the ones alleged to be advocating violence are the councilmen or the
demonstrators.)[1][8][9] This lack of "common knowledge" means that AI often makes different
mistakes than humans make, in ways that can seem incomprehensible. For example, existing self-
driving cars cannot reason about the location nor the intentions of pedestrians in the exact way that
humans do, and instead must use non-human modes of reasoning to avoid accidents.[10][11][12]
Overlapping subtopics of commonsense reasoning include quantities and measurements, time and
space, physics, minds, society, plans and goals, and actions and change.[13]
Computer vision
Issues of this kind arise in computer vision.[1][17] For instance when looking at a photograph of a
bathroom some items that are small and only partly seen, such as facecloths and bottles, are
recognizable due to the surrounding objects (toilet, wash basin, bathtub), which suggest the purpose
of the room. In an isolated image they would be difficult to identify. Movies prove to be even more
difficult tasks. Some movies contain scenes and moments that cannot be understood by simply
matching memorized templates to images. For instance, to understand the context of the movie, the
viewer is required to make inferences about characters’ intentions and make presumptions
depending on their behavior. In the contemporary state of the art, it is impossible to build and
manage a program that will perform such tasks as reasoning, i.e. predicting characters’ actions. The
most that can be done is to identify basic actions and track characters.