Article summary of A feedforward architecture accounts for rapid categorization by Serre, Oliva & Poggio - Chapter


Object recognition takes place in the ventral visual system in the cortex. This system runs from the visual V1 area to IT. From there, there are connections with the PFC that ensure that perception and memory are connected. The further down the path, the more specific the neurons are and the greater their receptive fields. Plasticity and learning are probably present in all phases of object recognition.

It is not known what feedback will be given from the phases. The hypothesis is that the basic processes of information go feedforward and are supported by short time limits that are required for specific responses. However, this hypothesis does not exclude feedback loops. The feedforward architecture is a reasonable starting point for the theory of the visual cortex aimed at explaining direct object recognition. The recognition phase can also play a role.

Model based on the feedforward theory

The model used here is a direct extension of the Wiesel and Hubel model. The model is a feedforward theory of visual processes. Every feedforward theory distinguishes between two types of cells; simple and complex cells. This creates a distinction between selectivity and invariance. This model uses an input of 7 × 7 boxes. The boxes are first analyzed by a multidimensional row of simple S1 units that respond best to lines and angles. The S1 units can be on and off if the figure they represent is present. The next level is C1. C1 looks like complex cells from the striatum. Every complex C1 unit receives information from a unit of S1 cells with the same orientation. However, there may be slight differences and C1 can receive multiple S1 units, making the C1 unit less sensitive to position and size.

At the C level, complex cells are thus converted into a two-dimensional image by combining the afferent S cells. The results are calculated with firing neurons. This creates a complex model with how a representation is created in the brain of a visual image. Reversing a C level is also called a MAX operation.

Not all features of a complete feedforward network can be found in this model. That would be too complicated. Every model tries to describe and explain a piece. Because we still have little knowledge of how everything works exactly, this remains very complicated. So you also have to put the different theories (and articles) next to each other and try to understand and apply them to each other. Just as in this model the MAX returns.

Results

The above data was presented and measured with a task where test subjects had to indicate whether or not an animal was presented. The showing of the pictures was very short. Yet people can then quickly and accurately decide whether or not there is an animal. This means that our object recognition works very quickly. If the image was displayed a little shorter this also had little effect. The model was used to view the similarities with people.

A comparison between human performance and the feedforward model in animal tasks was measured with (d'); a monotonic function of the performance of the observers. The results showed that people and the model responded about the same and that there were similar reactions. Recent studies have even shown that animal recognition also works if, for example, the image is shown upside down. People were a bit slower when using a concealment component.

Discussion

This model improves the old model on two important points.

  1. A learning phase has been added. The hierarchy of the visual system is viewed better. Learning takes place in two independent phases. First during development with unsupervised learning, then at task-specific circuits.

  2. The new model is closer to the anatomy of the visual cortex. This model also suggests (along with other theories) that an initial feedforward step is driven in bottom-up processes for a basic representation, built from a basic dictionary with general figures.

Access: 
Public
Work for WorldSupporter

Image

JoHo can really use your help!  Check out the various student jobs here that match your studies, improve your competencies, strengthen your CV and contribute to a more tolerant world

Working for JoHo as a student in Leyden

Parttime werken voor JoHo

Image

Click & Go to more related summaries or chapters:

Summaries per article with Artificial intelligences and Neurocognition at Leiden University 20/21

Summaries per article with Artificial intelligences and Neurocognition at Leiden University 20/21

Supporting content: 
Access: 
Public

Study Guide with articlesummaries for Artificial Intelligence at Leiden University

Articlesummaries with Artificial Intelligence at Leiden University

Table of content

  • Modeling visual recognition from neurobiological constraints
  • Achieving machine realization of truly human-like intelligence
  • Untangling invariant object recognition
  • Computing machinery and intelligence
  • Rule-Based Expert Systems: The MYCIN Experiments of the Stanford Heuristic Programming Project
  • Robots with instincts
  • Male and female robots
  • Speed ​​of processing in the human visual system
  • Sparse but not ""Grandmother-cell"" coding in the medial temporal lobe
  • Perceptrons
  • Learning and neural plasticity in visual object recognition
  • Breaking position-invariant object recognition
  • A feedforward architecture accounts for rapid categorization
  • Hierarchical models of object recognition in cortex
  • Articlesummaries with prescribed articles for AI - 2020/2021
Access: 
Public
Comments, Compliments & Kudos:

Add new contribution

CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Image CAPTCHA
Enter the characters shown in the image.
Check how to use summaries on WorldSupporter.org


Online access to all summaries, study notes en practice exams

Using and finding summaries, study notes en practice exams on JoHo WorldSupporter

There are several ways to navigate the large amount of summaries, study notes en practice exams on JoHo WorldSupporter.

  1. Starting Pages: for some fields of study and some university curricula editors have created (start) magazines where customised selections of summaries are put together to smoothen navigation. When you have found a magazine of your likings, add that page to your favorites so you can easily go to that starting point directly from your profile during future visits. Below you will find some start magazines per field of study
  2. Use the menu above every page to go to one of the main starting pages
  3. Tags & Taxonomy: gives you insight in the amount of summaries that are tagged by authors on specific subjects. This type of navigation can help find summaries that you could have missed when just using the search tools. Tags are organised per field of study and per study institution. Note: not all content is tagged thoroughly, so when this approach doesn't give the results you were looking for, please check the search tool as back up
  4. Follow authors or (study) organizations: by following individual users, authors and your study organizations you are likely to discover more relevant study materials.
  5. Search tool : 'quick & dirty'- not very elegant but the fastest way to find a specific summary of a book or study assistance with a specific course or subject. The search tool is also available at the bottom of most pages

Do you want to share your summaries with JoHo WorldSupporter and its visitors?

Quicklinks to fields of study (main tags and taxonomy terms)

Field of study

Access level of this page
  • Public
  • WorldSupporters only
  • JoHo members
  • Private
Statistics
725