Abstract

Vision applications are becoming increasingly important for product quality surveillance in manufacturing. Training consistently well-performing visual detection algorithms based on convolutional neural networks is very challenging. Typically, there is too much training data for engineers to keep track of possible gaps in it. But even small cases of missing training data e.g. certain viewing angles can lead to trained CNNs that are unable to detect objects, that seem obvious to engineers i.e. cognition gaps. This paper presents how synthetic training data can be created in a targeted manner to close cognitive gaps of a CNN for specific use-cases. The proposed methodology uses 3D rendering to create new image data by variating scene parameters. The created data is used to reveal a cognition gap of a CNN. We show that by using this created synthetic data to train the CNN the cognition gap can be successfully closed. This is evaluated with the well-known AlexNet CNN used as a visual bicycle detector. The bicycle example is used as a stand-in for a geometrically interesting, but simple product, that is manufactured in large and growing amounts.

Links and resources

Tags

community

  • @unibiblio
  • @taylansngerli
@taylansngerli's tags highlighted