Neural Architecture Search by Estimation of Network Structure Distributions

Anton Muravev, Jenni Raitoharju, Moncef Gabbouj

Research output: Contribution to journalArticleScientificpeer-review

4 Citations (Scopus)
14 Downloads (Pure)

Abstract

The influence of deep learning is continuously expanding across different domains, and its new applications are ubiquitous. The question of neural network design thus increases in importance, as traditional empirical approaches are reaching their limits. Manual design of network architectures from scratch relies heavily on trial and error, while using existing pretrained models can introduce redundancies or vulnerabilities. Automated neural architecture design is able to overcome these problems, but the most successful algorithms operate on significantly constrained design spaces, assuming the target network to consist of identical repeating blocks. While such approach allows for faster search, it does so at the cost of expressivity. We instead propose an alternative probabilistic representation of a whole neural network structure under the assumption of independence between layer types. Our matrix of probabilities is equivalent to the population of models, but allows for discovery of structural irregularities, while being simple to interpret and analyze. We construct an architecture search algorithm, inspired by the estimation of distribution algorithms, to take advantage of this representation. The probability matrix is tuned towards generating high-performance models by repeatedly sampling the architectures and evaluating the corresponding networks, while gradually increasing the model depth. Our algorithm is shown to discover non-regular models which cannot be expressed via blocks, but are competitive both in accuracy and computational cost, while not utilizing complex dataflows or advanced training techniques, as well as remaining conceptually simple and highly extensible.

Original languageEnglish
Pages (from-to)15304-15319
Number of pages16
JournalIEEE Access
Volume9
DOIs
Publication statusPublished - 20 Jan 2021
Publication typeA1 Journal article-refereed

Keywords

  • Automatic architecture design
  • Computational modeling
  • Computer architecture
  • convolutional neural network
  • deep learning
  • Deep learning
  • Estimation
  • estimation of distribution algorithm
  • Search problems
  • Task analysis
  • Training

Publication forum classification

  • Publication forum level 2

ASJC Scopus subject areas

  • General Computer Science
  • General Materials Science
  • General Engineering

Fingerprint

Dive into the research topics of 'Neural Architecture Search by Estimation of Network Structure Distributions'. Together they form a unique fingerprint.

Cite this