lnu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
DBGC: Dimension-Based Generic Convolution Block for Object Recognition
Charotar Univ Sci & Technol CHARUSAT, India.ORCID iD: 0000-0001-8280-1140
Parul Univ, India.
Charotar Univ Sci & Technol CHARUSAT, India.
Charotar Univ Sci & Technol CHARUSAT, India.
Show others and affiliations
2022 (English)In: Sensors, E-ISSN 1424-8220, Vol. 22, no 5, article id 1780Article in journal (Refereed) Published
Abstract [en]

The object recognition concept is being widely used a result of increasing CCTV surveillance and the need for automatic object or activity detection from images or video. Increases in the use of various sensor networks have also raised the need of lightweight process frameworks. Much research has been carried out in this area, but the research scope is colossal as it deals with open-ended problems such as being able to achieve high accuracy in little time using lightweight process frameworks. Convolution Neural Networks and their variants are widely used in various computer vision activities, but most of the architectures of CNN are application-specific. There is always a need for generic architectures with better performance. This paper introduces the Dimension-Based Generic Convolution Block (DBGC), which can be used with any CNN to make the architecture generic and provide a dimension-wise selection of various height, width, and depth kernels. This single unit which uses the separable convolution concept provides multiple combinations using various dimension-based kernels. This single unit can be used for height-based, width-based, or depth-based dimensions; the same unit can even be used for height and width, width and depth, and depth and height dimensions. It can also be used for combinations involving all three dimensions of height, width, and depth. The main novelty of DBGC lies in the dimension selector block included in the proposed architecture. Proposed unoptimized kernel dimensions reduce FLOPs by around one third and also reduce the accuracy by around one half; semi-optimized kernel dimensions yield almost the same or higher accuracy with half the FLOPs of the original architecture, while optimized kernel dimensions provide 5 to 6% higher accuracy with around a 10 M reduction in FLOPs.

Place, publisher, year, edition, pages
MDPI, 2022. Vol. 22, no 5, article id 1780
Keywords [en]
CNN, separable convolution, DBGC, dimension-based kernels
National Category
Computer Sciences
Research subject
Computer and Information Sciences Computer Science, Computer Science
Identifiers
URN: urn:nbn:se:lnu:diva-111487DOI: 10.3390/s22051780ISI: 000775126200001PubMedID: 35270929Scopus ID: 2-s2.0-85125090746Local ID: 2022OAI: oai:DiVA.org:lnu-111487DiVA, id: diva2:1653056
Available from: 2022-04-20 Created: 2022-04-20 Last updated: 2023-08-14Bibliographically approved

Open Access in DiVA

fulltext(6132 kB)173 downloads
File information
File name FULLTEXT01.pdfFile size 6132 kBChecksum SHA-512
a773e9398633ad6ac5f1c9cf55b7565db929ca6e3c5cb4b3f3980888c389a3ca7ea1b5e4a699d43205440a988a22de987ae30f39ac8b4bcaea38018d48091689
Type fulltextMimetype application/pdf

Other links

Publisher's full textPubMedScopus

Authority records

Ghayvat, Hemant

Search in DiVA

By author/editor
Patel, ChiragPandya, SharnilMajumdar, ShubhankarShah, Syed AzizGhayvat, Hemant
By organisation
Department of computer science and media technology (CM)
In the same journal
Sensors
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 173 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
pubmed
urn-nbn

Altmetric score

doi
pubmed
urn-nbn
Total: 153 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf