High performance 2D convolution utilizing the AVX512 on a multi-core architecture
DOI
Creator 1. Isamail Masamae
2. Panyayot Chaikan
Title High performance 2D convolution utilizing the AVX512 on a multi-core architecture
Publisher Research and Development Office, Prince of Songkla University
Publication Year 2564 BE (2021 CE)
Journal Title Songklanakarin Journal of Science and Technology (SJST)
Journal Vol. 43
Journal No. 4
Page no. 1230-1236
Keyword AVX512, Advanced Vector Extension, 2D convolution, OpenMP, x64 assembly language
URL Website https://rdo.psu.ac.th/sjstweb/index.php
ISSN 0125-3395
Abstract Convolution is a time-consuming operation, especially for signal and image processing, which led us to develop an efficient implementation of 2D convolution for a multi-core architecture utilizing AVX512 intrinsics and OpenMP. For single precision convolution, our algorithm is on average 2.30, 3.88, 5.75, and 19.95 times faster than the IPP, OpenCV, Baziotis's algorithm, and MKL libraries, respectively. For double precision convolution, our algorithm is on average 3.12, 5.10, and 16.95 times faster than the OpenCV, Baziotis's algorithm, and MKL libraries, respectively. We have also developed a hybrid 2D convolution algorithm, written in C and assembly, to further augment the processing speeds for small kernel sizes.
