TY - GEN
T1 - FPGA based parallel architecture implementation of Stacked Error Diffusion algorithm
AU - Venugopal, Rishvanth Kora
AU - Heath, J. Robert
AU - Lau, Daniel L.
PY - 2011
Y1 - 2011
N2 - Digital halftoning is a crucial technique used in digital printers to convert a continuous-tone image into a pattern of black and white dots. Halftoning is used since printers have a limited availability of inks and cannot reproduce all the color intensities in a continuous image. Error Diffusion is an algorithm in halftoning that iteratively quantizes pixels in a neighborhood dependent fashion. This manuscript focuses on the development, design and Hardware Description Language (HDL) functional and performance simulation validation of a parallel scalable hardware architecture for high performance implementation of a high quality Stacked Error Diffusion algorithm. A CMYK printer, utilizing the high quality error diffusion algorithm, would be required to execute error diffusion 16 times per pixel, resulting in a potentially high computational cost. The algorithm, originally described in 'C', requires a significant processing time when implemented on a conventional single Central Processing Unit (CPU) based computer system. Thus, a new scalable high performance parallel hardware processor architecture is developed to implement the algorithm and is implemented to and tested on a single Programmable Logic Device (PLD) based Field Programmable Gate Array (FPGA) chip. There is a significant decrease in the run time of the algorithm when run on the newly proposed parallel architecture implemented to FPGA technology compared to execution on a single CPU based system.
AB - Digital halftoning is a crucial technique used in digital printers to convert a continuous-tone image into a pattern of black and white dots. Halftoning is used since printers have a limited availability of inks and cannot reproduce all the color intensities in a continuous image. Error Diffusion is an algorithm in halftoning that iteratively quantizes pixels in a neighborhood dependent fashion. This manuscript focuses on the development, design and Hardware Description Language (HDL) functional and performance simulation validation of a parallel scalable hardware architecture for high performance implementation of a high quality Stacked Error Diffusion algorithm. A CMYK printer, utilizing the high quality error diffusion algorithm, would be required to execute error diffusion 16 times per pixel, resulting in a potentially high computational cost. The algorithm, originally described in 'C', requires a significant processing time when implemented on a conventional single Central Processing Unit (CPU) based computer system. Thus, a new scalable high performance parallel hardware processor architecture is developed to implement the algorithm and is implemented to and tested on a single Programmable Logic Device (PLD) based Field Programmable Gate Array (FPGA) chip. There is a significant decrease in the run time of the algorithm when run on the newly proposed parallel architecture implemented to FPGA technology compared to execution on a single CPU based system.
KW - Application Specific Processor
KW - Digital Halftoning
KW - HDL Functional/Performance Simulation Validation
KW - Image Processor
KW - Multiprocessor System-on-Chip
KW - Reconfigurable Architecture
KW - Scalable Parallel Architecture
KW - Stacked Error Diffusion
UR - http://www.scopus.com/inward/record.url?scp=79961187842&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79961187842&partnerID=8YFLogxK
U2 - 10.1109/SASP.2011.5941080
DO - 10.1109/SASP.2011.5941080
M3 - Conference contribution
AN - SCOPUS:79961187842
SN - 9781457712111
T3 - Proceedings of the 2011 IEEE 9th Symposium on Application Specific Processors, SASP 2011
SP - 66
EP - 69
BT - Proceedings of the 2011 IEEE 9th Symposium on Application Specific Processors, SASP 2011
Y2 - 5 June 2011 through 6 June 2011
ER -