Aviso: para depositar documentos, por favor, inicia sesión e identifícate con tu cuenta de correo institucional de la UCM con el botón MI CUENTA UCM. No emplees la opción AUTENTICACIÓN CON CONTRASEÑA
 

Wavelet transform for large scale image processing on modern microprocessors

dc.book.titleHigh performance computing for computational science - VECPAR 2002
dc.contributor.authorChaver Martínez, Daniel Ángel
dc.contributor.authorTenllado Van Der Reijden, Christian Tomás
dc.contributor.authorPiñuel Moreno, Luis
dc.contributor.authorTirado Fernández, José Francisco
dc.date.accessioned2023-06-20T21:10:49Z
dc.date.available2023-06-20T21:10:49Z
dc.date.issued2003
dc.description© 2003 Springer Verlag. International Conference on High Performance Computing for Computational Science (VECPAR 2002) (5th. jun 26-28, 2002. Oporto, Portugal).
dc.description.abstractIn this paper we discuss several issues relevant to the vectorization of a 2-D Discrete Wavelet Transform on current microprocessors. Our research is based on previous studies about the efficient exploitation of the memory hierarchy, due to its tremendous impact on performance. We have extended this work with a more detailed analysis based on hardware performance counters and a study of vectorization, in particular, we have used the Intel Pentium SSE instruction set. Most of our optimizations are performed at source code level to allow automatic vectorization, though some compiler intrinsic functions have been introduced to enhance performance. Taking into account the abstraction at which the optimizations are performed, the results obtained on an Intel Pentium III microprocessor are quite satisfactory, even though further improvement can be obtained by a more extensive use of compiler intrinsics.
dc.description.departmentSección Deptal. de Arquitectura de Computadores y Automática (Físicas)
dc.description.facultyFac. de Ciencias Físicas
dc.description.refereedTRUE
dc.description.statuspub
dc.eprint.idhttps://eprints.ucm.es/id/eprint/29933
dc.identifier.isbn3-540-00852-7
dc.identifier.officialurlhttp://dx.doi.org/10.1007/3-540-36569-9_37
dc.identifier.relatedurlhttp://link.springer.com/
dc.identifier.relatedurlhttp://www.researchgate.net/publication/2544724_Wavelet_Transform_for_Large_Scale_Image_Processing_on_Modern_Microprocessors/file/50463525da13f6212a.pdf
dc.identifier.urihttps://hdl.handle.net/20.500.14352/60900
dc.language.isoeng
dc.page.final562
dc.page.initial549
dc.publisherSpringer-Verlag Berlin
dc.rights.accessRightsopen access
dc.subject.cdu004
dc.subject.keywordPerformance
dc.subject.ucmInformática (Informática)
dc.subject.ucmProgramación de ordenadores (Informática)
dc.subject.unesco1203.17 Informática
dc.subject.unesco1203.23 Lenguajes de Programación
dc.titleWavelet transform for large scale image processing on modern microprocessors
dc.typebook part
dc.volume.number2565
dcterms.references[1] Z. Zhang and R. S. Blum. A Categorization of Multiscale-Decomposition-Based ImageFusion Schemes with a Performance Study for a Digital Camera Application. Proceeding of the IEEE, Vol. 87(8):1315-1325, August 1999 [2] E. J. Stollnitz, T. D. DeRose and D. H. Salesin. Wavelets for Computer Graphics: Theory and Applications. Computer Graphics and Geometric Modeling, Morgan Kaufmann Publishers, Inc. San Francisco, 1996 [3] Intel Corp. Pentium-III processor. http://developer.intel.com/design/PentiumIII [4] C. Chakrabarti and C. Mumford. Efficient realizations of encoders and decoders based on the 2-D discrete wavelet transforms. IEEE Trans. VLSI Syst., pp. 289-298, September 1999 [5] T. Denk and K. Parhi. LSI Architectures for Lattice Structure Based Orthonormal Discrete Wavelet Transforms. IEEE Trans. Circuits and Systems, vol. 44, pp. 129-132, February 1997 [6] C. Chrysafis and A. Ortega. Line Based Reduced Memory Wavelet Image Compression. IEEE Trans. on Image Processing, Vol 9, No 3, pp. 378-389, March 2000 [7] S. Chatterjee, V. V. Jain, et al. Nonlinear Array Layouts for Hierarchical Memory Systems. Proceedings of 1999 ACM International Conference on Supercomputing, pp. 444-453, Rhodes, Greece, June 1999 [8] P. Meerwald, R. Norcen, et al. Cache issues with JPEG2000 wavelet lifting. In C.-C. Jay Kuo, editor, Visual Communications and Image Processing 2002 (VCIP'02), volume 4671 of SPIE Proceedings, San Jose, CA, USA, January 2002 [9] Intel Corp. C/C++ Compiler. http://www.intel.com/software/products/compilers [10] K. London, J. Dongarra, et al. End-user Tools for Application Performance Analysis, Using Hardware Counters. Presented at International Conference on Parallel and Distributed Computing Systems. August 2001 [11] Perfctr Linux driver. Info. available at http://www.csd.uu.se/~mikpe/linux/perfctr [12] Intel Corp. Data Alignment and Programming Issues for the Streaming SIMD Extensions with the Intel C/C++ Compiler. Intel Application Note AP-833. Available at http://developer.intel.com [13] Intel Corp. Intel Architecture Optimization. Reference Manual. Available at http://developer.intel.com [14] M. Holmström. Parallelizing the fast wavelet transform. Parallel Computing, 11(21):1837- 1848, April 1995 [15] D. Chaver, M. Prieto, L. Piñuel, F. Tirado. Parallel Wavelet Transform for Large Scale Image Processing. Proceedings of the International Parallel and Distributed Processing Symposium (IPDPS'2002). Florida, USA, April 2002 [16] O.M. Nielsen and M. Hegland. Parallel Performance of Fast Wavelet Transform. International Journal of High Speed Computing, 11 (1): 55-73, June 2000 [17] L. Yang and M. Misra. Coarse-Grained Parallel Algorithms for Multi-Dimensional Wavelet Transforms. The journal of Supercomputing 11:1-22 , 1997 [18] M. Feil and A. Uhl. Multicomputer algorithms for wavelet packet image decomposition. Proceedings of the International Parallel and Distributed Processing Symposium (IPDPS'2000), pages 793-798, Cancun, Mexico, 2000 [19] Intel Corp. Real and Complex FIR Filter Using Streaming SIMD Extensions. Intel Application Note AP-809. Available at http://developer.intel.com [20] D. Chaver, C. Tenllado, L. Piñuel, M. Prieto and F. Tirado. Vectorizing the Wavelet Transform on the Intel Pentium III Microprocessor. Technical Report 02-001. Dept. of Computer Architecture. Complutense University, 2002
dspace.entity.typePublication
relation.isAuthorOfPublication6b8b1488-47cc-441e-921b-c1e8042d627c
relation.isAuthorOfPublicationd47f11bf-2134-459b-bcf7-6e1efa4aa8b6
relation.isAuthorOfPublication2ce782af-0e05-45eb-b58a-d2efffec6785
relation.isAuthorOfPublication1356616c-9e69-4852-8415-62fd0b8e7cfc
relation.isAuthorOfPublication.latestForDiscovery6b8b1488-47cc-441e-921b-c1e8042d627c

Download

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
piñuel24preprint.pdf
Size:
585.17 KB
Format:
Adobe Portable Document Format