Performance analysis of SSE and AVX instructions in multi-core CPUs and GPU computing on FDTD scheme for solid and fluid vibration problems

Francés, Jorge; Bleda, Sergio; Márquez, Andrés; Neipp, Cristian; Gallego, Sergi; Otero Calviño, Beatriz; Beléndez, Augusto

Performance analysis of SSE and AVX instructions in multi-core CPUs and GPU computing on FDTD scheme for solid and fluid vibration problems

Por favor, use este identificador para citar o enlazar este ítem: http://hdl.handle.net/10045/42001

Registro completo de metadatos

Registro completo de metadatos
Campo DC	Valor	Idioma
dc.contributor	Holografía y Procesado Óptico	es
dc.contributor.author	Francés, Jorge	-
dc.contributor.author	Bleda, Sergio	-
dc.contributor.author	Márquez, Andrés	-
dc.contributor.author	Neipp, Cristian	-
dc.contributor.author	Gallego, Sergi	-
dc.contributor.author	Otero Calviño, Beatriz	-
dc.contributor.author	Beléndez, Augusto	-
dc.contributor.other	Universidad de Alicante. Departamento de Física, Ingeniería de Sistemas y Teoría de la Señal	es
dc.contributor.other	Universidad de Alicante. Instituto Universitario de Física Aplicada a las Ciencias y las Tecnologías	es
dc.contributor.other	Universidad Politécnica de Cataluña. Departamento de Arquitectura de Computadores	es
dc.date.accessioned	2014-11-04T15:53:05Z	-
dc.date.available	2014-11-04T15:53:05Z	-
dc.date.created	2013-09	-
dc.date.issued	2014-11-01	-
dc.identifier.citation	The Journal of Supercomputing. 2014, 70(2): 514-526. doi:10.1007/s11227-013-1065-x	es
dc.identifier.issn	0920-8542 (Print)	-
dc.identifier.issn	1573-0484 (Online)	-
dc.identifier.uri	http://hdl.handle.net/10045/42001	-
dc.description.abstract	In this work a unified treatment of solid and fluid vibration problems is developed by means of the Finite-Difference Time-Domain (FDTD). The scheme here proposed takes advantage from a scaling factor in the velocity fields that improves the performance of the method and the vibration analysis in heterogenous media. Moreover, the scheme has been extended in order to simulate both the propagation in porous media and the lossy solid materials. In order to accurately reproduce the interaction of fluids and solids in FDTD both time and spatial resolutions must be reduced compared with the set up used in acoustic FDTD problems. This aspect implies the use of bigger grids and hence more time and memory resources. For reducing the time simulation costs, FDTD code has been adapted in order to exploit the resources available in modern parallel architectures. For CPUs the implicit usage of the advanced vectorial extensions (AVX) in multi-core CPUs has been considered. In addition, the computation has been distributed along the different cores available by means of OpenMP directives. Graphic Processing Units have been also considered and the degree of improvement achieved by means of this parallel architecture has been compared with the highly-tuned CPU scheme by means of the relative speed up. The speed up obtained by the parallel versions implemented were up to 3 (AVX and OpenMP) and 40 (CUDA) times faster than the best sequential version for CPU that also uses OpenMP with auto-vectorization techniques, but non includes implicitely vectorial instructions. Results obtained with both parallel approaches demonstrate that massive parallel programming techniques are mandatory in solid-vibration problems with FDTD.	es
dc.description.sponsorship	The work is partially supported by the “Ministerio de Economía y Competitividad” of Spain under project FIS2011-29803-C02-01, by the Spanish Ministry of Education (TIN2012-34557), by the “Generalitat Valenciana” of Spain under projects PROMETEO/2011/021 and ISIC/2012/013, and by the “Universidad de Alicante” of Spain under project GRE12-14.	es
dc.language	eng	es
dc.publisher	Springer Science+Business Media	es
dc.rights	The final publication is available at Springer via http://dx.doi.org/10.1007/s11227-013-1065-x	es
dc.subject	FDTD	es
dc.subject	GPU	es
dc.subject	CPU	es
dc.subject	OpenMP	es
dc.subject	AVX	es
dc.subject	Vibration	es
dc.subject.other	Física Aplicada	es
dc.title	Performance analysis of SSE and AVX instructions in multi-core CPUs and GPU computing on FDTD scheme for solid and fluid vibration problems	es
dc.type	info:eu-repo/semantics/article	es
dc.peerreviewed	si	es
dc.identifier.doi	10.1007/s11227-013-1065-x	-
dc.relation.publisherversion	http://dx.doi.org/10.1007/s11227-013-1065-x	es
dc.rights.accessRights	info:eu-repo/semantics/openAccess	es
Aparece en las colecciones:	INV - GHPO - Artículos de Revistas

Archivos en este ítem:

Archivos en este ítem:
Archivo	Descripción	Tamaño	Formato
J_of_Supercomputing_v70_p514_2014.pdf	Versión final (acceso restringido)	636,87 kB	Adobe PDF	Abrir Solicitar una copia
J_of_Supercomputing_v70_p514_2014_accepted.pdf	Versión revisada (acceso abierto)	1,02 MB	Adobe PDF	Abrir Vista previa Cerrar vista previa

Ver citas en Google Académico

Muestra el registro sencillo