Strategies to optimize the LU factorization algorithm on multicore computers
Date
2013
Author
Soler, Janet
Ortiz, Javier
Wolfmann, Aaron Gustavo
Abstract
The number of cores in multicore computers keeps increasing, and this trend shows no sign of reversing. Computers with multiple sockets for multicore chips, which rely on complex hardware designs, are also becoming more common. Parallelizing algorithms to obtain higher performance on this type of computer is a goal that can only be achieved by taking the hardware architecture into account. As hardware evolves, so must software, and as a result old parallelization strategies quickly become obsolete. This paper presents a series of alternatives for parallelizing the LU factorization algorithm on a multicore system, together with their results. Simple strategies lead to poor results. The study presents more complex strategies that combine two levels of parallelism with asynchronous scheduling, whose results match the state of the art in the field and even surpass it.
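
For readers unfamiliar with the algorithm, the following is a minimal sequential sketch of a tiled (blocked) right-looking LU factorization. It is not the authors' implementation: the abstract does not specify the variant used, so the absence of pivoting, the block size parameter `bs`, and the function names `lu_blocked` and `reconstruct` are all assumptions made for illustration. The four numbered steps are the units of work that a parallel version would typically distribute across cores.

```python
def lu_blocked(a, bs):
    """Factor the square matrix `a` (list of lists) in place as A = L * U.

    Illustrative assumptions: no pivoting, so the matrix must not produce
    a zero pivot (for example, a diagonally dominant matrix).
    On return, the strict lower triangle holds L (unit diagonal implied)
    and the upper triangle holds U.
    """
    n = len(a)
    for k in range(0, n, bs):
        ke = min(k + bs, n)  # end of the current diagonal block

        # Step 1: unblocked LU of the diagonal block A[k:ke, k:ke].
        for j in range(k, ke):
            for i in range(j + 1, ke):
                a[i][j] /= a[j][j]
                for c in range(j + 1, ke):
                    a[i][c] -= a[i][j] * a[j][c]

        # Step 2: row panel, solve L_kk * X = A[k:ke, ke:n] by forward
        # substitution. Each column c is independent of the others.
        for c in range(ke, n):
            for j in range(k, ke):
                for i in range(j + 1, ke):
                    a[i][c] -= a[i][j] * a[j][c]

        # Step 3: column panel, solve X * U_kk = A[ke:n, k:ke].
        # Each row r is independent of the others.
        for r in range(ke, n):
            for j in range(k, ke):
                for c in range(k, j):
                    a[r][j] -= a[r][c] * a[c][j]
                a[r][j] /= a[j][j]

        # Step 4: trailing update, A[ke:n, ke:n] -= L_panel * U_panel.
        # Every entry (r, c) can be updated independently.
        for r in range(ke, n):
            for c in range(ke, n):
                s = 0.0
                for j in range(k, ke):
                    s += a[r][j] * a[j][c]
                a[r][c] -= s
    return a


def reconstruct(f):
    """Multiply the packed factors back together to check that L * U = A."""
    n = len(f)
    out = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            s = 0.0
            for p in range(min(i, j) + 1):
                l_ip = 1.0 if p == i else f[i][p]  # unit diagonal of L
                s += l_ip * f[p][j]
            out[i][j] = s
    return out
```

Within one iteration, steps 2 and 3 depend only on step 1, and step 4 depends on both panels. A parallel version could exploit this by running the independent columns, rows, and trailing entries (or tiles of them) concurrently, and an asynchronous scheduler could start the next iteration's diagonal block as soon as its own inputs are updated rather than waiting at a barrier. How the paper maps these dependencies onto two levels of parallelism is not stated in the abstract.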