Georgios I. Goumas



copyright notice

The material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's or organization's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.

[The following is a highly outdated list. You may find updated versions in DBLP and Google scholar.]

Journal papers

V. Karakasis, T. Gkountouvas, K. Kourtis, G. Goumas, and N. Koziris, "An Extended Compression Format for the Optimization of Sparse Matrix-Vector Multiplication," IEEE Transactions on Parallel and Distributed Systems (TPDS), 24(10):1930–1940, 2013. (pdf).

K. Kourtis, G. Goumas and N. Koziris, "Exploiting Compression Opportunities to Improve SpMxV Performance on Shared Memory Systems," ACM Transactions on Architecture and Code Optimization (TACO), Vol 7, No 3, December 2011 (pdf).

G. Goumas, K. Kourtis, N. Anastopoulos, V. Karakasis and N. Koziris, "Performance Evaluation of the Sparse Matrix-vector Multiplication on Modern Architectures," The Journal of Supercomputing, Vol 50, No 1, 2009 (pdf).

G. Goumas, N. Drosinos and N. Koziris, "Communication-aware Supernode Shape," IEEE Transactions on Parallel and Distributed Systems, Vol. 20, No 4, April 2009 (pdf).

G. Goumas, N. Drosinos, M. Athanasaki and N. Koziris, "Message-Passing Code Generation for Non-rectangular Tiling Transformations," Parallel Computing, Vol. 32, No. 11, November 2006 (pdf).

N. Koziris, A. Sotiropoulos and G. Goumas, "A Pipelined Schedule to Minimize Completion Time for Loop Tiling with Computation and Communication Overlapping," Journal of Parallel and Distributed Computing, Vol. 63, No. 11, November 2003 (pdf).

G. Goumas, M. Athanasaki and N. Koziris, "An Efficient Code Generation Technique for Tiled Iteration Spaces," IEEE Transactions on Parallel and Distributed Systems, Vol. 14, No 10, October 2003 (pdf).

G. Goumas, M. Athanasaki and N. Koziris, "Code Generation Methods for Tiling Transformations," Journal of Information Science and Engineering, Vol. 18, No 5, September 2002 (pdf).

Conference papers

T. Gkountouvas, V. Karakasis, K. Kourtis, G. Goumas, and N. Koziris., "Improving the performance of the symmetric sparse matrix-vector multiplication in multicore," Proceedings of the 27th IEEE International Parallel & Distributed Processing Symposium (IPDPS'13), Boston, MA, USA, 2013, (pdf).

K. Kourtis, V. Karakasis, G. Goumas and N. Koziris, "CSX: An Extended Compression Format for SpMV on Shared Memory Systems," Proceedings of the 16th ACM Symposium on Principles and Practice of Parallel Programming (PPoPP), San Antonio, Texas, USA, 12-16 February, 2011 (pdf).

V. Karakasis, G. Goumas and N. Koziris, "Perfomance Models for Blocked Sparse Matrix-Vector Multiplication kernels," Proceedings of the International Conference on Parallel Processing (ICPP), Vienna, Austria, 22-25 September, 2009 (pdf).

K. Nikas, N. Anastopoulos, G. Goumas and N. Koziris, "Employing Transactional Memory and Helper Threads to Speedup Dijkstra’s Algorithm," Proceedings of the International Conference on Parallel Processing (ICPP), Vienna, Austria, 22-25 September, 2009 (pdf).

G. Goumas, N.Anastopoulos, N. Ioannou and N. Koziris, "Overlapping Computation and Communication in SMT Clusters with Commodity Interconnects," Proceedings of the IEEE Cluster 2009, New Orleans, Louisiana, USA, 31 August.- 4 September, 2009 (pdf).

V. Karakasis, G. Goumas and N. Koziris, "A Comparative Study of Blocking Storage Methods for Sparse Matrices on Multicore Architectures," Proceedings of the IEEE International Conference on Computational Science and Engineering (CSE), Vancouver, BC, Canada, August 29-31, 2009 (pdf).

K. Kourtis, G. Goumas and N. Koziris, "Improving the Performance of Multithreaded Sparse Matrix-Vector Multiplication using Index and Value Compression," Proceedings of the International Conference on Parallel Processing (ICPP), Portland, Oregon, USA, 8-12 September, 2008 (pdf).

K. Kourtis, G. Goumas and N. Koziris, "Optimizing Sparse Matrix-Vector Multiplication Using Index and Value Compression," Proceedings of the ACM International Conference on Computing Frontiers, Ischia, Italy, 5-7 May, 2008 (pdf).

G. Goumas, K. Kourtis, N. Anastopoulos, V. Karakasis and N. Koziris, "Understanding the Performance of Sparse Matrix-Vector Multiplication," Proceedings of the 16th Euromicro International Conference on Parallel, Distributed and network-based Processing (PDP2008), Toulouse, France, 13-15 February, 2008 (pdf).

N. Drosinos, G. Goumas and N. Koziris, "Selecting the Tile Shape to Reduce the Total Communication Volume," Proceedings of the 2006 International Parallel and Distributed Processing Symposium (IPDPS2006), Rhodes Island, Greece, 25-29 April, 2006 (pdf).

G. Goumas, N. Drosinos, M. Athanasaki and N. Koziris, "Automatic Parallel Code Generation for Tiled Nested Loops," Proceedings of the 19th ACM Symposium on Applied Computing (SAC 2004), Nicosia, Cyprus, 14-17 March, 2004 (pdf).

N. Drosinos, G. Goumas, M. Athanasaki and N. Koziris, "Delivering High Performance to Parallel Applications Using Advanced Scheduling," Proceedings of the Parallel Computing 2003 (ParCo 2003), Dresden, Germany, September 2003 (pdf).

G. Goumas, N. Drosinos, M. Athanasaki and N. Koziris, "Compiling Tiled Iteration Spaces for Clusters," Proceedings of the IEEE International Conference on Cluster Computing (CLUSTER 2002), Chicago, USA,  September 2002 (pdf).

G. Goumas, N. Drosinos, M. Athanasaki and N. Koziris, "Data Parallel Code Generation for Arbitrarily Tiled Loop Nests," Proceedings of the 2002 International Conference on Parallel and Distributed Processing Techniques and Applications (PDPTA'02), Las Vegas, USA,  June 2002 (pdf).

G. Goumas, M. Athanasaki and N. Koziris, "Automatic Code Generation for Executing Nested Loops onto Parallel Architectures," Proceedings of the ACM Symposium on Applied Computing (SAC 2002)- Parallel and Distributed Systems and Networking Track, Madrid, Spain,  March 2002 (pdf).

G. Goumas, A.Sotiropoulos and N. Koziris, "Minimizing Completion Time for Loop Tiling with Computation and Communication Overlapping," Proceedings of the 2001 International Parallel and Distributed Processing Symposium (IPDPS2001), IEEE Press, San Francisco, California, April  2001 (pdf) (Best paper award).

I. Drositis, G. Goumas, N. Koziris, P. Tsanakas and G. Papakonstantinou, "Evaluation of Loop Grouping Methods based on Orthogonal Projection Spaces," Proceedings of the 2000  International Conference on Parallel Processing (ICPP-2000), pp. 469-476, Toronto, Canada, August 2000, IEEE Press. (pdf).

Workshop papers

V. Karakasis, G. Goumas, K. Nikas, N. Koziris, J. Ruokolainen, and P. Råback, "Using State-Of-The-Art Sparse Matrix Optimizations for Accelerating the Performance of Multiphysics Simulations," PARA 2012: Workshop on State-of-the-Art in Scientific and Parallel Computing, Helsinki, Finland June 10-13, 2012 (pdf).

G. Kouveli, K. Kourtis, G. Goumas and N. Koziris, "Exploring the Benefits of Randomized Instruction Scheduling," Proceedings of the 3rd International Workshop on GCC Research Opportunities (GROW 2011), held in conjunction with CGO 2011, Chamonix, France, 3 Apr. 2011 (pdf).

A. Nanos, G. Goumas and N. Koziris, "Exploring I/O Virtualization Data Paths for MPI Applications in a Cluster of VMs: A Networking Perspective," Proceedings of the 5th Workshop on Virtualization in High-Performance Cloud computing (VHPC 2010), held in conjunction with Euro-par 2010, Ischia - Naples, Italy 31 Aug. - 3 Sep., 2010 (pdf).

G. Rokos, G. Peteinatos, G. Kouveli, G. Goumas, K. Kourtis, and N. Koziris, "Solving the advection PDE on the Cell Broadband Engine," Proceedings of the 2010 International Parallel and Distributed Processing Symposium (IPDPS2010), Workshop on Parallel and Distributed Scientific and Engineering Computing (PDSEC), Atlanta, Georgia, 19-23 April, 2010 (pdf).

N. Anastopoulos, K. Nikas, G. Goumas and N. Koziris, "Early Experiences on Accelerating Dijkstra’s Algorithm Using Transactional Memory," Proceedings of the 2009 International Parallel and Distributed Processing Symposium (IPDPS2009), Workshop on Multithreaded Architectures and Applications (MTAAP), Rome, Italy, 25-29 May, 2009 (pdf).

V. Karakasis, G. Goumas and N. Koziris, "Exploring the Effect of Block Shapes on the Performance of Sparse Kernels," Proceedings of the 2009 International Parallel and Distributed Processing Symposium (IPDPS2009), Workshop on Parallel and Distributed Scientific and Engineering Computing (PDSEC), Rome, Italy, 25-29 May, 2009 (pdf).

I. Riakiotakis, G. Goumas, N. Koziris, F.-A. Metallinou and I. A. Daglis, "Evaluation of Dynamic Scheduling Methods in Simulations of Storm-time Ion Acceleration," Proceedings of the 2008 International Parallel and Distributed Processing Symposium (IPDPS2008), Workshop on Parallel and Distributed Scientific and Engineering Computing (PDSEC), Miami, Florida, USA, 14-18 April, 2008 (pdf).

G. Goumas, N. Drosinos, V. Karakasis and N. Koziris, "Coarse-grain Parallel Execution for 2-dimensional PDE Problems," Proceedings of the 2007 International Parallel and Distributed Processing Symposium (IPDPS2007), Workshop on Parallel and Distributed Scientific and Engineering Computing (PDSEC), Long Beach, California, USA, 26-30 March, 2007 (pdf).