Category Talks

RC’12: gnitupmoC

Cong is in Copenhagen presenting our work on synthesizing program inverses for the case of programs with loops, at RC’12. UPDATE: Slides posted (see link below).

  • Cong Hou, Daniel Quinlan, David Jefferson, Richard Fujimoto, Richard Vuduc. “Loop synthesis for program inversion.” In Proc. 4th Wkshp. Reversible Computation (RC), Copenhagen, Denmark, July 2012. [PDF Preprint | PDF slides]

    Scene from Copenhagen - RC'12, by Cong Hou

    Photo by Cong Hou –

Kent on exascale @ ICS’12

Kent is in Venice, Italy this week presenting our 3D FFT exascale projection paper at ICS’12 (Session 6).

  • K. Czechowski, C. McClanahan, C. Battaglino, K. Iyer, P.-K. Yeung, R. Vuduc. “On the communication complexity of the 3D FFT and its implications for exascale.” In Proc. ACM Int’l. Conf. Supercomputing (ICS), San Servolo Island, Venice, Italy, June 2012. doi:10.1145/2304576.2304604 [PDF Preprint | PDF slides]

UPDATE: Kent is joined by fellow Georgia Tech HPC students, Rob McColl and Oded Green, who are presenting their paper on a new parallel algorithm for merging on GPUs [doi:10.1145/2304576.2304621].

UPDATE 2: PDF slides are available (see reference above).

Georgia Tech @ ICS'12 in Venice, Italy

Georgia Tech @ ICS’12: (left-to-right) Kent (HPC Garage) and fellow Georgia Tech’ers, Rob and Oded (of David Bader’s HPC Lab)

ISC’12 + EIHECS’12

Rich is in Germany this week giving invited talks at ISC’12 (Hamburg) and EIHECS’12 (Erlangen) on our theoretical co-design work. The big news at the meeting in Hamburg is the latest Top500 list, which IBM’s Blue Gene/Q machine dominates this year. However, Germany’s SuperMUC system is also making a big splash. The meeting in Erlangen is a cozy local affair, and a great opportunity to meet members of the top international groups in computational science and performance engineering. Many thanks to the hosts at both events!

View of the Elbe River in Hamburg

View of the Elbe River in Hamburg



Aparna, Jee, and Rich are having East-meets-West moments in Shànghǎi (上海) this week for the IEEE Parallel and Distributed Processing Symposium (IPDPS). Aparna and Jee are part of the PhD Forum, and Rich is giving a talk at EduPar-12 and serving as a panelist on the topic of exascale algorithms [IPDPS program].

Neon sign on Romanesque facade on East Nanjing Road in Shanghai

Neon sign on Nánjīng Lù (南京路) in Shànghǎi (上海)


Beacon @ KAUST

Aparna, Jee, and Rich are at the King Abdullah University of Science and Technology (KAUST) in Saudi Arabia this weekend. Well, it’s a work-week for the Saudis. We are presenting and schmoozing with a full slate of amazing attendees of the [Scalable Hierarchical Algorithms for eXtreme Computing] (SHAX-C) workshop. UPDATE: Rich’s talk slides are available [here].

SDM’12 Best Paper

Congratulations to Dongryeol Lee (advisor: Alex Gray) on winning the Best Paper Award at SIAM Data Mining 2012! This paper describes a hybrid distributed/shared memory implementation of kernel summations for high-dimensional data.

  • Dongryeol Lee, Richard Vuduc, Alexander G. Gray. “A distributed kernel summation framework for general-dimension machine learning.” In Proc. SIAM Int’l. Conf. Data Mining, Anaheim, CA, USA, April 2012. Winner, Best Paper. [PDF]

DOE X-PI ’12

This week, Aparna and Rich are at the US Department of Energy-sponsored Exascale Research Conference in Portland, Oregon. UPDATE: See Rich’s position talk slides [here].

PRACE Future Tech ’12

Jeff Vetter (ORNL/GT) and Rich are invited speakers this week at the PRACE-1IP WP9 Future Technologies Workshop in Daresbury, United Kingdom. Rich is presenting the co-design theory that Kent has been developing. UPDATE: See Rich’s slides on Kent’s co-design work [here].


This week, Cong is presenting his work on synthesizing program inverses at the 21st International Conference on Compiler Construction (CC) 2012, in Talinn, Estonia. A program inverse enables speculative parallelization, but also has applications in program debugging and software engineering (e.g., automatically creating “undo” functionality). His paper describes new compiler-based analysis and transformation techniques for the problem, specifically by introducing two new intermediate program representations: a value search graph, which extends single static assignment form to convert the synthesis problem into a graph search problem, and a route graph, which is a subset of the VSG that represents an inverse. Read the gory details below.


Rich is at the 2012 ACM SIGPLAN Symposium Principles and Practice of Parallel Programming (PPoPP) in New Orleans this week to show moral support for Jaewoong Sim and his advisor, Hyesoon Kim, as they present our work on a new performance model for understanding the performance of GPU programs.

  • Jaewoong Sim, Aniruddha Dasgupta, Hyesoon Kim, Richard Vuduc. “GPUPerf: A performance analysis framework for identifying performance benefits in GPGPU applications.” In Proc. ACM SIGPLAN Symp. Principles and Practice of Parallel Programming (PPoPP), New Orleans, LA, USA, February 2012. [doi:10.1145/2145816.2145819]