Loop and data transformations for sparse matrix code A Venkat, M Hall, M Strout ACM SIGPLAN Notices 50 (6), 521-532, 2015 | 123 | 2015 |
Non-affine extensions to polyhedral code generation A Venkat, M Shantharam, M Hall, MM Strout Proceedings of Annual IEEE/ACM International Symposium on Code Generation …, 2014 | 88 | 2014 |
Automating wavefront parallelization for sparse matrix computations A Venkat, MS Mohammadi, J Park, H Rong, R Barik, MM Strout, M Hall SC'16: Proceedings of the International Conference for High Performance …, 2016 | 70 | 2016 |
Sparse computation data dependence simplification for efficient compiler-generated inspectors MS Mohammadi, T Yuki, K Cheshmi, EC Davis, M Hall, MM Dehnavi, ... Proceedings of the 40th ACM SIGPLAN Conference on Programming Language …, 2019 | 38 | 2019 |
Compiler generation and autotuning of communication-avoiding operators for geometric multigrid P Basu, A Venkat, M Hall, S Williams, B Van Straalen, L Oliker 20th Annual International Conference on High Performance Computing, 452-461, 2013 | 37 | 2013 |
SWIRL: High-performance many-core CPU code generation for deep neural networks A Venkat, T Rusira, R Barik, M Hall, L Truong The International Journal of High Performance Computing Applications 33 (6 …, 2019 | 36 | 2019 |
Towards making autotuning mainstream P Basu, M Hall, M Khan, S Maindola, S Muralidharan, S Ramalingam, ... The International journal of high performance computing applications 27 (4 …, 2013 | 26 | 2013 |
Harnessing deep learning via a single building block E Georganas, K Banerjee, D Kalamkar, S Avancha, A Venkat, ... 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2020 | 25 | 2020 |
Misim: An end-to-end neural code similarity system F Ye, S Zhou, A Venkat, R Marucs, N Tatbul, JJ Tithi, P Petersen, ... arXiv preprint arXiv:2006.05265, 2020 | 22 | 2020 |
Synchronization Trade-offs in GPU implementations of Graph Algorithms R Kaleem, A Venkat, S Pai, M Hall, K Pingali IEEE International Parallel & Distributed Processing Symposium (IPDPS 2016), 2016 | 22 | 2016 |
Extending index-array properties for data dependence analysis MS Mohammadi, K Cheshmi, MM Dehnavi, A Venkat, T Yuki, MM Strout Languages and Compilers for Parallel Computing: 31st International Workshop …, 2019 | 19 | 2019 |
Optimizing LOBPCG: Sparse Matrix Loop and Data Transformations in Action K Ahmad, A Venkat, M Hall The 29th International Workshop on Languages and Compilers for Parallel …, 2016 | 16 | 2016 |
ISA mapper: a compute and hardware agnostic deep learning compiler M Sotoudeh, A Venkat, M Anderson, E Georganas, A Heinecke, J Knight Proceedings of the 16th ACM International Conference on Computing Frontiers …, 2019 | 12 | 2019 |
High-performance deep learning via a single building block E Georganas, K Banerjee, D Kalamkar, S Avancha, A Venkat, ... arXiv preprint arXiv:1906.06440, 2019 | 11 | 2019 |
Misim: A neural code semantics similarity system using the context-aware semantics structure F Ye, S Zhou, A Venkat, R Marcus, N Tatbul, JJ Tithi, N Hasabnis, ... arXiv preprint arXiv:2006.05265, 2020 | 9 | 2020 |
Combining polyhedral and ast transformations in chill H Zhang, A Venkat, P Basu, M Hall Proceedings of the Sixth International Workshop on Polyhedral Compilation …, 2016 | 8 | 2016 |
Understanding the performance of small convolution operations for CNN on intel architecture A Heinecke, E Georganas, K Banerjee, D Kalamkar, N Sundaram, ... Poster in the International Conference for High Performance Computing …, 2017 | 7 | 2017 |
Predictive data locality optimization for higher-order tensor computations TR Patabandi, A Venkat, A Kulkarni, P Ratnalikar, M Hall, J Gottschlich Proceedings of the 5th ACM SIGPLAN International Symposium on Machine …, 2021 | 6 | 2021 |
Misim: A novel code similarity system F Ye, S Zhou, A Venkat, R Marcus, N Tatbul, JJ Tithi, N Hasabnis, ... | 6 | 2020 |
Context-Aware Parse Trees F Ye, S Zhou, A Venkat, R Marcus, P Petersen, JJ Tithi, T Mattson, ... arXiv preprint arXiv:2003.11118, 2020 | 4 | 2020 |