Rajesh Nishtala
Rajesh Nishtala
Verified email at
Cited by
Cited by
Scaling memcache at facebook
R Nishtala, H Fugal, S Grimm, M Kwiatkowski, H Lee, HC Li, R McElroy, ...
10th USENIX Symposium on Networked Systems Design and Implementation (NSDI …, 2013
Productivity and performance using partitioned global address space languages
K Yelick, D Bonachea, WY Chen, P Colella, K Datta, J Duell, SL Graham, ...
Proceedings of the 2007 international workshop on Parallel symbolic …, 2007
Performance optimizations and bounds for sparse matrix-vector multiply
R Vuduc, JW Demmel, KA Yelick, S Kamil, R Nishtala, B Lee
SC'02: Proceedings of the 2002 ACM/IEEE Conference on Supercomputing, 26-26, 2002
Optimizing bandwidth limited problems using one-sided communication and overlap
C Bell, D Bonachea, R Nishtala, K Yelick
Proceedings 20th IEEE International Parallel & Distributed Processing …, 2006
When cache blocking of sparse matrix vector multiply works and why
R Nishtala, RW Vuduc, JW Demmel, KA Yelick
Applicable Algebra in Engineering, Communication and Computing 18, 297-311, 2007
Scaling communication-intensive applications on BlueGene/P using one-sided communication and overlap
R Nishtala, PH Hargrove, DO Bonachea, KA Yelick
2009 IEEE International Symposium on Parallel & Distributed Processing, 1-12, 2009
Kraken: Leveraging live traffic tests to identify and resolve resource utilization bottlenecks in large scale web services
K Veeraraghavan, J Meza, D Chou, W Kim, S Margulis, S Michelson, ...
12th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2016
Performance without pain= productivity: Data layout and collective communication in UPC
R Nishtala, G Almasi, C Casçaval
Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of …, 2008
Tuning collective communication for Partitioned Global Address Space programming models
R Nishtala, Y Zheng, PH Hargrove, KA Yelick
Parallel Computing 37 (9), 576-591, 2011
Optimizing collective communication on multicores
R Nishtala, KA Yelick
First USENIX Workshop on Hot Topics in Parallelism, 2009
Automatic performance tuning and analysis of sparse triangular solve
R Vuduc, S Kamil, J Hsu, R Nishtala, JW Demmel, KA Yelick
ICS, 2002
Performance modeling and analysis of cache blocking in sparse matrix vector multiply
R Nishtala, RW Vuduc, JW Demmel, KA Yelick
University of California, Tech. Rep. UCB/CSD-04-1335, 2004
Aggregation query under uncertainty in sensor networks
Y Hida, P Huang, R Nishtala
Department of Electrical Engineering and Computer Science. University of …, 2004
System and method for implementing cache consistent regional clusters
YJ Song, P Ajoux, HC Li, J Sobel, S Kumar, R Nishtala
US Patent 9,189,510, 2015
Efficient point-to-point synchronization in UPC
D Bonachea, R Nishtala, P Hargrove, K Yelick
2nd Conf. on Partitioned Global Address Space Programming Models (PGAS06), 2006
Introducing mcrouter: A memcached protocol router for scaling memcached deployments
A Likhtarov, R Nishtala, R McElroy, H Fugal, A Grynenko, ...
Automatically tuning collective communication for one-sided programming models
R Nishtala
University of California, Berkeley, 2009
David Sta ord, Tony Tung, and Venkateshwaran Venkataramani. Scaling Memcache at Facebook
R Nishtala, H Fugal, S Grimm, M Kwiatkowski, H Lee, HC Li, R McElroy, ...
UPC Extended Collective Operations Specification
Z Ryne, S Seidel, PH Hargrove, D Bonachea, R Nishtala
August, 2005
Guest Editorial: Emerging programming paradigms for large-scale scientific computing
L Oliker, R Nishtala, R Biswas
Parallel Computing 37 (9), 499-500, 2011
The system can't perform the operation now. Try again later.
Articles 1–20