Publications
 
 (2009).  The Manycore Revolution: Will HPC Lead or Follow?.  
 40-49.
 
 (2009).  A View of the Parallel Computing Landscape.  
Communications of the ACM. 52(10), 56-67.
 
 (2009).  A View of the Parallel Computing Landscape.  
Communications of the ACM. 52(10), 56-67.
 
 (2014).  Scalable Multimedia Content Analysis on Parallel Platforms Using Python.  
ACM Transactions on Multimedia Computing. 10(2), 
 
 (2015).  Audio-Based Multimedia Event Detection with DNNs and Sparse Sampling.  
 611-614.
 
 (2016).  A Metaprogramming and Autotuning Framework for Deploying Deep Learning Applications.  
arXiv preprint arXiv:1611.06945.  
 
 (2022).  A Fast Post-Training Pruning Framework for Transformers.  
NeurIPS.  
 
 (2022).  Squeezeformer: An Efficient Transformer for Automatic Speech Recognition.  
NeurIPS.  

 ]
]