publications

publications by category in reverse chronological order. generated by jekyll-scholar.

2026

  1. SPICE: Structured Pruning for Inference on Constrained Edge Devices
    Subhransu Das, Jiaming Cheng, Aniruddha Rakshit, and 3 more authors
    In IEEE Consumer Communications & Networking Conference (CCNC), 2026
  2. Phase-Wise Analysis of LLM Inference Acceleration on GPU, CPU, and Edge Device
    Subhransu Das, Jiaming Cheng, Swathi Vallabhajosyula, and 2 more authors
    In Practice and Experience in Advanced Research Computing (PEARC ’26), 2026
    To appear
  3. An Empirical Survey of AI Model Compression Techniques for Edge Deployments
    Jiaming Cheng and others
    IEEE Internet of Things Journal, 2026
    Under review (Special Issue on Large Model-Driven Intelligent Computing Optimization in the AIoT); full author list to be finalized

2025

  1. EPIC: Efficient Pruning for Inference on Constrained Devices
    Subhransu Das, Jiaming Cheng, Aniruddha Rakshit, and 2 more authors
    In Practice and Experience in Advanced Research Computing (PEARC ’25), 2025